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To Allen Gerry and Neil Tame, 
who took the time 

to give a group of kids 

some really cool problems 


There once was a classical theory, 

Of which quantum disciples were leery. 
They said, “Why spend so long 

On a theory that’s wrong?” 

Well, it works for your everyday query! 
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Preface 


This book grew out of Harvard University’s honors freshman mechanics course. 
Itis essentially two books in one. Roughly half of each chapter follows the form of 
anormal textbook, consisting of text, along with exercises suitable for homework 
assignments. The other half takes the form of a “problem book,” with all sorts 
of problems (and solutions) of varying degrees of difficulty. I’ve always thought 
that doing problems is the best way to learn, so if you’ve been searching for a 
supply to puzzle over, I think this will keep you busy for a while. 

This book is somewhat of a quirky one, so let me say right at the start how I 
imagine it being used: 


e As the primary text for honors freshman mechanics courses. My original motivation 
for writing it was the fact that there didn’t exist a suitable book for Harvard’s freshman 
course. So after nine years of using updated versions in the class, here is the finished 
product. 

e As a supplementary text for standard freshman courses for physics majors. Although 
this book starts at the beginning of mechanics and is self contained, it doesn’t spend 
as much time on the introductory material as other freshman books do. I therefore 
don’t recommend using this as the only text for a standard freshman mechanics course. 
However, it will make an extremely useful supplement, both as a problem book for all 
students, and as a more advanced textbook for students who want to dive further into 
certain topics. 

e As a supplementary text for upper-level mechanics courses, or as the primary text which 
is supplemented with another book for additional topics often covered in upper-level 
courses, such as Hamilton’s equations, fluids, chaos, Fourier analysis, electricity and 
magnetism applications, etc. With all of the worked examples and in-depth discussions, 
you really can’t go wrong in pairing up this book with another one. 

e As a problem book for anyone who likes solving physics problems. This audience 
ranges from advanced high-school students, who I think will have a ball with it, to 
undergraduate and graduate students who want some amusing problems to ponder, to 
professors who are looking for a new supply of problems to use in their classes, and 
finally to anyone with a desire to learn about physics by doing problems. If you want, 
you can consider this to be a problem book that also happens to have comprehensive 
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introductions to each topic’s set of problems. With about 250 problems (with included 
solutions) and 350 exercises (without included solutions), in addition to all the examples 
in the text, I think you’ll get your money’s worth! But just in case, I threw in 600 figures, 


50 limericks, nine appearances of the golden ratio, and one cameo of e~”. 


The prerequisites for the book are solid high-school foundations in mechanics 
(no electricity and magnetism required) and single-variable calculus. There are 
two minor exceptions to this. First, a few sections rely on multivariable calcu- 
lus, so I have given a review of this in Appendix B. The bulk of it comes in 
Section 5.3 (which involves the curl), but this section can easily be skipped on 
a first reading. Other than that, there are just some partial derivatives, dot prod- 
ucts, and cross products (all of which are reviewed in Appendix B) sprinkled 
throughout the book. Second, a few sections (4.5, 9.2—9.3, and Appendices D 
and E) rely on matrices and other elementary topics from linear algebra. But a 
basic understanding of matrices should suffice here. 

A brief outline of the book is as follows. Chapter | discusses various problem- 
solving strategies. This material is extremely important, so if you read only one 
chapter in the book, make it this one. You should keep these strategies on the 
tip of your brain as you march through the rest of the book. Chapter 2 covers 
statics. Most of this will likely be familiar, but you’ll find some fun problems. 
In Chapter 3, we learn about forces and how to apply F = ma. There’s a bit of 
math here needed for solving some simple differential equations. Chapter 4 deals 
with oscillations and coupled oscillators. Again, there’s a fair bit of math needed 
for solving linear differential equations, but there’s no way to avoid it. Chapter 5 
deals with conservation of energy and momentum. You’ve probably seen much 
of this before, but it has lots of neat problems. 

In Chapter 6, we introduce the Lagrangian method, which will most likely be 
new to you. It looks rather formidable at first, but it’s really not all that rough. 
There are difficult concepts at the heart of the subject, but the nice thing is that the 
technique is easy to apply. The situation here is analogous to taking a derivative 
in calculus; there are substantive concepts on which the theory rests, but the act 
of taking a derivative is fairly straightforward. 

Chapter 7 deals with central forces and planetary motion. Chapter 8 covers 
the easier type of angular momentum situations, where the direction of the 
angular momentum vector is fixed. Chapter 9 covers the more difficult type, 
where the direction changes. Spinning tops and other perplexing objects fall into 
this category. Chapter 10 deals with accelerating reference frames and fictitious 
forces. 

Chapters 11 through 14 cover relativity. Chapter 11 deals with relativistic 
kinematics — abstract particles flying through space and time. Chapter 12 covers 
relativistic dynamics — energy, momentum, force, etc. Chapter 13 introduces the 
important concept of “4-vectors.” The material in this chapter could alternatively 
be put in the previous two, but for various reasons I thought it best to create a 
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separate chapter for it. Chapter 14 covers a few topics from General Relativity. 
It’s impossible for one chapter to do this subject justice, of course, so we’ll just 
look at some basic (but still very interesting) examples. Finally, the appendices 
cover various useful, but slightly tangential, topics. 

Throughout the book, I have included many “Remarks.” These are written 
in a slightly smaller font than the surrounding text. They begin with a small- 
capital “Remark” and end with a shamrock (æ). The purpose of these remarks is 
to say something that needs to be said, without disrupting the overall flow of the 
argument. In some sense these are “extra” thoughts, although they are invariably 
useful in understanding what is going on. They are usually more informal than 
the rest of the text, and I reserve the right to use them to occasionally babble 
about things that I find interesting, but that you may find tangential. For the most 
part, however, the remarks address issues that arise naturally in the course of the 
discussion. I often make use of “Remarks” at the ends of the solutions to problems, 
where the obvious thing to do is to check limiting cases (this topic is discussed in 
Chapter |). However, in this case, the remarks are not “extra” thoughts, because 
checking limiting cases of your answer is something you should always do. 

For your reading pleasure (I hope!), I have included limericks throughout the 
text. I suppose that these might be viewed as educational, but they certainly don’t 
represent any deep insight I have into the teaching of physics. I have written them 
for the sole purpose of lightening things up. Some are funny. Some are stupid. 
But at least they’re all physically accurate (give or take). 

As mentioned above, this book contains a huge number of problems. The ones 
with included solutions are called “Problems,” and the ones without included 
solutions, which are intended to be used for homework assignments, are called 
“Exercises.” There is no fundamental difference between these two types, except 
for the existence of written-up solutions. I have chosen to include the solutions 
to the problems for two reasons. First, students invariably want extra practice 
problems, with solutions, to work on. And second, I had a thoroughly enjoyable 
time writing them up. But a warning on these problems and exercises: Some are 
easy, but many are very difficult. I think you’ll find them quite interesting, but 
don’t get discouraged if you have trouble solving them. Some are designed to be 
brooded over for hours. Or days, or weeks, or months (as I can attest to!). 

The problems (and exercises) are marked with a number of stars (actually 
asterisks). Harder problems earn more stars, on a scale from zero to four. Of 
course, you may disagree with my judgment of difficulty, but I think that an 
arbitrary weighting scheme is better than none at all. As a rough idea of what I 
mean by the number of stars, one-star problems are solid problems that require 
some thought, and four-star problems are really, really, really hard. Try a few 
and you’ll see what I mean. Even if you understand the material in the text 
backwards and forwards, the four-star (and many of the three-star) problems will 
still be extremely challenging. But that’s how it should be. My goal was to create 
an unreachable upper bound on the number (and difficulty) of problems, because 
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it would be an unfortunate circumstance if you were left twiddling your thumbs, 
having run out of problems to solve. I hope I have succeeded. 

For the problems you choose to work on, be careful not to look at the solution 
too soon. There’s nothing wrong with putting a problem aside for a while and 
coming back to it later. Indeed, this is probably the best way to learn things. If 
you head to the solution at the first sign of not being able to solve a problem, 
then you have wasted the problem. 


REMARK: This gives me an opportunity for my first remark (and first limerick, too). A fact that 
often gets overlooked is that you need to know more than the correct way(s) to do a problem; you 
also need to be familiar with many incorrect ways of doing it. Otherwise, when you come upon 
a new problem, there may be a number of decent-looking approaches to take, and you won’t be 
able to immediately weed out the poor ones. Struggling a bit with a problem invariably leads 
you down some wrong paths, and this is an essential part of learning. To understand something, 
you not only have to know what’s right about the right things; you also have to know what’s 
wrong about the wrong things. Learning takes a serious amount of effort, many wrong turns, 
and a lot of sweat. Alas, there are no shortcuts to understanding physics. 


The ad said, For one little fee, 

You can skip all that course-work ennui. 
So send your tuition, 

For boundless fruition! 

Get your mail-order physics degree! & 


Any book that takes ten years to write is bound to contain the (greatly appreci- 
ated) input of many people. I am particularly thankful for Howard Georgi’s help 
over the years, with his numerous suggestions, ideas for many problems, and 
physics sanity checks. I would also like to thank Don Page for his entertaining 
and meticulous comments and suggestions, and an eye for catching errors in ear- 
lier versions. Other friends and colleagues who have helped make this book what 
it is (and who have made it all the more fun to write) are John Bechhoefer, Wes 
Campbell, Michelle Cyrier, Alex Dahlen, Gary Feldman, Lukasz Fidkowski, 
Jason Gallicchio, Doug Goodale, Bertrand Halperin, Matt Headrick, Jenny 
Hoffman, Paul Horowitz, Alex Johnson, Yevgeny Kats, Can Kilic, Ben Krefetz, 
Daniel Larson, Jaime Lush, Rakhi Mahbubani, Chris Montanaro, Theresa Morin, 
Megha Padi, Dave Patterson, Konstantin Penanen, Courtney Peterson, Mala 
Radhakrishnan, Esteban Real, Daniel Rosenberg, Wolfgang Rueckner, Aqil 
Sajjad, Alexia Schulz, Daniel Sherman, Oleg Shpyrko, David Simmons-Duffin, 
Steve Simon, Joe Swingle, Edwin Taylor, Sam Williams, Alex Wissner-Gross, 
and Eric Zaslow. I’m sure that I have forgotten others, especially from the earlier 
years where my memory fades, so please accept my apologies. 

I am also grateful for the highly professional work done by the editorial and 
production group at Cambridge University Press in transforming this into an 
actual book. It has been a pleasure working with Lindsay Barnes, Simon Capelin, 
Margaret Patterson, and Dawn Preston. 

Finally, and perhaps most importantly, I would like to thank all the students 
(both at Harvard and elsewhere) who provided input during the past decade. 
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The names here are literally too numerous to write down, so let me simply say a 
big thank you, and that I hope other students will enjoy what you helped create. 

Despite the painstaking proofreading and all the eyes that have passed over 
earlier versions, there is at most an exponentially small probability that the 
book is error free. So if something looks amiss, please check the webpage 
(www.cambridge.org/9780521876223) for a list of typos, updates, etc. And 
please let me know if you discover something that isn’t already posted. I’m 
sure that eventually I will post some new problems and supplementary material, 
so be sure to check the webpage for additions. Information for instructors will 
also be available on this site. 


Happy problem solving — I hope you enjoy the book! 
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Chapter 1 
Strategies for solving problems 


Physics involves a great deal of problem solving. Whether you are doing 
cutting-edge research or reading a book on a well-known subject, you are going 
to need to solve some problems. In the latter case (the presently relevant one, 
given what is in your hand right now), it is fairly safe to say that the true test 
of understanding something is the ability to solve problems on it. Reading about 
a topic is often a necessary step in the learning process, but it is by no means 
a sufficient one. The more important step is spending as much time as possible 
solving problems (which is inevitably an active task) beyond the time you spend 
reading (which is generally a more passive task). I have therefore included a very 
large number of problems/exercises in this book. 

However, if I’m going to throw all these problems at you, I should at least give 
you some general strategies for solving them. These strategies are the subject of 
the present chapter. They are things you should always keep in the back of your 
mind when tackling a problem. Of course, they are generally not sufficient by 
themselves; you won’t get too far without understanding the physical concepts 
behind the subject at hand. But when you add these strategies to your physical 
understanding, they can make your life a lot easier. 


1.1 General strategies 


There are a number of general strategies you should invoke without hesitation 
when solving a problem. They are: 


1. Draw a diagram, if appropriate. 


In the diagram, be sure to label clearly all the relevant quantities (forces, lengths, 
masses, etc.). Diagrams are absolutely critical in certain types of problems. For 
example, in problems involving “free-body” diagrams (discussed in Chapter 3) or 
relativistic kinematics (discussed in Chapter 11), drawing a diagram can change a 
hopelessly complicated problem into a near-trivial one. And even in cases where 
diagrams aren’t this crucial, they’re invariably very helpful. A picture is definitely 


worth a thousand words (and even a few more, if you label things!). 
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2. Write down what you know, and what you are trying to find. 


In a simple problem, you may just do this in your head without realizing it. But in 
more difficult problems, it is very useful to explicitly write things out. For example, 
if there are three unknowns that you’re trying to find, but you’ve written down 
only two facts, then you know there must be another fact you’re missing (assuming 
that the problem is in fact solvable), so you can go searching for it. It might be a 


conservation law, or an F = ma equation, etc. 


3. Solve things symbolically. 


If you are solving a problem where the given quantities are specified numerically, 

you should immediately change the numbers to letters and solve the problem in terms 

of the letters. After you obtain an answer in terms of the letters, you can plug in the 
actual numerical values to obtain a numerical answer. There are many advantages 
to using letters: 

e IT’S QUICKER. It’s much easier to multiply a g by an £ by writing them down on a 
piece of paper next to each other, than it is to multiply them together on a calculator. 
And with the latter strategy, you’d undoubtedly have to pick up your calculator at 
least a few times during the course of a problem. 

e YOU'RE LESS LIKELY TO MAKE A MISTAKE. It’s very easy to mistype an 8 for a 9 in 
a calculator, but you’re probably not going to miswrite a q for a g on a piece of 
paper. But if you do, you’ll quickly realize that it should be a g. You certainly 
won’t just give up on the problem and deem it unsolvable because no one gave 
you the value of q! 

e YOU CAN DO THE PROBLEM ONCE AND FOR ALL. If someone comes along and says, 
oops, the value of £ is actually 2.4 m instead of 2.3 m, then you won’t have to do 
the whole problem again. You can simply plug the new value of £ into your final 
symbolic answer. 

e YOU CAN SEE THE GENERAL DEPENDENCE OF YOUR ANSWER ON THE VARIOUS GIVEN QUAN- 
TITIES. For example, you can see that it grows with quantities a and b, decreases with 
c, and doesn’t depend on d. There is much, much more information contained in a 
symbolic answer than in a numerical one. And besides, symbolic answers nearly 
always look nice and pretty. 

e YOU CAN CHECK UNITS AND SPECIAL CASES. These checks go hand-in-hand with the 
previous “general dependence” advantage. But since they’re so important, we’ ll 
postpone their discussion and devote Sections 1.2 and 1.3 to them. 

Having said all this, it should be noted that there are occasionally times when things 

get a bit messy when working with letters. For example, solving a system of three 

equations in three unknowns might be rather cumbersome unless you plug in the 
actual numbers. But in the vast majority of problems, it is highly advantageous to 
work entirely with letters. 


4. Consider units/dimensions. 


This is extremely important. See Section 1.2 for a detailed discussion. 
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5. Check limiting/special cases. 


This is also extremely important. See Section 1.3 for a detailed discussion. 


6. Check order of magnitude if you end up getting a numerical answer. 


If you end up with an actual numerical answer to a problem, be sure to do a san- 
ity check to see if the number is reasonable. If you’ve calculated the distance 
along the ground that a car skids before it comes to rest, and if you’ve gotten 
an answer of a kilometer or a millimeter, then you know you’ve probably done 
something wrong. Errors of this sort often come from forgetting some powers of 
10 (say, when converting kilometers to meters) or from multiplying something 
instead of dividing (although you should be able to catch this by checking your 
units, too). 


You will inevitably encounter problems, physics ones or otherwise, where 
you don’t end up obtaining a rigorous answer, either because the calculation is 
intractable, or because you just don’t feel like doing it. But in these cases it’s 
usually still possible to make an educated guess, to the nearest power of 10. For 
example, if you walk past a building and happen to wonder how many bricks 
are in it, or what the labor cost was in constructing it, then you can probably 
give a reasonable answer without doing any severe computations. The physicist 
Enrico Fermi was known for his ability to estimate things quickly and produce 
order-of-magnitude guesses with only minimal calculation. Hence, a problem 
where the goal is to simply obtain the nearest power-of-10 estimate is known as a 
“Fermi problem.” Of course, sometimes in life you need to know things to better 
accuracy than the nearest power of 10... 


How Fermi could estimate things! 

Like the well-known Olympic ten rings, 
And the one hundred states, 

And weeks with ten dates, 

And birds that all fly with one... wings. 


In the following two sections, we’ll discuss the very important strategies of 
checking units and special cases. Then in Section 1.4 we’ll discuss the technique 
of solving problems numerically, which is what you need to do when you end up 
with a set of equations you can’t figure out how to solve. Section 1.4 isn’t quite 
analogous to Sections 1.2 and 1.3, in that these first two are relevant to basically 
any problem you'll ever do, whereas solving equations numerically is something 
you’ll do only for occasional problems. But it’s nevertheless something that every 
physics student should know. 

In all three of these sections, we’ll invoke various results derived later in the 
book. For the present purposes, the derivations of these results are completely 
irrelevant, so don’t worry at all about the physics behind them — there will be 


Fig. 1.1 
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plenty of opportunity for that later on! The main point here is to learn what to do 
with the result of a problem once you’ve obtained it. 


1.2 Units, dimensional analysis 


The units, or dimensions, of a quantity are the powers of mass, length, and time 
associated with it. For example, the units of a speed are length per time. The 
consideration of units offers two main benefits. First, looking at units before 
you start a problem can tell you roughly what the answer has to look like, up 
to numerical factors. Second, checking units at the end of a calculation (which 
is something you should always do) can tell you if your answer has a chance at 
being correct. It won’t tell you that your answer is definitely correct, but it might 
tell you that your answer is definitely incorrect. For example, if your goal in a 
problem is to find a length, and if you end up with a mass, then you know it’s 
time to look back over your work. 


“Your units are wrong!” cried the teacher. 

“Your church weighs six joules — what a feature! 
And the people inside 

Are four hours wide, 

And eight gauss away from the preacher!” 


In practice, the second of the above two benefits is what you will generally 
make use of. But let’s do a few examples relating to the first benefit, because 
these can be a little more exciting. To solve the three examples below exactly, we 
would need to invoke results derived in later chapters. But let’s just see how far we 
can get by using only dimensional analysis. We’ ll use the “[ ]” notation for units, 
and we’ll let M stand for mass, L for length, and T for time. For example, we’ll 
write a speed as [v] = L/T and the gravitational constant as [G] = L? /(MT?) 
(you can figure this out by noting that Gm jm /r? has the dimensions of force, 
which in turn has dimensions ML/T*, from F = ma). Alternatively, you can just 
use the mks units, kg, m, s, instead of M, L, T, respectively. ! 


Example (Pendulum): A mass m hangs from a massless string of length £ 
(see Fig. 1.1) and swings back and forth in the plane of the paper. The acceleration 
due to gravity is g. What can we say about the frequency of oscillations? 


Solution: The only dimensionful quantities given in the problem are [m] = M, 
[¢] = L, and [g] = L/T?. But there is one more quantity, the maximum angle 6p, 
which is dimensionless (and easy to forget). Our goal is to find the frequency, which 


When you check units at the end of a calculation, you will invariably be working with the kg,m,s 
notation. So that notation will inevitably get used more. But I’ll use the M, L, T notation here, 
because I think it’s a little more instructive. At any rate, just remember that the letter m (or M) 
stands for “meter” in one case, and “mass” in the other. 


1.2 Units, dimensional analysis 


has units of 1/7. The only combination of our given dimensionful quantities that has 
units of 1/T is /g/€. But we can’t rule out any 09 dependence, so the most general 


possible form of the frequency is? 
o=f@ j5, (1.1) 


where f is a dimensionless function of the dimensionless variable 69. 
REMARKS: 


1. It just so happens that for small oscillations, f (69) is essentially equal to 1, so the 
frequency is essentially equal to ./g/€. But there is no way to show this by using 
only dimensional analysis; you actually have to solve the problem for real. For 
larger values of 69, the higher-order terms in the expansion of f become important. 
Exercise 4.23 deals with the leading correction, and the answer turns out to be f (69) = 
1-65 /16+---. 

2. Since there is only one mass in the problem, there is no way that the frequency (with 
units of 1/7) can depend on [m] = M. If it did, there would be nothing to cancel the 
units of mass and produce a pure inverse-time. 

3. We claimed above that the only combination of our given dimensionful quantities 
that has units of 1/T is ./g/€. This is easy to see here, but in more complicated 
problems where the correct combination isn’t so obvious, the following method will 
always work. Write down a general product of the given dimensionful quantities 
raised to arbitrary powers (m7€g° in this problem), and then write out the units of 
this product in terms of a, b, and c. If we want to obtain units of 1/7 here, then 
we need 


L\o 1 
M1? (—) =. 1.2 
(=) T (1.2) 


Matching up the powers of the three kinds of units on each side of this equation gives 
M:a=0, L: b+c=0, T: -—2c=-1. (1.3) 


The solution to this system of equations is a = 0, b = —1/2, and c = 1/2, so we 
have reproduced the ./g/€ result. & 


What can we say about the total energy of the pendulum (with the potential energy 
measured relative to the lowest point)? We’ll talk about energy in Chapter 5, but 
the only thing we need to know here is that energy has units of ML? /T?. The only 
combination of the given dimensionful constants of this form is mg£. But again, we 
can’t rule out any 09 dependence, so the energy must take the form f (69)mg£, where 
f is some function. That’s as far as we can go with dimensional analysis. However, 
if we actually invoke a little physics, we can say that the total energy equals the 
potential energy at the highest point, which is mg£(1 — cos 69). Using the Taylor 
expansion for cos @ (see Appendix A for a discussion of Taylor series), we see that 
f (0) = og /2 — 04/24 + ---. So in contrast with the frequency result above, the 
maximum angle 69 plays a critical role in the energy. 


2 We’ll measure frequency here in radians per second, denoted by w. So we’re actually talking about 
the “angular frequency.” Just divide by 27 (which doesn’t affect the units) to obtain the “regular” 
frequency in cycles per second (hertz), usually denoted by v. We’ll talk at great length about 
oscillations in Chapter 4. 
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Example (Spring): A spring with spring constant k has a mass m on its end 
(see Fig. 1.2). The spring force is F(x) = —kx, where x is the displacement from the 
equilibrium position. What can we say about the frequency of oscillations? 


Solution: The only dimensionful quantities in this problem are [m] = M, [k] = 
M/T 2 (obtained by noting that kx has the dimensions of force), and the maximum 
displacement from the equilibrium, [xg] = L. (There is also the equilibrium length, 
but the force doesn’t depend on this, so there is no way it can come into the answer.) 
Our goal is to find the frequency, which has units of 1/7. The only combination of 
our given dimensionful quantities with these units is 
k 
wo=C,/—, (1.4) 


m 


where C is a dimensionless number. It just so happens that C is equal to 1 (assuming 
that we’re measuring w in radians per second), but there is no way to show this by 
using only dimensional analysis. Note that, in contrast with the pendulum above, the 
frequency cannot have any dependence on the maximum displacement. 

What can we say about the total energy of the spring? Energy has units of ML? /T z 
and the only combination of the given dimensionful constants of this form is Bkxż, 
where B is a dimensionless number. It turns out that B = 1/2, so the total energy 
equals hag /2. 

Remark: A real spring doesn’t have a perfectly parabolic potential (that is, a perfectly 
linear force), so the force actually looks something like F(x) = —kx + bx? +--+. If we 
truncate the series at the second term, then we have one more dimensionful quantity to 
work with, [b] = M/LT?. To form a quantity with the dimensions of frequency, 1/7, we 
need xo and b to appear in the combination xob, because this is the only way to get rid 
of the L. You can then see (by using the strategy of writing out a general product of the 


variables, discussed in the third remark in the pendulum example above) that the frequency 
must be of the form f (xob/k)y k/m, where f is some function. We can therefore have xo 


dependence in this case. This answer must reduce to C. k/m for b = 0. Hence, f must be 
of the form f(y) = C+ ciy + ey? +. & 


Example (Low-orbit satellite): A satellite of mass m travels in a circular orbit 
just above the earth’s surface. What can we say about its speed? 


Solution: The only dimensionful quantities in the problem are [m] = M, [g] = 
L/T?, and the radius of the earth [R] = L.> Our goal is to find the speed, which has 
units of L/T. The only combination of our dimensionful quantities with these units is 


v = C/gr. (1.5) 


It turns out that C = 1. 


3 You might argue that the mass of the earth, Mg, and Newton’s gravitational constant, G, should 


be also included here, because Newton’s gravitational force law for a particle on the surface of the 
earth is F = GMgm/R?. But since this force can be written as m(GMẸ /R?) = mg, we can absorb 
the effects of Mg and G into g. 


1.3 Approximations, limiting cases 


1.3 Approximations, limiting cases 


As with units, the consideration of limiting cases (or perhaps we should say 
special cases) offers two main benefits. First, it can help you get started on a 
problem. If you’re having trouble figuring out how a given system behaves, 
then you can imagine making, for example, a certain length become very large or 
very small, and then you can see what happens to the behavior. Having convinced 
yourself that the length actually affects the system in extreme cases (or perhaps 
you will discover that the length doesn’t affect things at all), it will then be 
easier to understand how it affects the system in general, which will then make 
it easier to write down the relevant quantitative equations (conservation laws, 
F = ma equations, etc.), which will allow you to fully solve the problem. In 
short, modifying the various parameters and observing the effects on the system 
can lead to an enormous amount of information. 

Second, as with checking units, checking limiting cases (or special cases) 
is something you should always do at the end of a calculation. But as with 
checking units, it won’t tell you that your answer is definitely correct, but 
it might tell you that your answer is definitely incorrect. It is generally true 
that your intuition about limiting cases is much better than your intuition 
about generic values of the parameters. You should use this fact to your 
advantage. 

Let’s do a few examples relating to the second benefit. The initial expressions 
given in each example below are taken from various examples throughout the 
book, so just accept them for now. For the most part, I’ll repeat here what I'll 
say later on when we work through the problems for real. A tool that comes up 
often in checking limiting cases is the Taylor series approximations; the series 
for many functions are given in Appendix A. 


Example (Dropped ball): A beach ball is dropped from rest at height A. Assume 
that the drag force from the air takes the form Fg = —mav. We’ll find in Section 3.3 
that the ball’s velocity and position are given by 


o) = -É (1-2), and yin =h—£ (1 2(1-e*")). (1.6) 


a 


These expressions are a bit complicated, so for all you know, I could have made a 
typo in writing them down. Or worse, I could have completely botched the solu- 
tion. So let’s look at some limiting cases. If these limiting cases yield expected 
results, then we can feel a little more confident that the answers are actually 
correct. 

Ift is very small (more precisely, ifat « 1; see the discussion following this exam- 
ple), then we can use the Taylor series, e™* ~ 1 — x +x*/2, to make approximations 
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to leading order in at. The v(t) in Eq. (1.6) becomes 


wont (r-(1-a4 Mt ..)) 


~ gt, (1.7) 


plus terms of higher order in at. This answer is expected, because the drag force is 
negligible at the start, so we essentially have a freely falling body with acceleration 
g downward. For small t, Eq. (1.6) also gives 


E 2 
yt) =h—& [ : ( ( ary P =J) 
a a 2 


2 
gt 

zx h- >, 1.8 
J (1.8) 


plus terms of higher order in vt. Again, this answer is expected, because we essentially 
have a freely falling body at the start, so the distance fallen is the standard gt? [2 
We can also look at large ¢ (or rather, large af). In this case, e7™®* is essentially 


zero, so the u(t) in Eq. (1.6) becomes (there’s no need for a Taylor series in this case) 


vy ~ —2. (1.9) 
a 
This is the “terminal velocity.” Its value makes sense, because it is the velocity for 
which the total force, —mg — mav, vanishes. For large t, Eq. (1.6) also gives 


t 
yOrh- = 4+ (1.10) 


E 
a2 
Apparently for large t, g/ a” is the distance (and this does indeed have units of 
length, because a has units of T7 l because mv has units of force) that our ball lags 
behind another ball that started out already at the terminal velocity, —g/a. 


Whenever you derive approximate answers as we just did, you gain something 
and you lose something. You lose some truth, of course, because your new answer 
is technically not correct. But you gain some aesthetics. Your new answer is 
invariably much cleaner (sometimes involving only one term), and this makes it 
a lot easier to see what’s going on. 

In the above example, it actually makes no sense to look at the limit where 
t is small or large, because ¢ has dimensions. Is a year a large or small time? 
How about a hundredth of a second? There is no way to answer this without 
knowing what problem you’re dealing with. A year is short on the time scale 
of galactic evolution, but a hundredth of a second is long on the time scale of 
a nuclear process. It makes sense only to look at the limit of a small (or large) 
dimensionless quantity. In the above example, this quantity is wt. The given 
constant œ has units of T7!, so 1/a sets a typical time scale for the system. It 
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therefore makes sense to look at the limit where t < 1/qa (that is, at < 1), or 
where t >> 1/« (that is, at >> 1). In the limit of a small dimensionless quantity, 
a Taylor series can be used to expand an answer in powers of the small quantity, 
as we did above. We sometimes get sloppy and say things like, “In the limit 
of small ¢.” But you know that we really mean, “In the limit of some small 
dimensionless quantity that has a ¢ in the numerator,” or, “In the limit where ¢ is 
much smaller that a certain quantity that has the dimensions of time.” 


Remark: As mentioned above, checking special cases tells you that either (1) your answer is 
consistent with your intuition, or (2) it’s wrong. It never tells you that it’s definitely correct. 
This is the same as what happens with the scientific method. In the real world, everything comes 
down to experiment. If you have a theory that you think is correct, then you need to check that 
its predictions are consistent with experiments. The specific experiments you do are the analog 
of the special cases you check after solving a problem; these two things represent what you 
know is true. If the results of the experiments are inconsistent with your theory, then you need 
to go back and fix your theory, just as you would need to go back and fix your answer. If, on 
the other hand, the results are consistent, then although this is good, the only thing it really 
tells you is that your theory might be correct. And considering the way things usually turn out, 
the odds are that it’s not actually correct, but rather the limiting case of a more correct theory 
(just as Newtonian physics is a limiting case of relativistic physics, which is a limiting case of 
quantum field theory, etc.). That’s how physics works. You can’t prove anything, so you learn 
to settle for the things you can’t disprove. 


Consider, when seeking gestalts, 

The theories that physics exalts. 

It’s not that they’re known 

To be written in stone. 

It’s just that we can’t say they’re false. & 


When making approximations, how do you know how many terms in the 
Taylor series to keep? In the example above, we used e~* ~ 1 — x + x?/2. 
But why did we stop at the x* term? The honest (but slightly facetious) answer 
is, “Because I had already done this problem before writing it up, so I knew 
how many terms to keep.” But the more informative (although perhaps no more 
helpful) answer is that before you do the calculation, there’s really no way of 
knowing how many terms to keep. So you should just keep a few and see what 
happens. If everything ends up canceling out, then this tells you that you need to 
repeat the calculation with another term in the series. For example, in Eq. (1.8), 
if we had stopped the Taylor series ate~* ~ 1 — x, then we would have obtained 
y(t) = h—0, which isn’t very useful, since the general goal is to get the leading- 
order behavior in the parameter we’re looking at (which is ¢ here). So in this 
case we’d know we’d have to go back and include the x*/2 term in the series. 
If we were doing a problem in which there was still no ¢ (or whatever variable) 
dependence at that order, then we’d have to go back and include the —x?/6 term 
in the series. Of course, you could just play it safe and keep terms up to, say, 
fifth order. But that’s invariably a poor strategy, because you’ll probably never 
in your life have to go out that far in a series. So just start with one or two terms 
and see what it gives you. Note that in Eq. (1.7), we actually didn’t need the 
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second-order term, so we in fact could have gotten by with only e* ~ 1 — x. 
But having the extra term here didn’t end up causing much heartache. 

After you make an approximation, how do you know if it’s a “good” one? Well, 
just as it makes no sense to ask ifa dimensionful quantity is large or small without 
comparing it to another quantity, it makes no sense to ask if an approximation is 
“good” or “bad” without stating the accuracy you want. In the above example, if 
you’re looking at a t value for which at ~ 1/100, then the term we ignored in 
Eq. (1.7) is smaller than gt by a factor at/2 ~ 1/200. So the error is on the order 
of 1%. If this is enough accuracy for whatever purpose you have in mind, then 
the approximation is a good one. If not, it’s a bad one, and you should add more 
terms in the series until you get your desired accuracy. 

The results of checking limits generally fall into two categories. Most of the 
time you know what the result should be, so this provides a double-check on your 
answer. But sometimes an interesting limit pops up that you might not expect. 
Such is the case in the following examples. 


Example (Two masses in 1-D): A mass m with speed v approaches a stationary 
mass M (see Fig. 1.3). The masses bounce off each other elastically. Assume that 
all motion takes place in one dimension. We’ll find in Section 5.6.1 that the final 
velocities of the particles are 


(m—M)v d 2mv 
= ———_ an = . 
ká m+M ’ vi m+M 


(1.11) 


There are three special cases that beg to be checked: 


e Ifm = M, then Eq. (1.11) tells us that m stops, and M picks up a speed v. This is 
fairly believable (and even more so for pool players). And it becomes quite clear 
once you realize that these final speeds certainly satisfy conservation of energy 
and momentum with the initial conditions. 

e IfM > m, then m bounces backward with speed ~ v, and M hardly moves. 
This makes sense, because M is basically a brick wall. 

e Ifm > M, then m keeps plowing along at speed ~ v, and M picks up a speed 
of © 2v. This 2v is an unexpected and interesting result (it’s easier to see if you 
consider what’s happening in the reference frame of the heavy mass m), and it 
leads to some neat effects, as in Problem 5.23. 


Example (Circular pendulum): A mass hangs froma massless string of length £. 
Conditions have been set up so that the mass swings around in a horizontal circle, 
with the string making a constant angle 0 with the vertical (see Fig. 1.4). We’ll find 
in Section 3.5 that the angular frequency, w, of this motion is 


| & 
= . 1.12 
e £cosé ( ) 


1.4 Solving differential equations numerically 


As far as 0 is concerned, there are two limits we should definitely check: 


e If6 — 90°, then œw —> oo. This makes sense; the mass has to spin very quickly 
to avoid flopping down. 

e If 6 — 0, then w > ./g/£, which is the same as the frequency of a standard 
“plane” pendulum of length £ (for small oscillations). This is a cool result and 
not at all obvious. (But once we get to F = ma in Chapter 3, you can convince 
yourself why this is true by looking at the projection of the force on a given 


horizontal line.) 


In the above examples, we checked limiting and special cases of answers 
that were correct (I hope!). This whole process is more useful (and a bit more 
fun, actually) when you check the limits of an answer that is incorrect. In this 
case, you gain the unequivocal information that your answer is wrong. But 
rather than leading you into despair, this information is in fact something you 
should be quite happy about, considering that the alternative is to carry on in 
a state of blissful ignorance. Once you know that your answer is wrong, you 
can go back through your work and figure out where the error is (perhaps by 
checking limits at various stages to narrow down where the error could be). 
Personally, if there’s any way I’d like to discover that my answer is garbage, 
this is it. At any rate, checking limiting cases can often save you a lot of trouble 
in the long run... 


The lemmings get set for their race. 
With one step and two steps they pace. 
They take three and four, 

And then head on for more, 

Without checking the limiting case. 


1.4 Solving differential equations numerically 


Solving a physics problem often involves solving a differential equation. 
A differential equation is one that involves derivatives (usually with respect to 
time, in our physics problems) of the variable you’re trying to solve for. The 
differential equation invariably comes about from using F = ma, and/or t = Ja, 
or the Lagrangian technique we’ll discuss in Chapter 6. For example, consider a 
falling body. F = ma gives —mg = ma, which can be written as —g = y, where 
a dot denotes a time derivative. This is a rather simple differential equation, and 
you can quickly guess that y(t) = —gt?/2 is a solution. Or, more generally with 
the constants of integration thrown in, y(t) = yo + vot — gt? /2. 

However, the differential equations produced in some problems can get rather 
complicated, so sooner or later you will encounter one that you can’t solve exactly 
(either because it’s in fact impossible to solve, or because you can’t think of the 


11 


12 


Strategies for solving problems 


appropriate clever trick). Having resigned yourself to not getting the exact answer, 
you should ponder how to obtain a decent approximation to it. Fortunately, it’s 
easy to write a short program that will give you a very good numerical answer to 
your problem. Given enough computer time, you can obtain any desired accuracy 
(assuming that the system isn’t chaotic, but we won’t have to worry about this 
for the systems we’ll be dealing with). 

We’ll demonstrate the procedure by considering a standard problem, one 
that we’ll solve exactly and in great depth in Chapter 4. Consider the 
equation, 


š = —wx. (1.13) 


This is the equation for a mass on a spring, with œ = /k/m. We’ll find in 
Chapter 4 that the solution can be written, among other ways, as 


x(t) = Acos(wt + $). (1.14) 


But let’s pretend we don’t know this. If someone comes along and gives us the 
values of x(0) and x(0), then it seems that somehow we should be able to find 
x(t) and x(t) for any later ¢, just by using Eq. (1.13). Basically, if we’re told how 
the system starts, and if we know how it evolves, via Eq. (1.13), then we should 
know everything about it. So here’s how we find x(t) and x(t). 
The plan is to discretize time into intervals of some small unit (call it €), and 
to then determine what happens at each successive point in time. If we know x(t) 
and x(t), then we can easily find (approximately) the value of x at a slightly later 
time, by using the definition of x. Similarly, if we know x(t) and x(t), then we 
can easily find (approximately) the value of x at a slightly later time, by using the 
definition of x. Using the definitions of the derivatives, the relations are simply 
x(t +€) © x(t) + ex(t), 
(1.15) 
X(t + €) ~ x(t) + €x(t). 
These two equations, combined with (1.13), which gives us ¥ in terms of x, allow 
us to march along in time, obtaining successive values for x, x, and x4 
Here’s what a typical program might look like.’ (This is a Maple program, but 
even if you aren’t familiar with this, the general idea should be clear.) Let’s say 


4 Of course, another expression for x is the definitional one, analogous to Eqs. (1.15), involving 
the third derivative. But this would then require knowledge of the third derivative, and so on with 
higher derivatives, and we would end up with an infinite chain of relations. An equation of motion 
such as Eq. (1.13) (which in general could be an F = ma, t = Ja, or Euler-Lagrange equation) 
relates ¥ back to x (and possibly x), thereby creating an intertwined relation among x, x, and ¥, and 
eliminating the need for an infinite and useless chain. 

We’ve written the program in the most straightforward way, without any concern for efficiency, 
because computing time isn’t an issue in this simple system. But in more complex systems that 
require programs for which computing time is an issue, a major part of the problem-solving process 
is developing a program that is as efficient as possible. 
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that the particle starts from rest at position x = 2, and let’s pick œ? = 5. We’ll 
use the notation where x1 stands for x, and x2 stands for x. And e stands for e. 
Let’s calculate x at, say, t = 3. 


X:=2 # initial position 

x1:=0: # initial velocity 

e:=.01: # small time interval 

for i to 300 do # do 300 steps (ie, up to 3 seconds) 
¥2:=-5*x: # the given equation 

x:=x+e*x1: # how x changes, by definition of xl 
x1l:=xlte*x2: # how x1 changes, by definition of x2 
end do: # the Maple command to stop the do loop 
x; # print the value of x 


This procedure won’t give the exact value for x, because x and x don’t really 
change according to Eqs. (1.15). These equations are just first-order approxi- 
mations to the full Taylor series with higher-order terms. Said differently, there 
is no way the above procedure can be exactly correct, because there are ambi- 
guities in how the program can be written. Should line 5 come before or after 
line 7? That is, in determining x at time ¢ + €, should we use the x at time ¢ or 
t + €? And should line 7 come before or after line 6? The point is that for very 
small e, the order doesn’t matter much. And in the limit € — 0, the order doesn’t 
matter at all. 

If we want to obtain a better approximation, we can just shorten € down to 
0.001 and increase the number of steps to 3000. If the result looks basically the 
same as with € = 0.01, then we know we pretty much have the right answer. In the 
present example, € = 0.01 yields x ~ 1.965 after 3 seconds. If we set € = 0.001, 
then we obtain x + 1.836. And if we set € = 0.0001, then we get x ~ 1.823. 
The correct answer must therefore be somewhere around x = 1.82. And indeed, 
if we solve the problem exactly, we obtain x(t) = 2 cos(4/5 t). Plugging in t = 3 
gives x © 1.822. 

This is a wonderful procedure, but it shouldn’t be abused. It’s nice to know that 
we can always obtain a decent numerical approximation if all else fails. But we 
should set our initial goal on obtaining the correct algebraic expression, because 
this allows us to see the overall behavior of the system. And besides, nothing 
beats the truth. People tend to rely a bit too much on computers and calculators 
nowadays, without pausing to think about what is actually going on ina problem. 


The skill to do math on a page 

Has declined to the point of outrage. 
Equations quadratica 

Are solved on Math’ matica, 

And on birthdays we don’t know our age. 
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1.5 Problems 


Section 1.2: Units, dimensional analysis 


1.1. 


1.2; 


1.3. 


1.4. 


1.5. 


Escape velocity « 


As given below in Exercise 1.9, show that the escape velocity from the 
earth is v = ./2GMg/R, up to numerical factors. You can use the fact 
that the form of Newton’s gravitation force law implies that the accel- 
eration (and hence overall motion) of the particle doesn’t depend on 
its mass. 


Mass in a tube « 


A tube of mass M and length £ is free to swing around a pivot at one end. 
A mass m is positioned inside the (frictionless) tube at this end. The tube 
is held horizontal and then released (see Fig. 1.5). Let 7 be the fraction of 
the tube that the mass has traversed by the time the tube becomes vertical. 
Does n depend on £? 


Waves in a fluid x 


How does the speed of waves in a fluid depend on its density, p, and 
“bulk modulus,” B (which has units of pressure, which is force per area)? 


Vibrating star x 


Consider a vibrating star, whose frequency v depends (at most) on its 
radius R, mass density p, and Newton’s gravitational constant G. How 
does v depend on R, p, and G? 


Damping +x 


A particle with mass m and initial speed V is subject to a velocity- 
dependent damping force of the form bv”. 


(a) Forn = 0,1,2,..., determine how the stopping time depends on 
m, V, and b. 

(b) Forn = 0,1,2,..., determine how the stopping distance depends 
on m, V, and b. 


Be careful! See if your answers make sense. Dimensional analysis gives 
the answer only up to a numerical factor. This is a tricky problem, so don’t 
let it discourage you from using dimensional analysis. Most applications 
of dimensional analysis are quite straightforward. 


Section 1.3: Approximations, limiting cases 


1.6. 


Projectile distance x 


A person throws a ball (at an angle of her choosing, to achieve the maxi- 
mum distance) with speed v from the edge of a cliff of height A. Assuming 


1.6 Exercises 


that one of the following quantities is the maximum horizontal distance 
the ball can travel, which one is it? (Don’t solve the problem from scratch, 
just check special cases.) 


ho x? vh v 2gh v? 2gh v? 
i > > 1+ (1+ OF a 
v Ë gz g v g v lL 


Section 1.4: Solving differential equations numerically 


1.7. Two masses, one swinging ++ 


Two equal masses are connected by a string that hangs over two pulleys 
(of negligible size), as shown in Fig. 1.6. The left mass moves in a vertical 
line, but the right mass is free to swing back and forth in the plane of the 
masses and pulleys. It can be shown (see Problem 6.4) that the equations 
of motion for r and @ (labeled in the figure) are 


27 = ro? — g(1 — cos0), 
jn 276 gsin0 (1.16) 
~ a a i 


Assume that both masses start out at rest, with the right mass making an 
initial angle of 10° = 2/18 with the vertical. If the initial value of r is 
1m, how much time does it take for it to reach a length of 2m? Write a 
program to solve this numerically. Use g = 9.8 m/s. 


1.6 Exercises 


Section 1.2: Units, dimensional analysis 


1.8. Pendulum on the moon 


If a pendulum has a period of 3 s on the earth, what would its period be 
if it were placed on the moon? Use gy /ge © 1/6. 


1.9. Escape velocity « 


The escape velocity on the surface of a planet is given by 


[2GM 
= ,/ — 1.17 
v R” (1.17) 


where M and R are the mass and radius of the planet, respectively, and G 
is Newton’s gravitational constant. (The escape velocity is the velocity 
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1.10. 


1.12. 


Section 


1.13. 


1.14. 


needed to refute the “What goes up must come down” maxim, neglecting 
air resistance.) 


(a) Write v in terms of the average mass density p, instead of M. 

(b) Assuming that the average density of the earth is four times that of 
Jupiter, and that the radius of Jupiter is 11 times that of the earth, 
what is vy /vg? 


Downhill projectile « 


A hill is sloped downward at an angle @ with respect to the horizontal. 
A projectile of mass m is fired with speed vp perpendicular to the hill. 
When it eventually lands on the hill, let its velocity make an angle 6 with 
respect to the horizontal. Which of the quantities 6, m, vo, and g does 
the angle 6 depend on? 


. Waves on a string + 


How does the speed of waves on a string depend on its mass M, length 
L, and tension (that is, force) T? 


Vibrating water drop « 


Consider a vibrating water drop, whose frequency v depends on its radius 
R, mass density p, and surface tension S. The units of surface tension 
are (force)/(length). How does v depend on R, p, and S? 


1.3: Approximations, limiting cases 


Atwood’s machine « 


Consider the “Atwood’s” machine shown in Fig. 1.7, consisting of three 
masses and three frictionless pulleys. It can be shown that the acceleration 
of m is given by (just accept this): 


3m2m3 — mı (4m3 + m2) (1.18) 


ay = 


mm3 + mı (4m3 +m) ° 


with upward taken to be positive. Find a; in the following special 
cases: 

(a) m2 = 2m = 2m3. 

(b) mı much larger than both mz and m3. 

(c) mı much smaller than both mz and m3. 

(d) m > m, = m3. 

(e) mı = m = m3. 


Cone frustum « 


A cone frustum has base radius b, top radius a, and height h, as shown 
in Fig. 1.8. Assuming that one of the following quantities is the volume 
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of the frustum, which one is it? (Don’t solve the problem from scratch, 
just check special cases.) 


h h h 
TP, T@+), TP tabt bò), 


th at +b 
3 pa + Be 7 x hab 
1.15. Landing at the corner « Fig. 1.8 
A ball is thrown at an angle 0 up to the top of a cliff of height L, from 
a point a distance L from the base, as shown in Fig. 1.9. Assuming that ON 
one of the following quantities is the initial speed required to make the ra 3 
ball hit right at the edge of the cliff, which one is it? (Don’t solve the i 
problem from scratch, just check special cases.) Fi 


V, 
gL 1 gL 1 gL f : 
2(tan0 — 1)’ cos@\ 2(tan6 —1)’ cos@\ 2(tan0 + 1)’ 
gL tand Fig. 1.9 
2(tan@ + 1)” 


1.16. Projectile with drag ++ 


Consider a projectile subject to a drag force F = —mav. If it is fired 
with speed vp at an angle 6, it can be shown that the height as a func- 
tion of time is given by (just accept this here; it’s one of the tasks of 
Exercise 3.53) 


y(t) = L (vo sin + =) (1 oe) a (1.19) 


Show that this reduces to the usual projectile expression, y(t) = 
(vo sin @)t — gt*/2, in the limit of small œ. What exactly is meant by 
“small æ”? 


Section 1.4: Solving differential equations numerically 


1.17. Pendulum xx 


A pendulum of length £ is released from the horizontal position. It can 
be shown that the tangential F = ma equation is (where 6 is measured 
with respect to the vertical) 


gsind 
7 


Öö = (1.20) 


If £ = 1m, and g = 9.8m/s?, write a program to show that the time 
it takes the pendulum to swing down through the vertical position is 
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t © 0.592. This happens to be about 1.18 times the (1 /2)./€/g ~ 
0.502 s it would take the pendulum to swing down if it were released 
from very close to the vertical (this is 1/4 of the standard period of 
2n./t/g for a pendulum). It also happens to be about 1.31 times the 
/2€/¢ ~ 0.4528 it would take a mass to simply freefall a height £. 


1.18. Distance with damping «« 


1.7 


1.1. 


1.2. 


A mass is subject to a damping force proportional to its velocity, which 
means that the equation of motion takes the form x = —Ax, where A is 
some constant. If the initial speed is 2 m/s, and if A = 1 s7}, how far has 
the mass traveled at 1 s? 10s? 100s? You should find that the distance 
approaches a limiting value. 

Now assume that the mass is subject to a damping force proportional 
to the square of its velocity, which means that the equation of motion 
now takes the form ¥ = —Ax?, where A is some constant. If the initial 
speed is 2m/s, and if 4 = 1 m~!, how far has the mass traveled at 1 s? 
10s? 100s? How about some larger powers of 10? You should find that 
the distance keeps growing, but slowly like the log of t. (The results 
for these two forms of the damping are consistent with the results of 
Problem 1.5.) 


Solutions 


Escape velocity 

It is tempting to use the same reasoning as in the low-orbit satellite example in Section 
1.2. This reasoning gives the same result, v = C./gR = C./GMg/R, where C is some 
number (it turns out that C = Jf 2). Although this solution yields the correct answer, it 
isn’t quite rigorous, in view of the footnote in the low-orbit satellite example. Because 
the particle isn’t always at the same radius, the force changes, so it isn’t obvious that we 
can absorb the Mg and G dependence into one quantity, g, as we did with the orbiting 
satellite. Let us therefore be more rigorous with the following reasoning. 

The dimensionful quantities in the problem are [m] = M, the radius of the earth 
[R] = L, the mass of the earth [Mg] = M, and Newton’s gravitational con- 
stant [G] = L3/MT?. These units for G follow from the gravitational force law, 
F = Gmm2/ 72. If we use no information other than these given quantities, then there 
is no way to arrive at the speed of C.\/GMg/R, because for all we know, there could be a 
factor of (m /Mp)! in the answer. This number is dimensionless, so it wouldn’t mess up 
the units. 

If we want to make any progress in this problem, we have to use the fact that the 
gravitational force takes the form of GMgm/r?. This then implies (as was stated in the 
problem) that the acceleration is independent of m. And since the path of the particle is 
determined by its acceleration, we see that the answer can’t depend on m. We are there- 
fore left with the quantities G, R, and Mg, and you can show that the only combination 
of these quantities that gives the units of speed is v = C./GMg/R. 


Mass in a tube 

The dimensionful quantities are [g] = L/T?, [4] = L, [m] = M, and [M] = M. We 
want to produce a dimensionless number 7). Since g is the only constant involving time, 
n cannot depend on g. This then implies that n cannot depend on £, which is the only 
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1.7 Solutions 


length remaining. Therefore, n depends only on m and M (and furthermore only on the 
ratio m/M, since we want a dimensionless number). So the answer to the stated problem 
is, “No.” 

It turns out that you have to solve the problem numerically if you actually want to 
find 7 (see Problem 8.5). Some results are: Ifm « M, then n ~ 0.349. Ifm = M, then 
n © 0.378. And if m = 2M, then n © 0.410. 


Waves in a fluid 

We want to make a speed, [v] = L/T, out of the quantities [0] = M /L?, and [B] = 
[F/A] = (ML/T?)/(L*) = M/(LT?). We can play around with these quantities to find 
the combination that has the correct units, but let’s do it the no-fail way. If v œ p"B ; 
then we have 


r- (2) (zr) aay 
T L LT?) ` f 
Matching up the powers of the three kinds of units on each side of this equation gives 
M:0=a+b, L:1=-3a—6b6, T: —1 = -—2b. (1.22) 
The solution to this system of equations is a = —1/2 and b = 1/2. Therefore, our 


answer is v x ./B/p. Fortunately, there was a solution to this system of three equations 
in two unknowns. 


Vibrating star 

We want to make a frequency, [v] = 1/T, out of the quantities [R] = L, [p] = M/L, 
and [G] = L? /(MT?). These units for G follow from the gravitational force law, F = 
Gmım2/r°. As in the previous problem, we can play around with these quantities to find 
the combination that has the correct units, but let’s do it the no-fail way. If v œ R“p°G°, 
then we have 


l -r (%4) ( r ) (1.23) 

T L MT? j} ` ` 
Matching up the powers of the three kinds of units on each side of this equation gives 
M:0=b-c, L: 0=a—3b+3c, T: -l=-—2c. (1.24) 


The solution to this system of equations is a = 0, and b = c = 1/2. Therefore, our 
answer is v x ./pG. So it turns out that there is no R dependence. 


REMARK: Note the difference in the given quantities in this problem (R, p, and G) 
and the ones in Exercise 1.12 (R, p, and S). In this problem with the star, the mass 
is large enough so that we can ignore the surface tension, S. And in Exercise 1.12 
with the drop, the mass is small enough so that we can ignore the gravitational force, 
and hence G. & 


Damping 


(a) The constant b has units [b] = [Force][v™”] = (ML/T?)(T"/L"). The other 
quantities are [m] = M and [V] = L/T. There is also n, which is dimen- 
sionless. You can show that the only combination of these quantities that has 


units of T is 
m 


t=f(n) pyre’ 
where f (n) is a dimensionless function of n. 

Forn = 0, we have t = f (0) mV /b. This increases with m and V, and decreases 
with b, as it should. 

Forn = 1, we have t = f (1) m/b. So we seem to have t ~ m/b. This, however, 
cannot be correct, because ¢ should definitely grow with V. A large initial speed 
Vı requires some nonzero time to slow down to a smaller speed V2, after which 
time we simply have the same scenario with initial speed V2. So where did we go 
wrong? After all, dimensional analysis tells us that the answer does have to look 
like t = f (1) m/b, where f (1) is a numerical factor. The resolution to this puzzle 


(1.25) 
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is that f (1) is infinite. If we worked out the problem using F = ma, we would 
encounter an integral that diverges. So for any V, we would find an infinite t. ° 

Similarly, for n > 2, there is at least one power of V in the denominator of t. 
This certainly cannot be correct, because ¢ should not decrease with V. So f (n) 
must likewise be infinite for all of these cases. 

The moral of this exercise is that sometimes you have to be careful when 
using dimensional analysis. The numerical factor in front of your answer 
nearly always turns out to be of order 1, but in some strange cases it turns 
out to be 0 or oo. 


Remark: Forn > 1, the expression in Eq. (1.25) still has relevance. For example, 
for n = 2, the m/(Vb) expression is relevant if you want to know how long it 
takes to go from V to some final speed Vf. The answer involves m/(V¢b), which 
diverges as Vp > 0. & 


(b) You can show that the only combination of the quantities that has units of L is 
m 
byn-2’ 

where g(n) is a dimensionless function of n. 

For n = 0, we have £ = g(0) mV? /b. This increases with V, as it should. 

For n = 1, we have £ = g(1) mV /b. This increases with V, as it should. 

For n = 2 we have £ = g(2)m/b. So we seem to have £ ~ m/b. But as in 
part (a), this cannot be correct, because £ should definitely depend on V. A large 
initial speed V; requires some nonzero distance to slow down to a smaller speed 
V2, after which point we simply have the same scenario with initial speed V2. So, 
from the reasoning in part (a), the total distance is infinite for n > 2, because the 
function g is infinite. 


t= gin) (1.26) 


REMARK: Note that for integral n # 1, t and £ are either both finite or both 
infinite. For n = 1, however, the total time is infinite, whereas the total distance 
is finite. This situation actually holds for 1 < n < 2, if we want to consider 
fractional n. & 


1.6. Projectile distance 


All of the possible answers have the correct units, so we’ll have to figure things out by 
looking at special cases. Let’s look at each choice in turn: 


h? 
ae : Incorrect, because the answer shouldn’t be zero for h = 0. Also, it shouldn’t 
v 


grow with g. And even worse, it shouldn’t be infinite for v > 0. 


— : Incorrect, because the answer should depend on A. 
g 


wh 
— : Incorrect, because the answer shouldn’t be zero for h = 0. 
E 
v? 2gh N : i 
— 1+ ~3 : Can’trule this out, and it happens to be the correct answer. 
g v 
v? 2gh , 
—|1+ =a) Incorrect, because the answer should be zero for v —> 0. But this 
g v 
expression goes to 2h for v > 0. 
2 
: 7 z | Incorrect, because the answer shouldn’t be infinite for v? = 2gh. 
yt 


6 The total time f is actually undefined, because the particle never comes to rest. But £ does grow 
with V, in the sense that if ¢ is defined to be the time to slow down to some certain small speed, 
then ź grows with V. 


1.7. Two masses, one swinging 


1.7 Solutions 


As in Section 1.4, we'll write a Maple program. We’ll let q stand for 0, and we’ll use 
the notation where q1 stands for 6, and q2 stands for 0. Likewise for r. We’ll run the 
program for as long as r < 2. As soon as r exceeds 2, the program will stop and print 


the value of the time. 


r:=1: 

EL: $ 
q:=3.14/18: 
ql:=0: 
e:=.001: 
i:=0: 

while r<2 do 
i:=i+1: 


r2:=(r*ql*2-9.8* (1-cos(q)))/2: 


ri=rte*rl: 

rl:=rl+e*r2: 
q2:=-2*r1l*ql/r-9.8*sin(q)/r: 
q:=qte*ql: 

ql:=qlt+e*q2: 

end do: 


ite; 


initial r value 


initial r velocity 

initial angle 

initial angular velocity 

small time interval 

i counts the number of time steps 


run 


the program until r=2 


increase the counter by 1 


the 
how 
how 
the 
how 
how 
the 


first of the given eqs 

r changes, by definition of rl 

rl changes, by definition of r2 
second of the given eqs 

q changes, by definition of ql 

ql changes, by definition of q2 
Maple command to stop the do loop 


print the value of the time 


This yields a time oft = 8.057 s. If we instead use a time interval of 0.0001 s, we obtain 
t = 8.1377 s. And a time interval of 0.00001 s gives t = 8.14591 s. So the correct time 


must be somewhere around 8.15 s. 
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Chapter 2 
Statics 


The subject of statics often appears in later chapters in other books, after force 
and torque have been discussed. However, the way that force and torque are 
used in statics problems is fairly minimal, at least compared with what we’ll 
be doing later in this book. Therefore, since we won’t be needing much of the 
machinery that we’ll be developing later on, Pll introduce here the bare min- 
imum of force and torque concepts necessary for statics problems. This will 
open up a whole class of problems for us. But even though the underlying 
principles of statics are quick to state, statics problems can be unexpect- 
edly tricky. So be sure to tackle a lot of them to make sure you understand 
things. 


2.1 Balancing forces 


A “static” setup is one where all the objects are motionless. If an object remains 
motionless, then Newton’s second law, F = ma (which we’ll discuss in great 
detail in the next chapter), tells us that the total external force acting on the object 
must be zero. The converse is not true, of course. The total external force on an 
object is also zero if it moves with constant nonzero velocity. But we’ll deal only 
with statics problems here. The whole goal in a statics problem is to find out what 
the various forces have to be so that there is zero net force acting on each object 
(and zero net torque, too, but that’s the topic of Section 2.2). Because a force is 
a vector, this goal involves breaking the force up into its components. You can 
pick Cartesian coordinates, polar coordinates, or another set. It is usually clear 
from the problem which system will make your calculations easiest. Once you 
pick a system, you simply have to demand that the total external force in each 
direction is zero. 

There are many different types of forces in the world, most of which are large- 
scale effects of complicated things going on at smaller scales. For example, the 
tension in a rope comes from the chemical bonds that hold the molecules in 
the rope together, and these chemical forces are electrical forces. In doing a 
mechanics problem involving a rope, there is certainly no need to analyze all the 
details of the forces taking place at the molecular scale. You just call the force in 
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the rope a “tension” and get on with the problem. Four types of forces come up 
repeatedly: 


Tension 

Tension is the general name for a force that a rope, stick, etc., exerts when it 
is pulled on. Every piece of the rope feels a tension force in both directions, 
except the end points, which feel a tension on one side and a force on the other 
side from whatever object is attached to the end. In some cases, the tension may 
vary along the rope. The “Rope wrapped around a pole” example at the end of 
this section is a good illustration of this. In other cases, the tension must be the 
same everywhere. For example, in a hanging massless rope, or in a massless 
rope hanging over a frictionless pulley, the tension must be the same at all points, 
because otherwise there would be a net force on at least some part of the rope, 
and then F = ma would yield an infinite acceleration for this (massless) piece. 


Normal force 

This is the force perpendicular to a surface that the surface applies to an object. 
The total force applied by a surface is usually a combination of the normal force 
and the friction force (see below). But for frictionless surfaces such as greasy 
ones or ice, only the normal force exists. The normal force comes about because 
the surface actually compresses a tiny bit and acts like a very rigid spring. The 
surface gets squashed until the restoring force equals the force necessary to keep 
the object from squashing in any more. 

For the most part, the only difference between a “tension” and a “normal 
force” is the direction of the force. Both situations can be modeled by a spring. 
In the case of a tension, the spring (a rope, a stick, or whatever) is stretched, 
and the force on the given object is directed toward the spring. In the case of 
a normal force, the spring is compressed, and the force on the given object is 
directed away from the spring. Things like sticks can provide both normal forces 
and tensions. But a rope, for example, has a hard time providing a normal force. 
In practice, in the case of elongated objects such as sticks, a compressive force 
is usually called a “compressive tension,” or a “negative tension,” instead of a 
normal force. So by these definitions, a tension can point either way. At any rate, 
it’s just semantics. If you use any of these descriptions for a compressed stick, 
people will know what you mean. 


Friction 

Friction is the force parallel to a surface that a surface applies to an object. 
Some surfaces, such as sandpaper, have a great deal of friction. Some, such as 
greasy ones, have essentially no friction. There are two types of friction, called 
“kinetic” friction and “static” friction. Kinetic friction (which we won’t cover in 
this chapter) deals with two objects moving relative to each other. It is usually 
a good approximation to say that the kinetic friction between two objects is 
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proportional to the normal force between them. The constant of proportionality 
is called ux (the “coefficient of kinetic friction”), where 4, depends on the two 
surfaces involved. Thus, F = uN, where N is the normal force. The direction 
of the force is opposite to the motion. 

Static friction deals with two objects at rest relative to each other. In the static 
case, we have F < usN (where us is the “coefficient of static friction”). Note 
the inequality sign. All we can say prior to solving a problem is that the static 
friction force has a maximum value equal to Fmax = MsN. In a given problem, 
it is most likely less than this. For example, if a block of large mass M sits on a 
surface with coefficient of friction us, and you give the block a tiny push to the 
right (tiny enough so that it doesn’t move), then the friction force is of course not 
equal to u,N = usMg to the left. Such a force would send the block sailing off 
to the left. The true friction force is simply equal and opposite to the tiny force 
you apply. What the coefficient jz, tells us is that if you apply a force larger than 
usMg (the maximum friction force on a horizontal table), then the block will end 
up moving to the right. 


Gravity 

Consider two point objects, with masses M and m, separated by a distance R. 
Newton’s gravitational force law says that the force between these objects is 
attractive and has magnitude F = GMm/R?, where G = 6.67-107!! m3/(kgs?). 
As we’ll show in Chapter 5, the same law also applies to spheres of nonzero size. 
That is, a sphere may be treated like a point mass located at its center. Therefore, 
an object on the surface of the earth feels a gravitational force equal to 


GM 
F=m (=) = mg, (2.1) 


where M is the mass of the earth, and R is its radius. This equation defines g. 
Plugging in the numerical values, we obtain g ~ 9.8 m/s, as you can check. 
Every object on the surface of the earth feels a force of mg downward (g varies 
slightly over the surface of the earth, but let’s ignore this). If the object is not 
accelerating, then there must be other forces present (normal forces, etc.) to make 
the total force be equal to zero. 

Another common force is the Hooke’s-law spring force, F = —kx. But we’ll 
postpone the discussion of springs until Chapter 4, where we’ll spend a whole 
chapter on them in depth. 


Example (Block ona plane): A block of mass M rests on a fixed plane inclined 
at an angle 6. You apply a horizontal force of Mg on the block, as shown in Fig. 2.1. 
Assume that the friction force between the block and the plane is large enough to keep 
the block at rest. What are the normal and friction forces (call them N and Ff) that 
the plane exerts on the block? If the coefficient of static friction is u, for what range 
of angles 6 will the block in fact remain at rest? 
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Solution: Let’s break the forces up into components parallel and perpendicular 
to the plane. (The horizontal and vertical components would also work, but the 
calculation would be a little longer.) The forces are N, Fr, the applied Mg, and 
the weight Mg, as shown in Fig. 2.2. Balancing the forces parallel and perpen- 
dicular to the plane gives, respectively (with upward along the plane taken to be 
positive), 


Ff = Mg sin 0 — Mg cos0, 
(2.2) 
N = Mgcosé + Mg sing. 


INTERMEDIATE REMARKS: 


1. Iftan@ > 1, then Ff is positive (that is, it points up the plane). Andiftan@ < 1, then 
Fp is negative (that is, it points down the plane). There is no need to worry about 
which way it points when drawing the diagram. Just pick a direction to be positive, 
and if F comes out to be negative (as it does in the figure above, because 8 < 45°), 
then it actually points in the other direction. 

2. Fr ranges from —Mg to Mg as 6 ranges from 0 to 2/2 (convince yourself that these 
limiting values make sense). As an exercise, you can show that N is maximum when 
tan = 1, in which case N = J/2Mg and Fẹ = 0. 

3. The sin@ and cos@ factors in Eq. (2.2) follow from the angles 0 drawn in Fig. 2.2. 
However, when solving problems like this one, it’s easy to make a mistake in the 
geometry and then label an angle as @ when it really should be 90° — @. So two 
pieces of advice: (1) Never draw an angle close to 45° in a figure, because if you do, 
you won’t be able to tell the @ angles from the 90° — 6 ones. (2) Always check your 
results by letting 6 go to 0 or 90° (in other words, does virtually all of a force, or 
virtually none of it, act in a certain direction when the plane is, say, horizontal). Once 
you do this a few times, you’ ll realize that you probably don’t even need to work out 
the geometry in the first place. Since you know that any given component is going 
to involve either sin@ or cos@, you can just pick the one that works correctly in a 
certain limit. & 


The coefficient jz tells us that |Ff| < uN. Using Eq. (2.2), this inequality becomes 
Mg|sin@ — cos6| < uMg(cosé + sin 8). (2.3) 


The absolute value here signifies that we must consider two cases: 


e Iftané@ > 1, then Eq. (2.3) becomes 


1 
sind — cos < u(cosð +sinf) => tan < = (2.4) 
We divided by 1 — m, so this inequality is valid only if u < 1. But if u > 1, we 
see from the first inequality here that any value of 0 (subject to our assumption, 
tan@ > 1) works. 
e Iftané@ < 1, then Eq. (2.3) becomes 


l-u 


— sin + cos < u(cosdé+sind) => tan > 
l+ 


(2.5) 
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Putting these two ranges for 0 together, we have 


(2.6) 


Remarks: For very small jz, these bounds both approach 1, which means that 6 must be 
very close to 45°. This makes sense. If there is very little friction, then the components 
along the plane of the horizontal and vertical Mg forces must nearly cancel; hence, @ ~ 45°. 
A special value for jz is 1, because from Eq. (2.6), we see that u = 1 is the cutoff value 
that allows 6 to reach both 0 and 7/2. If u > 1, then any tilt of the plane is allowed. We’ve 
been assuming throughout this example that 0 < 6 < 2/2. The task of Exercise 2.20 is to 
deal with the case where 6 > 2/2, where the block is under an overhang. & 


Let’s now do an example involving a rope in which the tension varies with 
position. We’ll need to consider differential pieces of the rope to solve this 
problem. 


Example (Rope wrapped around a pole): A rope wraps an angle @ around a 
pole. You grab one end and pull with a tension 79. The other end is attached to a large 
object, say, a boat. If the coefficient of static friction between the rope and the pole 
is u, what is the largest force the rope can exert on the boat, if the rope is not to slip 
around the pole? 


Solution: Consider a small piece of the rope that subtends an angle d0. Let the 

4 tension in this piece be T (which varies slightly over the small length). As shown in 

< Fig. 2.3, the pole exerts a small outward normal force, Ngo, on the piece. This normal 
force exists to balance the “inward” components of the tensions at the ends. These 


Nao inward components have magnitude T sin(d0 ay! Therefore, Ngg = 2T sin (d0 /2). 

d 

~ N The small-angle approximation, sin x ~ x, allows us to write this as Nyg = T d0. 

ZA The friction force on the little piece of rope satisfies Fyg < uNg = uT d0. This 
Tsin d0/2 


friction force is what gives rise to the difference in tension between the two ends of 


the piece. In other words, the tension, as a function of 8, satisfies 
Fig. 2.3 


T(0+d0) < T(0) + uT do 
= dT <pTdo 


= / pe / do 
To H 
=> n?<pwd+cC 
= T<TheM?, (2.7) 
' One of them actually has magnitude (T + dT) sin(d0/2), where dT is the increase in tension along 


the small piece. But the extra term this produces, (dT) sin(d6/2), is a second-order small quantity, 
so it can be ignored. 


2.2 Balancing torques 


where we have used the fact that T = To when 6 = 0. The exponential behavior 
here is quite strong (as exponential behaviors tend to be). If we let y = 1, then just a 
quarter turn around the pole produces a factor of e”/2 ~ 5, One full revolution yields 
a factor of e?" ~ 530, and two full revolutions yield a factor of e*” ~ 300000. 
Needless to say, the limiting factor in such a case is not your strength, but rather the 
structural integrity of the pole around which the rope winds. 


2.2 Balancing torques 


In addition to balancing forces in a statics problem, we must also balance torques. 
We’ ll have much more to say about torque in Chapters 8 and 9, but we’ll need 
one important fact here. Consider the situation in Fig. 2.4, where three forces 
are applied perpendicular to a stick, which is assumed to remain motionless. F1 
and F3 are the forces at the ends, and F3 is the force in the interior. We have, of 
course, F3 = F + F2, because the stick is at rest. But we also have the following 
relation: 


Claim 2.1 If the system is motionless, then F3a = F(a + b). In other words, 
the torques (force times distance) around the left end cancel.” And you can show 
that they cancel around any other point, too. 


We’ ll prove this claim in Chapter 8 by using angular momentum, but let’s give a 
short proof here. 


Proof: We’ll make one reasonable assumption, namely, that the correct 
relationship between the forces and distances is of the form, 


Ff (a) = Faf (a + b), (2.8) 


where f (x) is a function to be determined.’ Applying this assumption with the 
roles of “left” and “right” reversed in Fig. 2.4 gives 


Ff (b) = Ff (a + b). (2.9) 
Adding Eqs. (2.8) and (2.9), and using F3 = F + Fo, yields 
fa) +f) =f a+b). (2.10) 


This equation implies that f (rx) = rf (x) for any x and for any rational number 
r, as you can show (see Exercise 2.28). Therefore, assuming f (x) is continuous, 


2 Another proof of this claim is given in Problem 2.11. 
3 What we’re doing here is simply assuming linearity in F. That is, two forces of F applied at a point 
should be the same as a force of 2F applied at that point. You can’t really argue with that. 
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it must be a linear function, f(x) = Ax, as we wanted to show. The constant A is 
irrelevant, because it cancels in Eq. (2.8). E 


Note that dividing Eq. (2.8) by Eq. (2.9) gives Fi f(a) = Fzf (b), and hence 
F\a = Fb, which says that the torques cancel around the point where F3 is 
applied. You can show that the torques cancel around any arbitrary pivot point. 
When adding up all the torques in a given physical setup, it is of course required 
that you use the same pivot point when calculating each torque. 

In the case where the forces aren’t perpendicular to the stick, the above claim 
applies to the components of the forces perpendicular to the stick. This makes 
sense, because the components parallel to the stick have no effect on the rotation 
of the stick around the pivot point. Therefore, referring to Figs. 2.5 and 2.6, the 
equality of the torques can be written as 


Fyasin 6g = Fpb sin 6p. (2.11) 


This equation can be viewed in two ways: 


e (Fa sin @q)a = (Fp sin 6,)b. In other words, we effectively have smaller forces acting 
on the given “lever arms,” as shown in Fig. 2.5. 
e Fa(asin 0a) = Fp(b sin 6p). In other words, we effectively have the given forces acting 


on smaller “lever arms,” as shown in Fig. 2.6. 


Claim 2.1 shows that even if you apply only a tiny force, you can balance 
the torque due to a very large force, provided that you make your lever arm 
sufficiently long. This fact led a well-known mathematician of long ago to claim 
that he could move the earth if given a long enough lever arm. 


One morning while eating my Wheaties, 
I felt the earth move ‘neath my feeties. 
The cause for alarm 

Was a long lever arm, 

At the end of which grinned Archimedes! 


One handy fact that comes up often is that the gravitational torque on a 
stick of mass M is the same as the gravitational torque due to a point-mass 
M located at the center of the stick. The truth of this statement relies on the fact 
that torque is a linear function of the distance to the pivot point (see Exercise 
2.27). More generally, the gravitational torque on an object of mass M may be 
treated simply as the gravitational torque due to a force Mg located at the center 
of mass. 

We’ll talk more about torque in Chapters 8 and 9, but for now we’ll just 
use the fact that in a statics problem the torques around any given point must 
balance. 


2.2 Balancing torques 


Example (Leaning ladder): A ladder leans against a frictionless wall. If the 
coefficient of friction with the ground is u, what is the smallest angle the ladder can 
make with the ground and not slip? 


Solution: Let the ladder have mass m and length £. As shown in Fig. 2.7, we have 
three unknown forces: the friction force F, and the normal forces Ny and M2. And 
to solve for these three forces we fortunately have three equations: UFyert = 0, 
=Fhoriz = 0, and Ut = 0 (t is the standard symbol for torque). Looking at the 
vertical forces, we see that N} = mg. And then looking at the horizontal forces, we 
see that M2 = F. So we have quickly reduced the unknowns from three to one. 

We will now use Xt = 0 to find M (or F). But first we must pick the “pivot” point 
around which we will calculate the torques. Any stationary point will work fine, but 
certain choices make the calculation easier than others. The best choice for the pivot 
is generally the point at which the most forces act, because then the Xt = 0 equation 
will have the smallest number of terms in it (because a force provides no torque around 
the point where it acts, since the lever arm is zero). In this problem, there are two 
forces acting at the bottom end of the ladder, so this is the point we’ll choose for the 
pivot (but you should verify that other choices for the pivot, for example, the middle 
or top of the ladder, give the same result). Balancing the torques due to gravity and 
N2, we have 

mg 
2tan 0 ` 


No€sin@d = mg(£/2)cosð => N= (2.12) 


This is also the value of the friction force F. The condition F < uN, = umg 
therefore becomes 


mg 1 
< = tan > —. 2.1 
2tand 7 ee ne 2u G13) 


Remarks: Note that the total force exerted on the ladder by the floor points up at an angle 
given by tan 8 = Nı/F = (mg)/(mg/2 tan@) = 2 tan@. We see that this force does not 
point along the ladder. There is simply no reason why it should. But there is a nice reason 
why it should point upward with twice the slope of the ladder. This is the direction that 
causes the lines of the three forces on the ladder to be concurrent (that is, pass through a 
common point), as shown in Fig. 2.8. This concurrency is a neat little theorem for statics 
problems involving three forces. The proof is simple. If the three lines weren’t concurrent, 
then one force would produce a nonzero torque around the intersection point of the other 
two lines of force.* 

This theorem provides a quick way to solve the ladder problem in the more general 
case where the center of mass is a fraction f of the way up. In this case, the concurrency 
theorem tells us that the slope of the total force from the floor is (1/f) tan @, consistent 
with the f = 1/2 result from above. The vertical component is still mg, so the horizontal 
(friction) component is now fmg/tan@. Demanding that this be less than or equal to 
umg gives tan > f/u, consistent with the f = 1/2 result. Since this result depends 


4 The one exception to this reasoning is where no two of the lines intersect, that is, where all three 
lines are parallel. Equilibrium is certainly possible in such a scenario, as we saw in Claim 2.1. But 
you can hang on to the concurrency theorem in this case if you consider the parallel lines to meet 
at infinity. 


Fig. 2.7 


Fig. 2.8 
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only on the location of the center of mass, and not on the exact distribution of mass, a 
corollary is that if you climb up a ladder (resting on a frictionless wall), your presence 
makes the ladder more likely to slip if you are above the center of mass (because you have 
raised the center of mass of the entire system and thus increased f), and less likely if you 
are below. & 


The examples we’ve done in this chapter have consisted of only one object. 
But many problems involve more than one object (as you’ II find in the problems 
and exercises for this chapter), and there’s one additional fact you’ll often need to 
invoke for these, namely Newton’s third law. This states that the force that object 
A exerts on object B is equal and opposite to the force that B exerts on A (we’ll 
talk more about Newton’s laws in Chapter 3). So if you want to find, say, the 
normal force between two objects, you might be able to figure it out by looking 
at forces and torques on either object, depending on how much you already know 
about the other forces acting on each. Once you’ve found the force by dealing 
with, say, object 4, you can then use the equal and opposite force to help figure 
out things about B. Depending on the problem, one object is often more useful 
than the other to use first. 

Note, however, that if you pick your subsystem (on which you’re going to 
consider forces and torques) to include both A and B, then this won’t tell you 
anything at all about the normal force (or friction) between them. This is true 
because the normal force is an internal force between the objects (when consid- 
ered together as a system), whereas only external forces are relevant in calculating 
the total force and torque on the system (because all the internal forces cancel in 
pairs, by Newton’s third law). The only way to determine a given force is to deal 
with it as an external force on some subsystem(s). 

Statics problems often involve a number of decisions. If there are various parts 
to the system, then you must decide which subsystems you want to balance the 
external forces and torques on. And furthermore, you must decide which point to 
use as the origin for calculating the torques. There are invariably many choices 
that will give you the information you need, but some will make your calculations 
much cleaner than others (Exercise 2.35 is a good example of this). The only way 
to know how to choose wisely is to start solving problems, so you may as well 
tackle some... 


2.3 Problems 


Section 2.1: Balancing forces 


2.1. Hanging rope 
A rope with length Z and mass density per unit length p is suspended 
vertically from one end. Find the tension as a function of height along 
the rope. 
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2.2. Block on a plane 


A block sits on a plane that is inclined at an angle 0. Assume that 
the friction force is large enough to keep the block at rest. What are the 
horizontal components of the friction and normal forces acting on the 
block? For what 0 are these horizontal components maximum? 


2.3. Motionless chain « 


A frictionless tube lies in the vertical plane and is in the shape of a func- 
tion that has its endpoints at the same height but is otherwise arbitrary. 
A chain with uniform mass per unit length lies in the tube from end to 
end, as shown in Fig. 2.9. Show, by considering the net force of gravity 
along the curve, that the chain doesn’t move. Fig. 2.9 


2.4. Keeping a book up « 


A book of mass M is positioned against a vertical wall. The coeffi- 
cient of friction between the book and the wall is u. You wish to keep 
the book from falling by pushing on it with a force F applied at an 
angle 0 with respect to the horizontal (—7x/2 < 0 < 2/2), as shown 
in Fig. 2.10. 


(a) Fora given 0, what is the minimum F required? 

(b) For what O is this minimum F the smallest? What is the Fig. 2.10 
corresponding minimum F? 

(c) What is the limiting value of 6, below which there does not exist 
an F that keeps the book up? 


2.5. Rope ona plane x 


A rope with length Z and mass density per unit length p lies on a plane 
inclined at an angle 6 (see Fig. 2.11). The top end is nailed to the plane, 
and the coefficient of friction between the rope and the plane is u. What 
are the possible values for the tension at the top of the rope? 


Fig. 2.11 
2.6. Supporting a disk +x 
(a) A disk of mass M and radius R is held up by a massless string, as 
shown in Fig. 2.12. The surface of the disk is frictionless. What is 
the tension in the string? What is the normal force per unit length 
that the string applies to the disk? 
(b) Let there now be friction between the disk and the string, with 
coefficient u. What is the smallest possible tension in the string 
at its lowest point? 
2.7. Objects between circles xx Fig. 2.12 


Each of the following planar objects is placed, as shown in Fig. 2.13, 
between two frictionless circles of radius R. The mass density per unit 


Fig. 2.15 


A 


Fig. 2.16 
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2.8. 


2.9. 


2.10. 


area of each object is ø, and the radii to the points of contact make an 
angle 0 with the horizontal. For each case, find the horizontal force that 
must be applied to the circles to keep them together. For what 0 is this 
force maximum or minimum? 


(a) An isosceles triangle with common side length L. 
(b) A rectangle with height L. 
(c) Acircle. 


Hanging chain x««« 


(a) Achain with uniform mass density per unit length hangs between 
two given points on two walls. Find the general shape of the chain. 
Aside from an arbitrary additive constant, the function describing 
the shape should contain one unknown constant. (The shape of a 
hanging chain is known as a catenary.) 

(b) The unknown constant in your answer depends on the horizontal 
distance d between the walls, the vertical distance A between the 
support points, and the length £ of the chain (see Fig. 2.14). Find 
an equation involving these given quantities that determines the 
unknown constant. 


Hanging gently +» 

A chain with uniform mass density per unit length hangs between two 
supports located at the same height, a distance 2d apart (see Fig. 2.15). 
What should the length of the chain be so that the magnitude of the 
force at the supports is minimized? You may use the fact that a hanging 
chain takes the form, y(x) = (1/a) cosh(a@x). You will eventually need 
to solve an equation numerically. 


Mountain climber «««« 


A mountain climber wishes to climb up a frictionless conical moun- 
tain. He wants to do this by throwing a lasso (a rope with a loop) over 
the top and climbing up along the rope. Assume that the climber is of 
negligible height, so that the rope lies along the mountain, as shown 
in Fig. 2.16. At the bottom of the mountain are two stores. One sells 
“cheap” lassos (made of a segment of rope tied to a loop of fixed 
length); see Fig. 2.17. The other sells “deluxe” lassos (made of one 
piece of rope with a loop of variable length; the loop’s length may 
change without any friction of the rope with itself). When viewed from 
the side, the conical mountain has an angle œ at its peak. For what 
angles œ can the climber climb up along the mountain if he uses a 
“cheap” lasso? A “deluxe” lasso? (Hint: The answer in the “cheap” case 
isn’t a < 90°.) 


2.3 Problems 


Section 2.2: Balancing torques 


2.11. 


2.12. 


2:13: 


Equality of torques +x 


This problem gives another way of demonstrating Claim 2.1, using an 
inductive argument. We’ll get you started, and then you can do the 
general case. 

Consider the situation where forces F are applied upward at the ends 
ofa stick of length £, and a force 2F is applied downward at the midpoint 
(see Fig. 2.18). The stick doesn’t rotate (by symmetry), and it doesn’t 
translate (because the net force is zero). If we wish, we may consider 
the stick to have a pivot at the left end. If we then erase the force F on 
the right end and replace it with a force 2F at the middle, then the two 
2F forces in the middle cancel, so the stick remains at rest.° Therefore, 
we see that a force F applied at a distance £ from a pivot is equivalent 
to a force 2F applied at a distance ¢/2 from the pivot, in the sense that 
they both have the same effect in canceling out the rotational effect of 
the downwards 2F force. 

Now consider the situation where forces F are applied upward at the 
ends, and forces F are applied downward at the £/3 and 2£/3 marks 
(see Fig. 2.19). The stick doesn’t rotate (by symmetry), and it doesn’t 
translate (because the net force is zero). Consider the stick to have a 
pivot at the left end. From the above paragraph, the force F at 24/3 
is equivalent to a force 2F at €/3. Making this replacement, we now 
have a total force of 3F at the £/3 mark. Therefore, we see that a 
force F applied at a distance £ is equivalent to a force 3F applied at a 
distance ¢/3. 

Your task is to now use induction to show that a force F applied at a 
distance £ is equivalent to a force nF applied at a distance £/n, and to 
then argue why this demonstrates Claim 2.1. 


Direction of the tension « 


Show that the tension in a completely flexible rope, massive or massless, 
points along the rope everywhere in the rope. 


Find the force « 


A stick of mass M is held up by supports at each end, with each support 
providing a force of Mg/2. Now put another support somewhere in the 
middle, say, at a distance a from one support and b from the other; 
see Fig. 2.20. What forces do the three supports now provide? Is this 
solvable? 


5 There is now a different force applied at the pivot, namely zero, but the purpose of the pivot is to 


simply apply whatever force is necessary to keep the left end motionless. 
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2.16. 


Leaning sticks « 


One stick leans on another as shown in Fig. 2.21. A right angle is formed 
where they meet, and the right stick makes an angle 6 with the horizontal. 
The left stick extends infinitesimally beyond the end of the right stick. 
The coefficient of friction between the two sticks is u. The sticks have 
the same mass density per unit length and are both hinged at the ground. 
What is the minimum angle @ for which the sticks don’t fall? 


. Supporting a ladder + 


A ladder of length L and mass M has its bottom end attached to the 
ground by a pivot. It makes an angle 6 with the horizontal and is held 
up by a massless stick of length £ that is also attached to the ground by a 
pivot (see Fig. 2.22). The ladder and the stick are perpendicular to each 
other. Find the force that the stick exerts on the ladder. 


Balancing the stick +» 


Given a semi-infinite stick (that is, one that goes off to infinity in one 
direction), determine how its density should depend on position so that it 
has the following property: If the stick is cut at an arbitrary location, the 
remaining semi-infinite piece will balance on a support that is located a 
distance £ from the end (see Fig. 2.23). 


. The spool +x 


A spool consists of an axle of radius r and an outside circle of radius R 
which rolls on the ground. A thread is wrapped around the axle and is 
pulled with tension T at an angle 0 with the horizontal (see Fig. 2.24). 


(a) Given R and r, what should 0 be so that the spool doesn’t move? 
Assume that the friction between the spool and the ground is large 
enough so that the spool doesn’t slip. 

(b) Given R, r, and the coefficient of friction u between the spool 
and the ground, what is the largest value of T for which the spool 
remains at rest? 

(c) Given R and u, what should r be so that you can make the spool 
slip from the static position with as small a T as possible? That 
is, what should r be so that the upper bound on T in part (b) is as 
small as possible? What is the resulting value of T? 


. Stick ona circle «« 


A stick of mass density per unit length p rests on a circle of radius R (see 
Fig. 2.25). The stick makes an angle 0 with the horizontal and is tangent 
to the circle at its upper end. Friction exists at all points of contact, and 
assume that it is large enough to keep the system at rest. Find the friction 
force between the ground and the circle. 


2.4 Exercises 


2.19. Leaning sticks and circles +s» 


A large number of sticks (with mass density per unit length o) and 
circles (with radius R) lean on each other, as shown in Fig. 2.26. Each 
stick makes an angle 0 with the horizontal and is tangent to the next 
circle at its upper end. The sticks are hinged to the ground, and every 
other surface is frictionless (unlike in the previous problem). In the limit 
of a very large number of sticks and circles, what is the normal force 
between a stick and the circle it rests on, very far to the right? Assume 
that the last circle leans against a wall, to keep it from moving. 


2.4 Exercises 


Section 2.1: Balancing forces 


2.20. 


2.21. 


2:22; 


2:23. 


Block under an overhang + 


A block of mass is positioned underneath an overhang that makes an 
angle 6 with the horizontal. You apply a horizontal force of Mg on the 
block, as shown in Fig. 2.27. Assume that the friction force between the 
block and the overhang is large enough to keep the block at rest. What 
are the normal and friction forces (call them N and Ff) that the overhang 
exerts on the block? If the coefficient of static friction is u, for what 
range of angles £ does the block in fact remain at rest? 


Pulling a block x 


A person pulls on a block with a force F, at an angle 0 with respect to the 
horizontal. The coefficient of friction between the block and the ground 
is u. For what 0 is the F required to make the block slip a minimum? 
What is the corresponding F? 


Holding a cone « 


With two fingers, you hold an ice cream cone motionless upside down, 
as shown in Fig. 2.28. The mass of the cone is m, and the coefficient 
of static friction between your fingers and the cone is u. When viewed 
from the side, the angle of the tip is 26. What is the minimum normal 
force you must apply with each finger in order to hold up the cone? In 
terms of 6, what is the minimum value of u that allows you to hold up 
the cone? Assume that you can supply as large a normal force as needed. 


Keeping a book up +» 


The task of Problem 2.4 is to find the minimum force required to keep a 
book up. What is the maximum allowable force, as a function of 0 and 
u? Is there a special angle that arises? Given u, make a rough plot of 
the allowed values of F for —1/2 < 0 < 7/2. 
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2.24. Bridges «x 


(a) Consider the first bridge in Fig. 2.29, made of three equilateral 
EAAS triangles ofbeams. Assume that the seven beams are massless and 
that the connection between any two of them is a hinge. If a car 
of mass m is located at the middle of the bridge, find the forces 
(and specify tension or compression) in the beams. Assume that 
the supports provide no horizontal forces on the bridge. 
(b) Same question, but now with the second bridge in Fig. 2.29, made 
of seven equilateral triangles. 
(c) Same question, but now with the general case of 4n — 1 equilateral 
triangles. 


m 


Fig. 2.29 


2.25. Rope between inclines xx 


A rope rests on two platforms that are both inclined at an angle 0 (which 
you are free to pick), as shown in Fig. 2.30. The rope has uniform mass 
density, and the coefficient of friction between it and the platforms is 1. 
The system has left-right symmetry. What is the largest possible fraction 
of the rope that does not touch the platforms? What angle 6 allows this 
maximum fraction? 


Fig. 2.30 


2.26. Hanging chain +» 


A chain with mass M hangs between two walls, with its ends at the 
same height. The chain makes an angle 0 with each wall, as shown in 
Fig. 2.31. Find the tension in the chain at the lowest point. Solve this in 
two different ways: 


(a) Consider the forces on half of the chain. (This is the quick way.) 
(b) Use the fact (see Problem 2.8) that the height of a hanging chain is 
À 6 given by y(x) = (1/æ) cosh(@x), and consider the vertical forces 
on an infinitesimal piece at the bottom. This will give you the 
M tension in terms of a. Then find an expression for œ in terms of 
the given angle 0. (This is the long way.) 


Fig. 2.31 
Section 2.2: Balancing torques 


2.27. Gravitational torque 


A horizontal stick of mass M and length L is pivoted at one end. Integrate 
the gravitational torque along the stick (relative to the pivot), and show 
that the result is the same as the torque due to a mass M located at the 
center of the stick. 


2.28. Linear function x 


Show that if a function satisfies f(a) + f (b) = f(a + b), then f (rx) = 
rf (x) for any x and for any rational number r. 


2:29; 


2.30. 


231. 


2.32. 


2:33: 


2.34. 


2.4 Exercises 


Direction of the force x 


A stick is connected to other parts of a static system by hinges at its 
ends. Show that (1) if the stick is massless, then the forces it feels at the 
hinges are directed along the stick, but (2) if the stick is massive, then 
the forces need not point along the stick. 


Ball on a wall « 


A ball is held up by a string, as shown in Fig. 2.32, with the string tangent 
to the ball. If the angle between the string and the wall is 0, what is the 
minimum coefficient of static friction between the ball and the wall that 
keeps the ball from falling? 


Cylinder and hanging mass x 


A uniform cylinder of mass M sits on a fixed plane inclined at an 
angle 0. A string is tied to the cylinder’s rightmost point, and a mass m 
hangs from the string, as shown in Fig. 2.33. Assume that the coeffi- 
cient of friction between the cylinder and the plane is sufficiently large 
to prevent slipping. What is m, in terms of M and 0, if the setup is 
static? 


Ladder on a corner xx 


A ladder of mass M and length Z leans against a frictionless wall, with 
a quarter of its length hanging over a corner, as shown in Fig. 2.34. It 
makes an angle 0 with the horizontal. What angle 6 requires the smallest 
coefficient of friction at the corner to keep the ladder at rest? (Different 
values of 0 require different ladder lengths, but assume that the mass is 
M for any length.) 


Stick on a corner xx 


You support one end of a stick of mass M and length L with the tip of 
your finger. A quarter of the way up the stick, it rests on a frictionless 
corner of a table, as shown in Fig. 2.35. The stick makes an angle 0 
with the horizontal. What is the magnitude of the force your finger must 
apply to keep the stick in this position? For what angle 0 does your force 
point horizontally? 


Stick and a cylinder «« 


A horizontal stick of mass m has its left end attached to a pivot on a plane 
inclined at an angle 0, while its right end rests on the top of a cylinder 
also of mass m which in turn rests on the plane, as shown in Fig. 2.36. 
The coefficient of friction between the cylinder and both the stick and 
the plane is m. 


(a) Assuming that the system is at rest, what is the normal force from 
the plane on the cylinder? 
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(b) What is the smallest value of u (in terms of 6) for which the 
system doesn’t slip anywhere? 


2.35. Two sticks and a string «+» 


Two sticks, each of mass m and length £, are connected by a hinge at their 
top ends. They each make an angle 0 with the vertical. A massless string 
connects the bottom of the left stick to the right stick, perpendicularly 
as shown in Fig. 2.37. The whole setup stands on a frictionless table. 


(a) What is the tension in the string? 


(b) What force does the left stick exert on the right stick at the hinge? 
Hint: No messy calculations required! 


2.36. Two sticks and a wall xx 


Two sticks are connected, with hinges, to each other and to a wall. The 
bottom stick is horizontal and has length L, and the sticks make an angle 
of @ with each other, as shown in Fig. 2.38. If both sticks have the same 
mass per unit length, p, find the horizontal and vertical components 
of the force that the wall exerts on the top hinge, and show that the 
magnitude goes to infinity for both 6 > 0 and 6 —> 1/2.° 


2.37. Stick on a circle xx 


Using the results from Problem 2.18 for the setup shown in Fig. 2.39, 
show that ifthe system is to remain at rest, then the coefficient of friction: 


(a) between the stick and the circle must satisfy 


= sin 0 (2.14) 
= TH e088 | f 
(b) between the stick and the ground must satisfy’ 

nee sin 0 cos 0 

l H> . (2.15) 
(1 + cos @)(2 — cos 0) 
2.38. Stacking blocks x» 
N blocks of length £ are stacked on top of each other at the edge of a 
table, as shown in Fig. 2.40 for N = 4. What is the largest horizontal 
Fig. 2.40 


6 The force must therefore achieve a minimum at some intermediate angle. If you want to go through 
the algebra, you can show that this minimum occurs when cos @ = /3 — 1, which gives 6 © 43°. 

7 If you want to go through the algebra, you can show that the right-hand side achieves a maximum 
when cos@ = v3 — 1, which gives 9 ~ 43°. (Yes, I did just cut and paste this from the previous 
footnote. But it’s still correct!) This is the angle for which the stick is most likely to slip on the 
ground. 


2.5 Solutions 


distance the rightmost point on the top block can hang out beyond the 
table? How does your answer behave for N —> 00?® 


2.5 Solutions 


2.1. 


2.2. 


2:3; 


2.4. 


Hanging rope 

Let T(y) be the tension as a function of height. Consider a small piece of the rope 
between y and y + dy (0 < y < L). The forces on this piece are T(y + dy) upward, 
T(v) downward, and the weight pg dy downward. Since the rope is at rest, we have 
T + dy) = T(y) + pg dy. Expanding this to first order in dy gives T'(y) = pg. The 
tension in the bottom of the rope is zero, so integrating from y = 0 up to a position 
y gives 


T) = pay. (2.16) 


As a double-check, at the top end we have 7(L) = pgL, which is the weight of the 
entire rope, as it should be. 

Alternatively, you can simply write down the answer, T(y) = pgy, by noting that 
the tension at a given point in the rope is what supports the weight of all the rope 
below it. 


Block on a plane 


Balancing the forces shown in Fig. 2.41, parallel and perpendicular to the plane, we 
see that F = mgsin@ and N = mgcos@. The horizontal components of these are 
Fcos@ = mgsin@cos@ (to the right), and N sin@é = mgcos@sin@ (to the left). 
These are equal, as they must be, because the net horizontal force on the block is zero. 
To maximize the value of mg sin 0 cos 0, we can either take the derivative, or we can 
write it as (mg/2) sin 20, from which it is clear that the maximum occurs at 6 = 7/4. 
The maximum value is mg/2. 


Motionless chain 


Let the curve be described by the function f(x), and let it run from x = a tox = b. 
Consider a little piece of the chain between x and x + dx (see Fig. 2.42). The length 


of this piece is y 1 +f” dx, so its mass is py 1 +f’? dx, where p is the mass per unit 
length. The component of the gravitational acceleration along the curve is —g sin 0 = 


—2f'//1 +f? (using tan = f’), with positive corresponding to moving along the 
curve from a to b. The total force along the curve is therefore 


b b —gf! 
F= (—g sin 0) dm = -poy1+f2dx 


a 
b 
=-gp | sax 
a 


= -gp (f (a) — f(b) 
=0. (2.17) 


Keeping a book up 


(a) The normal force from the wall is F cos@, so the friction force Fp holding 
the book up is at most uF cos@. The other vertical forces on the book are the 


8 It turns out that the method of stacking shown in Fig. 2.40 (with the blocks simply stacked on top 


of each other) doesn’t yield the optimal overhang. See Hall (2005) for an interesting discussion of 


other methods. 
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gravitational force, which is —Mg, and the vertical component of F, which is 
F sin @. If the book is to stay at rest, we must have F'sin@ + Ff — Mg = 0. 
Combining this with the condition Ff < uF cos 6 gives 


F(sin@ + u cos) > Mg. (2.18) 
Therefore, F must satisfy 


F> e i (2.19) 
sin + u cos 0 
assuming that sin 0 + u cos 0 is positive. Ifit is negative, then there is no solution 
for F. 
(b) To minimize this lower bound, we must maximize the denominator. Taking the 
derivative gives cos@ — u sin 0 = 0, so tan = 1/u. Plugging this value of 6 
back into Eq. (2.19) gives 


F> "E. (with tan0 = 1/y). (2.20) 
14+ p2 
This is the smallest possible F that keeps the book up, and the angle must be 
6 =tan!(1 /) for it to work. We see that if jz is very small, then to minimize 
your F, you should push essentially vertically with a force mg. But if jz is very 
large, you should push essentially horizontally with a force mg/m. 
(c) There is no possible F that satisfies the condition in Eq. (2.19) if the right-hand 
side is infinite (more precisely, there is no F that satisfies Eq. (2.18) if the 
coefficient of F is zero or negative). This occurs when 


tan = —p. (2.21) 


If @ is more negative than this, then it is impossible to keep the book up, no 
matter how hard you push. 


Rope on a plane 


The component of the gravitational force along the plane is (pL)g sin 8, and the max- 
imum value of the friction force is uN = u(pL)g cos@. Therefore, you might think 
that the tension at the top of the rope is pLg sind — upLgcos@. However, this is 
not necessarily the case. The tension at the top depends on how the rope is placed on 
the plane. If, for example, the rope is placed on the plane without being stretched, 
then the friction force points upwards, and the tension at the top does indeed equal 
pLg sin — upLg cosé@. Or it equals zero if upLg cos > pLg sin0, in which case 
the friction force need not achieve its maximum value. 

If, on the other hand, the rope is placed on the plane after being stretched (or 
equivalently, it is dragged up along the plane and then nailed down at its top end), 
then the friction force points downwards, and the tension at the top equals pLg sin 0 + 
upLg cosé. 

Another special case occurs when the rope is placed on a frictionless plane, and then 
the coefficient of friction is “turned on” to jz. The friction force is still zero. Changing 
the plane from ice to sandpaper (somehow without moving the rope) doesn’t suddenly 
cause there to be a friction force. Therefore, the tension at the top equals pLg sin 8. 

In general, depending on how the rope is placed on the plane, the tension at the top 
can take any value from a maximum of pLg sin@ + upLg cos 6, down to a minimum 
of pLg sind — upLg cos0 (or zero, whichever is larger). If the rope is replaced by a 
stick (which can support a compressive force), then the tension can achieve negative 
values down to pLg sin — upLg cos 9, if this happens to be negative. 


Supporting a disk 


(a) The gravitational force downward on the disk is Mg, and the force upward is 
2T. These forces must balance, so 


T=. (2.22) 


2.5 Solutions 


We can find the normal force per unit length that the string applies to the disk 
in two ways. 


FIRST METHOD: Let N d0 be the normal force on an arc of the disk that subtends 
an angle d0. Such an arc has length Rd0, so N/R is the desired normal force 
per unit arclength. The tension in the string is the same throughout it, because 
the string is massless. So all points are equivalent, and hence N is constant, 
independent of 6. The upward component of the normal force is N d0 cos 6, 
where @ is measured from the vertical (that is, —7/2 < 0 < m/2 here). Since 
the total upward force is Mg, we must have 
m/2 
N cos dé = Mg. (2.23) 
—1/2 
The integral equals 2N, so we have N = Mg/2. The normal force per unit 
length, N/R, therefore equals Mg/2R. 


SECOND METHOD: Consider the normal force, N d6, on a small arc of the disk 
that subtends an angle d0. The tension forces on each end of the corresponding 
small piece of string almost cancel, but they don’t exactly, because they point 
in slightly different directions. Their nonzero sum is what produces the normal ‘ 
force on the disk. From Fig. 2.43, we see that the two forces have a sum of 
2T sin(d@/2), directed “inward”. Since d0 is small, we can use sinx œ~ x to 
approximate this as T d0. Therefore, N d0 = T d0, and so N = T. The normal 


force per unit arclength, N/R, therefore equals T/R. Using T = Mg/2 from Eq. meh 
(2.22), we arrive at N/R = Mg/2R. L 
(b) Let T(6) be the tension, as a function of 0, for —z/2 < 6 < m/2. T now Tsin d0/2 


depends on 6, because there is a tangential friction force. Most of the work for 
this problem was already done in the “Rope wrapped around a pole” example in Fig. 2.43 
Section 2.1. We’ll simply invoke Eq. (2.7), which in the present language says 


T(0) < T(O)e"". (2.24) 


Letting 9 = 2/2, and using T (7/2) = Mg/2, gives Mg/2 < T(0)e""/?. We 
therefore see that the tension at the bottom point must satisfy 


M; 
T(0) > L ema, (2.25) 


REMARK: This minimum value of T (0) goes to Mg/2 as u — 0, as it should. 
And it goes to zero as u —> oo, as it should (imagine a very rough surface, so 
that the friction force from the rope near 0 = 2/2 accounts for essentially all the 
weight). But interestingly, the tension at the bottom doesn’t exactly equal zero, 
no matter now large u is. Basically, the smaller T is, the smaller N is. But the 
smaller N is, the smaller the change in T is (because N determines the friction 


force). So T doesn’t decrease much when it’s small, and this results in it never 
being able to reach zero. & 
2.7. Objects between circles M 


(a) Let N be the normal force between the circles and the triangle. The goal in 
this problem is to find the horizontal component of N, that is, N cos 0. From 
Fig. 2.44, we see that the upward force on the triangle from the normal forces is 
2N sin 0. This must equal the weight of the triangle, which is gø times the area. Fig. 2.44 
Since the bottom angle of the isosceles triangle is 20, the top side has length 


° This holds for 6 > 0. There would be a minus sign on the right-hand side if @ < 0. But since the 
tension is symmetric around 6 = 0 in the case we’re concerned with, we’ll just deal with 6 > 0. 
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2L sin @, and the altitude to this side is Lcos@. So the area of the triangle is 

L? sin 6 cos 0. The mass is therefore oZ? sin @ cos 8. Equating the weight with 

the upward component of the normal forces gives N = (gor? /2)cos0. The 

horizontal component of N is therefore 

go L? cos? 0 

7 i 
This equals zero when 6 = 7/2, and it increases as 0 decreases, even though 
the triangle is getting smaller. It has the interesting property of approaching the 
finite value go L? /2, as 0 —> 0. 

(b) In Fig. 2.45, the base of the rectangle has length 2R(1 — cos@). Its mass is 
therefore 20 RL(1 — cos 0). Equating the weight with the upward component of 
the normal forces, 2N sin 0, gives N = ogRL(1 — cos@)/ sin 0. The horizontal 
component of N is therefore 


N cos = (2.26) 


ogRL(1 — cos 0) cos 0 


N cos 
sin 6 


(2.27) 


This equals zero for both 6 = 2/2 and 6 = 0 (because 1 — cos@ ~ 6? /2 goes 
to zero faster than sin@ ~ 0, for small 0). Taking the derivative to find where it 
reaches a maximum, we obtain (using sin? 6 = 1 — cos? 0), 


cos? 6 — 2 cos + 1 = 0. (2.28) 


Fortunately, there is an easy root of this cubic equation, namely cos@ = 1, 
which we know is not the maximum. Dividing through by the factor (cos 6 — 1) 
gives cos? @ + cos@ — 1 = 0. The roots of this quadratic equation are 


-1+5 
cos@ = A . (2.29) 


We must choose the plus sign, because we need | cos 0| < 1. So our answer is 
cos ~ 0.618, which is the inverse of the golden ratio. The angle 6 is ~ 51.8°. 
(c) In Fig. 2.46, the length of the hypotenuse shown is R sec 0, so the radius of the 
top circle is R(sec@ — 1). Its mass is therefore or R? (sec — 1)*. Equating 
the weight with the upward component of the normal forces, 2N sin 6, gives 
N=o gm R? (sec 0— 1)? /(2 sin 0). The horizontal component of N is therefore 


ogn R? cos 1 1 : ogmR?(1 — cos)? 
2sin@ i 


N cos = 


(2.30) 


cos 0 2 sin 0 cos 0 


This equals zero when 6 = 0 (using cos@ © 1 — 67/2 and sin@ ~ 9, for 
small 0). For @ —> 7/2, it behaves like 1/ cos 6, which goes to infinity. In this 
limit, N points almost vertically, but its magnitude is so large that the horizontal 
component still approaches infinity. 


Fig. 2.46 


2.8. Hanging chain 


(a) The key fact to note is that the horizontal component, Ty, of the tension is the 
T(x+dx) same throughout the chain. This is true because the net horizontal force on any 
subpart of the chain must be zero. Label the constant value as T, = C. 

Let the shape of the chain be described by the function y(x). Since the tension 
points along the chain at all points (see Problem 2.12), its components satisfy 
T/T = y', which gives T, = Cy’. In other words, T, is proportional to the 
slope of the chain. 

Now consider a little piece of the chain, with endpoints at x and x + dx, as 
shown in Fig. 2.47. The difference in the T, values at the endpoints is what 
balances the weight of the little piece, (dm)g. The length of the piece is ds = 


dx,/1 + y’, so if p is the density, we have 


dT, 
dT, = (pds)g = pg dx 1 +y? g = eaylty?. 231) 
i X 


2.5 Solutions 


Using the T, = Cy’ result from above, this becomes Cy” = pgy 1 + y”. Letting 
z = y', we can separate variables and integrate to obtain 


dz pg dx | pex 
f = sinh z = —— +4, (2.32) 
V1+22 | C C 


where A is a constant of integration. We can make this look a little cleaner if we 
define constants œ and a such that a = pg/C and a = A/a. We then obtain 


sinh™! z = a(x + a) z = sinha (x + a). (2.33) 


Recalling that z = dy/dx, we can integrate again to obtain 
1 
yx) = — cosha (x + a) + h. (2.34) 
a 


The shape of the chain is therefore a hyperbolic cosine function. The constant 
h isn’t too important, because it depends simply on where we pick the y = 0 
height. Furthermore, we can eliminate the need for the constant a if we pick 
x = 0 to be where the lowest point of the chain is (or where it would be, in the 
case where the slope is always nonzero). In this case, using Eq. (2.34), we see 
that y’ (0) = 0 implies a = 0, as desired. We then have (ignoring the constant 4) 
the nice simple result, 


V(x) = L cosh(ax). (2.35) 


(b) The constant œ can be determined from the locations of the endpoints and the 


length of the chain. As stated in the problem, the position of the chain may be 
described by giving (1) the horizontal distance d between the two endpoints, 
(2) the vertical distance à between the two endpoints, and (3) the length £ of 
the chain, as shown in Fig. 2.48. Note that it isn’t obvious what the horizontal 
distances between the ends and the minimum point (which we have chosen as 
the x = 0 point) are. If A = 0, then these distances are d/2, by symmetry. But 
otherwise, they aren’t so clear. 

If we let the left endpoint be located at x = —xo, then the first of the above 
three facts says that the right endpoint is located at x = d — xo. We now have 
two unknowns, xo and œ. The second fact tells us that (we’ll take the right end 
to be higher than the left end, without loss of generality) 


y(d — xo) — y(—x0) = A. (2.36) 
And the third fact gives, using Eq. (2.35), 
d—xo 1 d—xo 
£= i y 1 +y? dx = — sinh(ax) R (2.37) 
—xo a —Xx0 


where we have used (d/du) coshu = sinh u, and 1 + sinh? u = cosh? u, and 
J cosh u = sinh u. Writing out Eqs. (2.36) and (2.37) explicitly, we have 


cosh (æ (d — xo)) — cosh(—axo) = a, 
(2.38) 


sinh (a(d xo)) sinh(—axo) = aé. 


We can eliminate xo by taking the difference of the squares of these two 


equations. Using the hyperbolic identities cosh?u — sinh>u = 1 and 
cosh u cosh v — sinh u sinh v = cosh(u — v), we obtain 
2cosh(ad) — 2 = œ? (È — 2°). (2.39) 


This is the desired equation that determines a. Given d, A, and £, we can numer- 
ically solve for w. Using a “half-angle” formula, you can show that Eq. (2.39) 
may also be written as 


2 sinh(ad/2) = ay 2 — 22. (2.40) 
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REMARK: Let’s check a couple limits. If à = 0 and £ = d (that is, the chain 
forms a horizontal straight line), then Eq. (2.40) becomes 2 sinh(ad/2) = ad. 
The solution to this is œ = 0, which does indeed correspond to a horizontal 
straight line, because for small œ, we can use cosh € ~ 1 + €*/2 to say that the 
y(x) in Eq. (2.35) behaves like ax? /2 (up to an additive constant), which varies 
slowly with x for small œ. Another limit is where £ is much larger than both d 
and å. In this case, Eq. (2.40) becomes 2 sinh (œd /2) ~ af. The solution to this 
is a large œ (or more precisely, œ >> 1/d), which corresponds to a “droopy” 
chain, because the y(x) in Eq. (2.35) varies rapidly with x for large œ. & 


Hanging gently 

We must first find the mass of the chain by calculating its length. Then we must 
determine the slope of the chain at the supports, so we can find the components of 
the force there. Using the given information, y(x) = (1/a) cosh(ax), the slope of the 
chain as a function of x is 


y= Ls (z cosh(a)) = sinh(ax). (2.41) 
dx \a 


The total length is therefore (using 1 + sinh? z = cosh? z) 


d d 
2 
l= / 1 +y? dx =f cosh(ax) = — sinh(ad). 2.42 
De yity by a (2.42) 


The weight of the rope is W = p£g, where p is the mass per unit length. Each support 
applies a vertical force of W/2. So this equals F sin 6, where F is the magnitude 
of the force at each support, and 6 is the angle it makes with the horizontal. Since 
tan@ = y'(d) = sinh(ad), we see from Fig. 2.49 that sin 0 = tanh(@d). Therefore, 


1 W 1 inh (æd 
Pe  PESINGE) _ 98 coshiad). (2.43) 
sind 2 tanh(ad) a a 
Taking the derivative of this (as a function of œ), and setting the result equal to zero to 


find the minimum, gives tanh(ad) = 1/(ad). This must be solved numerically. The 
result is 


ad ~ 1.1997 = n. (2.44) 


So @ is given by œ = n/d, and the shape of the chain that requires the minimum F is 
thus 


d nx 
v(x) ~ T cosh (=) (2.45) 


From Eqs. (2.42) and (2.44), the length of the chain is £ = (2d/n) sinh(n) © (2.52)d. 
To further get an idea of what the chain looks like, we can calculate the ratio of the 
height, A, to the width, 2d. 
h — y(d)—y() _ cosh(y) — 1 
2d 2d 2n 


We can also calculate the angle of the rope at the supports, using tan@ = sinh(ad). 
This gives tan 0 = sinh ņ, and so 0 © 56.5°. 


~ 0.338. (2.46) 


Remark: We can also ask what shape the chain should take in order to minimize 
the horizontal or vertical component of F. The vertical component, Fy, is simply half 
the weight, so we want the shortest possible chain, namely a horizontal one (which 
requires an infinite F). This corresponds to a = 0. The horizontal component, Fy, 
equals F cos 0. From Fig. 2.49, we see that cos @ = 1/ cosh(ad). Therefore, Eq. (2.43) 
gives Fy = pg/a. This goes to zero as œ — ov, which corresponds to a chain with 
infinite length, that is, a very “droopy” chain. & 


2.5 Solutions 


2.10. Mountain climber 


CHEAP LASso: We will take advantage of the fact that a cone is “flat,” in the sense 
that we can make one out of a piece of paper, without crumpling the paper. Cut the 
cone along a straight line emanating from the peak and passing through the knot of the 
lasso, and roll the cone flat onto a plane. Call the resulting figure, which is a sector of 
a circle, S (see Fig. 2.50). If the cone is very sharp, then S looks like a thin “pie piece.” 
If the cone is very wide, with a shallow slope, then S looks like a pie with a piece taken 
out of it. Points on the straight-line boundaries of the sector S are identified with each 
other. Let P be the location of the lasso’s knot. Then P appears on each straight-line 
boundary, at equal distances from the tip of S. Let £ be the angle of the sector S. 

The key to this problem is to realize that the path of the lasso’s loop must be a 
straight line on S, as shown by the dotted line in Fig. 2.50. This is true because the rope 
takes the shortest distance between two points because there is no friction, and rolling 
the cone onto a plane doesn’t change distances. But a straight line between the two 
identified points P is possible if and only if the sector S is smaller than a semicircle. 
The condition for a climbable mountain is therefore 8 < 180°. 

What is this condition, in terms of the angle of the peak, w? Let C denote a cross- 
sectional circle of the mountain, a distance d (measured along the cone) from the top. 
(We are considering this circle for geometrical convenience. It is not the path of the 
lasso; see the remark below.) A semicircular S implies that the circumference of C 
equals zd. This then implies that the radius of C equals d/2. Therefore, 

; d/2 1 5 
sin(a/2) < PA = z => a < 60°. (2.47) 
This is the condition under which the mountain is climbable. In short, having œ < 60° 
guarantees that there is a loop around the cone with shorter length than the distance 
straight to the peak and back. 


Remark: When viewed from the side, the rope will appear perpendicular to the side 
of the mountain at the point opposite the lasso’s knot. A common mistake is to assume 
that this implies that the climbable condition is œ < 90°. This is not the case, because 
the loop does not lie in a plane. Lying in a plane, after all, would imply an elliptical 
loop. But the loop must certainly have a kink in it where the knot is, because there 
must exist a vertical component to the tension there to hold the climber up. If we had 
posed the problem with a planar, triangular mountain, then the condition would have 
beena < 90°. & 


DELUXE Lasso: If the mountain is very steep, the climber can slide down the mountain 
by means of the loop growing larger. If the mountain has a shallow slope, the climber 
can slide down by means of the loop growing smaller. The only situation in which the 
climber doesn’t slide down is the one where the change in position of the knot along 
the mountain is exactly compensated by the change in length of the loop. 

Roll the cone onto a plane as we did in the cheap-lasso case. In terms of the sector S in 
a plane, the above condition requires that if we move P a distance £ up (or down) along 
the mountain, the distance between the identified points P must decrease (or increase) 
by £. In Fig. 2.50, we must therefore have an equilateral triangle, so 6 = 60°. 

What peak-angle œ does this correspond to? As in part the cheap-lasso case, let C be 
a cross-sectional circle of the mountain, a distance d (measured along the cone) from 
the top. Then 6 = 60° implies that the circumference of C equals (2 /3)d. This then 
implies that the radius of C equals d/6. Therefore, 
ae ~ 19° 2.48 

3 g => aw ; (2.48) 

This is the condition under which the mountain is climbable. We see that there is 
exactly one angle for which the climber can climb up along the mountain. The cheap 


sin(a/2) = 
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lasso is therefore much more useful than the fancy deluxe lasso, assuming, of course, 
that you want to use it for climbing mountains, and not, say, for rounding up cattle. 


REMARK: Another way to see the 6 = 60° result is to note that the three directions 
of rope emanating from the knot must all have the same tension, because the deluxe 
lasso is one continuous piece of rope. They must therefore have 120° angles between 
themselves (to provide zero net force on the massless knot). This implies that 6 = 60° 
in Fig. 2.50. & 


FURTHER REMARKS: For each type of lasso, we can also ask the question: For what 
angles can the mountain be climbed if the lasso is looped N times around the top of 
the mountain? The solution here is similar to that above. 

For the cheap lasso, roll the cone N times onto a plane, as shown in Fig. 2.51 for 
N = 4. The resulting figure, Sy, is a sector of a circle divided into N equal sectors, 
each representing a copy of the cone. As above, Sy must be smaller than a semicircle. 
The circumference of the circle C (defined above) must therefore be less than rd/N. 
Hence, the radius of C must be less than d/2N. Thus, 

in(r/2) d/2N 1 
sin(a/2) < —— = — 
d 2N 
For the deluxe lasso, again roll the cone N times onto a plane. From the original 
reasoning above, we must have V6 = 60°. The circumference of C must therefore be 
zd/3N, and so its radius must be d/6N. Therefore, 


d/6N 1 yi 
d ON wma gin ay): s 


= a< 2sin“!(5). (2.49) 


sin(a/2) (2.50) 
Equality of torques 
The proof by induction is as follows. Assume that we have shown that a force F 
applied at a distance d is equivalent to a force kF applied at a distance d/k, for all 
integers k up to n — 1. We now want to show that the statement holds for k = n. 
Consider the situation in Fig. 2.52. Forces F are applied at the ends of a stick, and 
forces 2F /(n — 1) are applied at the j£/n marks (for 1 < j < n — 1). The stick doesn’t 
rotate (by symmetry), and it doesn’t translate (because the net force is zero). Consider 
the stick to have a pivot at the left end. Replacing the interior forces by their equivalent 
ones at the ¢/n mark (see Fig. 2.52) gives a total force there equal to 


2F (£ — 2) T 
n—l 2 


mail ( (2.51) 


= L4+24+3+2-2+ D)= 


We therefore see that a force F applied at a distance £ is equivalent to a force nF 
applied at a distance £/n, as was to be shown. 

We can now show that Claim 2.1 holds, for arbitrary distances a and b (see Fig. 2.53). 
Consider the stick to be pivoted at its left end, and let € be a tiny distance (small 
compared with a). Then a force F3 at a distance a is equivalent to a force F3(a/e) at 
a distance €.!° But a force F3(a/e) at a distance € is equivalent to a force F3(a/e) 
(€/(a+b)) = F3a/(a + b) at a distance (a + b). This equivalent force at the distance 
(a + b) must cancel the force Fz there, because the stick is motionless. Therefore, we 
have F3a/(a + b) = F2, which proves the claim. 


Direction of the tension 


Consider an infinitesimal piece of the rope, and look at the torque around one end. 
Any forces acting at this end provide no torque around it. If the tension at the other 


10 Technically, we can use the reasoning in the previous paragraph to say this only if a/e is an integer, 


but since a/e is very large, we can simply pick the closest integer to it, and there will be negligible 


error. 


2.13. 


2.14. 


2.5 Solutions 


end is directed at a finite angle away from the direction of the rope, then this produces 
a certain torque. But this torque can’t be canceled by the much smaller torque from 
the tiny gravitational force, because this force is proportional to the length of the tiny 
piece. Therefore, the tension must point along the rope. It actually points along the 
direction of the rope at the end of the little piece it acts on, which isn’t quite along the 
direction of the rope at the end we’re considering torques around, because the rope 
bends (assuming it’s not vertical). So the tension ends up producing a very small torque 
which cancels the very small torque from gravity. 

This argument doesn’t work for a rigid stick, because the stick can produce finite 
torques around the end of a piece via forces at that end, because the end is really 
a cross section of finite size. There is a shearing action in the stick, and the large 
shearing forces act with tiny lever arms (relative to, say, a point at the middle of the 
cross section) to produce finite torques. 


Find the force 


In Fig. 2.54, let the supports at the ends exert forces Fı and F2, and let the support in 
the interior exert a force F. Then 


Pi +F +F = Mg. (2.52) 
Balancing torques around the left and right ends gives, respectively, 
a+b 
Fa+F2(a+b) = Mg 2? 
b (2.53) 
Fb + F(a +b) = Mg — 


where we have used the fact that the stick can be treated like a point mass at its center. 
Note that the equation for balancing the torques around the center of mass is redundant; 
it is obtained by taking the difference of the two previous equations and then dividing 
by 2. And balancing torques around the middle pivot also takes the form of a linear 
combination of these equations, as you can show. 

It appears as though we have three equations and three unknowns, but we really have 
only two equations, because the sum of Eqs. (2.53) gives Eq. (2.52). Therefore, since 
we have two equations and three unknowns, the system is underdetermined. Solving 
Eqs. (2.53) for F and F? in terms of F, we see that any forces of the form 


Mg Fb Mg Fa 
(F1, F, Fo) = sf 
2 a+b 2 a+b 


(2.54) 


are possible. In retrospect, it makes sense that the forces are not determined. By 
changing the height of the new support an infinitesimal distance, we can make F be 
anything from 0 up to Mg (a+ b)/2b, which is when the stick comes off the left support 
(assuming b > a). 


Leaning sticks 

Let Mı be the mass of the left stick, and let M, be the mass of the right stick. Then 
M\/M, = tan. Let N and Ff be the normal and friction forces between the sticks 
(see Fig. 2.55). Ff has a maximum value of uN. Balancing the torques on the left 
stick (around the contact point with the ground) gives N = (Mg/2) sin @. Balancing 
the torques on the right stick (around the contact point with the ground) gives Ff = 
(M,g/2) cos 0. The condition Ff < uN is therefore 


1 
M,cos@ < uMsin@ => tan? 0 > —, (2.55) 
u 


where we have used M/M, = tan 0. This answer checks in the two extremes: In the 
limit u — 0, we see that 6 must be very close to 7/2, which makes sense. And in the 
limit u — œœ (that is, very sticky sticks), we see that 0 can be very small, which also 
makes sense. 
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2.15. 


2.16. 


2.17. 


" For 6 


Supporting a ladder 


Let F be the desired force. F must be directed along the stick, because otherwise 
there would be a net torque on the (massless) stick relative to the pivot at its right 
end, and this would contradict the fact that it is at rest. Look at torques on the ladder 
around the pivot at its bottom. The gravitational force provides a clockwise torque of 
Mg(L/2) cos 6, and the force F from the stick provides a counterclockwise torque of 
F(£/ tan 0). Equating these two torques gives 

MgL 


F = sino. (2.56) 


REMARKS: F goes to zero as @ —> 0, as it should.!! And F increases to MgL/2£ 
as 0 — 2/2, which isn’t so obvious (the required torque from the stick is very 
small, but the lever arm is also very small). However, in the special case where 
the ladder is exactly vertical, no force is required. You can see that our calcula- 
tions above are not valid in this case, because we divided by cos@, which is zero 
when 0 = 17/2. 

The normal force at the pivot of the stick (which equals the vertical component of 
F, because the stick is massless) is equal to MgL sin 6 cos 0 /2£. This has a maximum 
value of MgL/4£ atO = 71/4. & 


Balancing the stick 

Let the stick go off to infinity in the positive x direction, and let it be cut at x = xo. 
Then the pivot point is located at x = xo + £ (see Fig. 2.56). Let the density be p(x). 
The condition that the total gravitational torque relative to xp + £ be zero is 


oo 
T= f p(x)(x — (xo + £))g dx = 0. (2.57) 
x0 
We want this to equal zero for all xo, so the derivative of t with respect to xp must be 
zero. T depends on xo through both the limits of integration and the integrand. In taking 
the derivative, the former dependence requires finding the value of the integrand at 
the xo limit, while the latter dependence requires taking the derivative of the integrand 
with respect to xo, and then integrating. (To derive these two contributions, just replace 
xo with x9 + dxo and expand things to first order in dxo.) We obtain 


dt me 
0= — = gtp(xo) -g p(x) dx. (2.58) 
dxo A 
Taking the derivative of this equation with respect to xo gives £p’ (xo) = —p (xo). The 


solution to this is (rewriting the arbitrary xo as x) 
p(x) = Ae ™ E, (2.59) 


We therefore see that the density decreases exponentially with x. The smaller £ is, the 
quicker it falls off. Note that the density at the pivot is 1/e times the density at the left 
end. And you can show that 1 — 1/e ~ 63 % of the mass is contained between the left 
end and the pivot. 


The spool 


(a) Let Ff be the friction force the ground provides. Balancing the horizontal forces 
on the spool gives (see Fig. 2.57) 


T cos@ = Fy. (2.60) 


— 0, we would need to lengthen the ladder with a massless extension, because the stick 


would have to be very far to the right to remain perpendicular to the ladder. 
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Balancing torques around the center of the spool gives 
Tr = FẹR. (2.61) 
These two equations imply 


ý 
0 = —. 2.62 
cos R ( ) 


The niceness of this result suggests that there is a quicker way to obtain it. And 
indeed, we see from Fig. 2.58 that cos@ = r/R is the angle that causes the line 


of the tension to pass through the contact point on the ground. Since gravity T 
and friction provide no torque around this point, the total torque around it is 
therefore zero, and the spool remains at rest. O 
(b) The normal force from the ground is a ag 
1 
N = Mg — Tsin. (2.63) RUO 


Using Eq. (2.60), the statement Ff < uN becomes 


M. 
T cos < u(Mg — T sind) T<— E, (264) Fig.258 
cosé + usin 


where @ is given in Eq. (2.62). 

(c) The maximum value of T is given in (2.64). This depends on @, which in turn 
depends on r. We want to find the r that minimizes this maximum T. Taking the 
derivative with respect to 0, we find that the @ that maximizes the denominator 
in Eq. (2.64) is given by tan 6) = u. You can then show that the value of T for 
this ĝo is 


M; 
ho HE (2.65) 


Vi+ 
To find the corresponding r, we can use Eq. (2.62) to write tan@ = JR? = r2/r. 
The relation tan 6) = u then yields 
a R 
ro = Tia i 
This is the r that yields the smallest upper bound on T. In the limit u = 0, we 


have 0) = 0, To = 0, and ro = R. And in the limit u = œo, we have 6) = 7/2, 
To = Mg, and rọ = 0. 


(2.66) 


2.18. Stick on a circle 


Let N be the normal force between the stick and the circle, and let Ff be the friction 

force between the ground and the circle (see Fig. 2.59). Then we immediately see that 

the friction force between the stick and the circle is also Ff, because the torques from 

the two friction forces on the circle must cancel. We’ve drawn all forces as acting 

on the circle. By Newton’s third law, N and Ff act in the opposite directions on the 

stick at its top end. Fig. 2.59 
Looking at torques on the stick around the point of contact with the ground, we 

have Mg(£/2) cos@ = N£, where M = p£ is the mass of the stick, and £ is its length. 

Therefore, N = (pfg/2)cos@. Balancing the horizontal forces on the circle gives 

Nsin@ = Ff + Ff cos 0, so we have 


Nsin@ plg sin 0 cos @ 
~ 1L+4cos@  2(1+cos@) 


But from Fig. 2.59 we have £ = R/ tan(@/2). Using the identity tan(9/2) = sin 0/(1 + 
cos 6), we finally obtain 


Ff 


(2.67) 


1 
Ff = 3 PER coso. (2.68) 
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2.19, 


In the limit 6 —> 2/2, Ff approaches zero, which makes sense. In the limit 0 > 0 
(which corresponds to a very long stick), the friction force approaches pgR/2, which 
isn’t so obvious. 


Leaning sticks and circles 


Let S; be the ith stick, and let C; be the ith circle. The normal forces that C; feels 
from S; and S;;; are equal in magnitude, because these two forces provide the only 
horizontal forces on the frictionless circle, so they must cancel. Let N; be this normal 
force. 

Look at the torques on S;+1, relative to the hinge on the ground. The torques come 
from N;, Ni+1, and the weight of S;;,. From Fig. 2.60, we see that N; acts at a point 
which is a distance Rtan(@/2) away from the hinge. Since the stick has a length 
R/tan(@/2), this point is a fraction tan? (6/2) up along the stick. Therefore, balancing 
the torques on S;+1 gives 


1 20 
Mg cos + N; tan 
2 2 


No is by definition zero, so we have Ni = (Mg/2) cos @ (as in the previous problem). 
If we successively use Eq. (2.69), we see that M2 equals (Mg/2) cos (1 + tan? (0/2)), 
and N3 equals (Mg/2) cos a(1 + tan? (0/2) + tan*(6/2)), and so on. In general, 


= Ni41. (2.69) 


M; 0 0 0 1) 9 
Ni = Sealy + tan? — + tant — +... + tan? D = j, (2.70) 
2 2 2 2 
In the limit 7 + 00, we may write this infinite geometric sum in closed form as 
N lim N Mg cos 0 1 (2.71) 
= lim N; = ‘ : 
PO S66. 2 1 — tan? (0/2) 


Note that this is the solution to Eq. (2.69), with N; = Ni+1. So if a limit exists, it must 
be this. Using M = pR/ tan(6/2), we can rewrite Noo as 


i= pRg cos ( 1 ) l (2.72) 
2tan(6/2) \ 1 — tan2(6/2) 


The identity cos 6 = cos? (8/2) — sin” (6/2) may then be used to write this as 


_ pRg cos? (0/2) 


Noe 2 sin(6/2) (ea) 


REMARKS: Noo goes to infinity for @ —> 0, which makes sense, because the sticks are 
very long. All of the N; are essentially equal to half the weight of a stick (in order to 
cancel the torque from the weight relative to the pivot). For 9 —> 2/2, we see from 
Eq. (2.73) that Noo approaches pRg/4, which is not at all obvious; the N; start off at 
Nı = (Mg/2)cos@ ~ 0, but gradually increase to pRg/4, which is a quarter of the 
weight of a stick. Note that the horizontal force that must be applied to the last circle 
far to the right is Næ sin@ = pRgcos*(6/2). This ranges from pRg for @ — 0, to 
pRg/4 ford > 1/2. & 


Chapter 3 
Using F = ma 


The general goal of classical mechanics is to determine what happens to a given 
set of objects in a given physical situation. In order to figure this out, we need to 
know what makes the objects move the way they do. There are two main ways 
of going about this task. The first one, which you are undoubtedly familiar with, 
involves Newton’s laws. This is the subject of the present chapter. The second 
one, which is more advanced, is the Lagrangian method. This is the subject of 
Chapter 6. It should be noted that each of these methods is perfectly sufficient 
for solving any problem, and they both produce the same information in the end. 
But they are based on vastly different principles. We’ll talk more about this in 
Chapter 6. 


3.1 Newton's laws 


In 1687 Newton published his three laws in his Principia Mathematica. These 
laws are fairly intuitive, although I suppose it’s questionable to attach the adjective 
“intuitive” to a set of statements that weren’t written down until a mere 300 years 
ago. At any rate, the laws may be stated as follows. 


e First law: A body moves with constant velocity (which may be zero) unless acted on 
by a force. 

e Second law: The time rate of change of the momentum of a body equals the force acting 
on the body. 

e Third law: For every force on one body, there is an equal and opposite force on another 
body. 


We could discuss for days on end the degree to which these statements are 
physical laws, and the degree to which they are definitions. Sir Arthur Eddington 
once made the unflattering remark that the first law essentially says that “every 
particle continues in its state of rest or uniform motion in a straight line except 
insofar as it doesn’t.” However, although the three laws might seem somewhat 
light on content at first glance, there’s actually more to them than Eddington’s 
comment implies. Let’s look at each in turn.! 


l A disclaimer: This section represents my view on which parts of the laws are definitions and which 
parts have content. But you should take all of this with a grain of salt. For further reading, see 
Anderson (1990), Keller (1987), O’Sullivan (1980), and Eisenbud (1958). 
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First law 

One thing this law does is give a definition of zero force. Another thing it does 
is give a definition of an inertial frame, which is defined simply as a frame of 
reference in which the first law holds; since the term “velocity” is used, we have 
to state what frame we’re measuring the velocity with respect to. The first law 
does not hold in an arbitrary frame. For example, it fails in the frame of a rotating 
turntable.” Intuitively, an inertial frame is one that moves with constant velocity. 
But this is ambiguous, because we have to say what the frame is moving with 
constant velocity with respect to. But all this aside, an inertial frame is defined 
as the special type of frame in which the first law holds. 

So, what we now have are two intertwined definitions of “force” and “inertial 
frame.” Not much physical content here. But the important point is that the law 
holds for all particles. So if we have a frame in which one free particle moves 
with constant velocity, then all free particles move with constant velocity. This 
is a statement with content. We can’t have a bunch of free particles moving with 
constant velocity while another one is doing a fancy jig. 


Second law 
Momentum is defined? to be mv. If m is constant,* then the second law says that 


F = ma, (3.1) 


where a = dv/dt. This law holds only in an inertial frame, which is defined by 
the first law. 


For things moving free or at rest, 
Observe what the first law does best. 
It defines a key frame, 

“Tnertial” by name, 


Where the second law then is expressed. 


You might think that the second law merely gives a definition of force, but 
there is more to it than that. There is a tacit implication in the law that this “force” 
is something that has an existence that isn’t completely dependent on the particle 
whose “m” appears in the law (more on this in the third law below). A spring 
force, for example, doesn’t depend at all on the particle on which it acts. And 
the gravitational force, GMm/r*, depends partly on the particle and partly on 
something else (another mass). 


2 It’s possible to modify things so that Newton’s laws hold in such a frame, provided that we introduce 
the so-called “fictitious” forces. But we'll save this discussion for Chapter 10. 

3 We’re doing everything nonrelativistically here, of course. Chapter 12 gives the relativistic 
modification to the mv expression. 

4 We'll assume in this chapter that m is constant. But don’t worry, we’ll get plenty of practice with 
changing mass (in rockets and such) in Chapter 5. 


3.1 Newton's laws 


If you feel like just making up definitions, then you can define a new quantity, 
G = ma. This is a perfectly legal thing to do; you can’t really go wrong 
in making a definition (well, unless you’ve already defined the quantity to be 
something else). However, this definition is completely useless. You can define 
it for every particle in the world, and for any acceleration, but the point is that 
the definitions don’t have anything to do with each other. There is simply no 
(uncontrived) quantity in this world that gives accelerations in the ratio of 4 to 
1 when “acting” on masses m and 2m. The quantity G has nothing to do with 
anything except the particle you defined it for. The main thing the second law 
says is that there does indeed exist a quantity F that gives the same ma when 
acting on different particles. The statement of the existence of such a thing is far 
more than a definition. 

Along this same line, note that the second law says that F = ma, and not, 
for example, F = mv, or F = m d*x/dt? . In addition to being inconsistent with 
the real world, these expressions are inconsistent with the first law. F = mv 
would say that a nonzero velocity requires a force, in contrast with the first law. 
And F = md?x/dt? would say that a particle moves with constant acceleration 
(instead of constant velocity) unless acted on by a force, also in contrast with the 
first law. 

As with the first law, it is important to realize that the second law holds for 
all particles. In other words, if the same force (for example, the same spring 
stretched by the same amount) acts on two particles with masses m and m2, then 
Eq. (3.1) says that their accelerations are related by 


el ma (3.2) 


a2 mM 


This relation holds regardless of what the common force is. Therefore, once 
we’ve used one force to find the relative masses of two objects, then we know 
what the ratio of their a’s will be when they are subjected to any other force. Of 
course, we haven’t really defined mass yet. But Eq. (3.2) gives an experimental 
method for determining an object’s mass in terms of a standard (say, 1 kg) mass. 
All we have to do is compare its acceleration with that of the standard mass, when 
acted on by the same force. 

Note that F = ma is a vector equation, so it is really three equations in one. 
In Cartesian coordinates, it says that Fy = max, Fy = may, and F; = maz. 


Third law 

One thing this law says is that if we have two isolated particles interacting through 
some force, then their accelerations are opposite in direction and inversely pro- 
portional to their masses. Equivalently, the third law essentially postulates that 
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the total momentum of an isolated system is conserved (that is, independent of 
time). To see this, consider two particles, each of which interacts only with the 
other particle and nothing else in the universe. Then we have 


dPtotal = dp 4 dp2 
dt dt dt 
=F; +F, (3.3) 


where F; and F; are the forces acting on mı and m2, respectively. This demon- 
strates that momentum conservation (that is, dpiotal/dt = 0) is equivalent to 
Newton’s third law (that is, Fı = —F2). Similar reasoning holds with more 
than two particles, but we’ll save this more general case, along with many other 
aspects of momentum, for Chapter 5. 

There isn’t much left to be defined via this law, so this is a law of pure content. 
It can’t be a definition, anyway, because it’s actually not always valid. It holds 
for forces of the “pushing” and “pulling” type, but it fails for the magnetic force, 
for example. In that case, momentum is carried off in the electromagnetic field 
(so the total momentum of the particles and the field is conserved). But we won’t 
deal with fields here. Just particles. So the third law will always hold in any 
situation we’ll be concerned with. 

The third law contains an extremely important piece of information. It says 
that we will never find a particle accelerating unless there’s some other parti- 
cle accelerating somewhere else. The other particle might be far away, as with 
the earth-sun system, but it’s always out there somewhere. Note that if we were 
given only the second law, then it would be perfectly possible for a given particle 
to spontaneously accelerate with nothing else happening in the universe, as long 
as a similar particle with twice the mass accelerated with half the acceleration 
when placed in the same spot, etc. This would all be fine, as far as the second law 
goes. We would say that a force with a certain value is acting at the point, and 
everything would be consistent. But the third law says that this is simply not the 
way the world (at least the one we live in) works. In a sense, a force without a 
counterpart seems somewhat like magic, whereas a force with an equal and oppo- 
site counterpart has a “cause and effect” nature, which seems (and apparently is) 
more physical. 

In the end, however, we shouldn’t attach too much significance to Newton’s 
laws, because although they were a remarkable intellectual achievement and 
work spectacularly for everyday physics, they are the laws of a theory that is only 
approximate. Newtonian physics is a limiting case of the more correct theories 
of relativity and quantum mechanics, which are in turn limiting cases of yet more 
correct theories. The way in which particles (or waves, or strings, or whatever) 
interact on the most fundamental level surely doesn’t bear any resemblance to 
what we call forces. 


3.2 Free-body diagrams 


3.2 Free-body diagrams 


The law that allows us to be quantitative is the second law. Given a force, we can 
apply F = ma to find the acceleration. And knowing the acceleration, we can 
determine the behavior of a given object (that is, the position and velocity), pro- 
vided that we are given the initial position and velocity. This process sometimes 
takes a bit of work, but there are two basic types of situations that commonly 
arise. 


e In many problems, all you are given is a physical situation (for example, a block 
resting on a plane, strings connecting masses, etc.), and it is up to you to find all the 
forces acting on all the objects, using F = ma. The forces generally point in various 
directions, so it’s easy to lose track of them. It therefore proves useful to isolate the 
objects and draw all the forces acting on each of them. This is the subject of the present 
section. 

e In other problems, you are given the force explicitly as a function of time, position, 
or velocity, and the task immediately becomes the mathematical one of solving the 
F = ma = mx equation (we’ll just deal with one dimension here). These differential 
equations can be difficult (or impossible) to solve exactly. They are the subject of 
Section 3.3. 


Let’s consider here the first of these two types of scenarios, where we are pre- 
sented with a physical situation and we must determine all the forces involved. 
The term free-body diagram is used to denote a diagram with all the forces 
drawn on a given object. After drawing such a diagram for each object in the 
setup, we simply write down all the F = ma equations they imply. The result 
will be a system of linear equations in various unknown forces and accelera- 
tions, for which we can then solve. This procedure is best understood through an 
example. 


Example (A plane and masses): Mass ™ is held on a plane with inclination 
angle 6, and mass Mz hangs over the side. The two masses are connected by a 
massless string which runs over a massless pulley (see Fig. 3.1). The coefficient of 
kinetic friction between M and the plane is u. M is released from rest. Assuming that 
Mh is sufficiently large so that M4 gets pulled up the plane, what is the acceleration 
of the masses? What is the tension in the string? 


Solution: The first thing to do is draw all the forces on the two masses. These are 
shown in Fig. 3.2. The forces on Mh are gravity and the tension. The forces on M4 are 
gravity, friction, the tension, and the normal force. Note that the friction force points 
down the plane, because we are assuming that Mı moves up the plane. 

Having drawn all the forces, we can now write down all the F = ma equations. 
When dealing with M1, we could break things up into horizontal and vertical compo- 
nents, but it is much cleaner to use the components parallel and perpendicular to the 
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plane." These two components of F = ma, along with the vertical F = ma equation 
for M, give 


T -f — Migsin = Ma, 
N — Mig cos0 = 0, (3.4) 
Mog — T = Mna, 


where we have used the fact that the two masses accelerate at the same rate (and 
we have defined the positive direction for Mz to be downward). We have also used 
the fact that the tension is the same at both ends of the string, because otherwise there 
would be a net force on some part of the string which would then undergo infinite 
acceleration, because it is massless. 

There are four unknowns in Eq. (3.4) (namely T, a, N, and f), but only three 
equations. Fortunately, we have a fourth equation: f = uN, because we are assuming 
that M4 is in fact moving, so we can use the expression for kinetic friction. Using this 
in the second equation above gives f = uM) g cos 0. The first equation then becomes 
T —pM,g cos 0 — Mı g sin 0 = M,a. Adding this to the third equation leaves us with 
only a, so we find 


g(M> — “uM, cosé — Mı sin 8) T M\M2g(1 + u cos + sind) 
g= = . 
Mı +M2 Mı +M 


(3.5) 


Note that in order for M4 to in fact accelerate upward (that is, a > 0), we must have 
M2 > Mı (u cos + sin 0). This is clear from looking at the forces along the plane. 


Remark: If we instead assume that Mj is sufficiently large so that it slides down the 
plane, then the friction force points up the plane, and we find (as you can check), 


= g(M + uM, cos 6 — M; sin 0) 
Mı + Mz 


= M\Mo2g(1 — wcosé + sin 8) 
Mı + M2 i 


, and T 


(3.6) 


In order for Mı to in fact accelerate downward (that is, a < 0), we must have M2 < 
Mı (sin — u cos 0). Therefore, the range of Mz for which the system doesn’t accelerate 
(that is, it just sits there, assuming that it started at rest) is 


M,(sin@ — wcos@) < Ma < M\(sin@ + wcosé). (3.7) 


If u is very small, then M2 must essentially be equal to Mı sin@ if the system is to 
be static. Equation (3.7) also implies that if tan < jy, then Mı won’t slide down, 
evenifM,=0. & 


In problems like the one above, it’s clear which things you should pick as the 
objects you’re going to draw forces on. But in other problems, where there are 


> When dealing with inclined planes, it’s usually the case that one of these two coordinate systems 
works much better than the other. Sometimes it isn’t clear which one, but if things get messy with 
one system, you can always try the other. 


3.2 Free-body diagrams 


various different subsystems you can choose, you must be careful to include all 
the relevant forces on a given subsystem. Which subsystems you want to pick 
depends on what quantities you’ re trying to find. Consider the following example. 


Example (Platform and pulley): A person stands on a platform-and-pulley sys- 
tem, as shown in Fig. 3.3. The masses of the platform, person, and pulley® are M, m, 
and pL, respectively.” The rope is massless. Let the person pull up on the rope so that 
she has acceleration a upward. (Assume that the platform is somehow constrained to 
stay level, perhaps by having the ends run along some rails.) Find the tension in the 
rope, the normal force between the person and the platform, and the tension in the 
rod connecting the pulley to the platform. 


Solution: To find the tension in the rope, we simply want to let our subsystem be 

the whole system (except the ceiling). If we imagine putting the system in a black 

box (to emphasize the fact that we don’t care about any internal forces within the 

system), then the forces we see “protruding” from the box are the three weights (Mg, Fig. 3.3 
mg, and ug) downward, and the tension T upward. Applying F = ma to the whole 

system gives 


T—-(M+m+wg=(M+m+pa => T=(M+m+pw(e+a). 
(3.8) 


To find the normal force N between the person and the platform, and also the tension 
f in the rod connecting the pulley to the platform, it is not sufficient to consider the 
system as a whole. This is true because these forces are internal forces to this system, 
so they won’t show up in any F = ma equations (which involve only external forces 
to a system). So we must consider subsystems: 


e Let’s apply F = mato the person. The forces acting on the person are gravity, the 
normal force from the platform, and the tension from the rope (pulling downward 
on her hand). So we have 


N —T — mg = ma. (3.9) 


e Nowapply F = mato the platform. The forces acting on the platform are gravity, 
the normal force from the person, and the force upward from the rod. So we have 


f —N—Meg = Ma. (3.10) 


e Now apply F = ma to the pulley. The forces acting on the pulley are gravity, the 
force downward from the rod, and twice the tension in the rope (because it pulls 


® Assume that the pulley’s mass is concentrated at its center, so that we don’t have to worry about 
any rotational dynamics (the subject of Chapter 8). 

7 My apologies for using u as a mass here, since it usually denotes a coefficient of friction. Alas, 
there are only so many symbols for “m.” 
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up on both sides). So we have 


2T —f — ug = pa. (3.11) 


Note that if we add up the three previous equations, we obtain the F = ma equation 
in Eq. (3.8), as should be the case, because the whole system is the sum of the three 
above subsystems. Equations (3.9)-(3.11) are three equations in the three unknowns, 
T, N, and f. Their sum yields the T in (3.8), and then Eqs. (3.9) and (3.11) give, 
respectively, as you can show, 


N=(M+2m+p)(gt+a), and f=Q2M+2m+p)(gt+a). (3.12) 


Remarks: You can also obtain these results by considering subsystems different from the 
ones we chose above. For example, you can choose the pulley-plus-platform subsystem, 
etc. But no matter how you choose to break up the system, you will need to produce three 
independent F = ma equations in order to solve for the three unknowns, T, N, and f. 

In problems like this one, it’s easy to forget to include certain forces, such as the second T 
in Eq. (3.11). The safest thing to do is to always isolate each subsystem, draw a box around 
it, and then draw all the forces that “protrude” from the box. In other words, draw the 
free-body diagram. Figure 3.4 shows the free-body diagram for the subsystem consisting 
of only the pulley. & 


Another class of problems, similar to the above example, goes by the name 
of Atwood machines. An Atwood’s machine is the name used for any system 
consisting of a combination of masses, strings, and pulleys.* In general, the 
pulleys and strings can have mass, but we’ll deal only with massless ones in 
this chapter. As we’ll see in the following example, there are two basic steps in 
solving an Atwood’s problem: (1) write down all the F = ma equations, and 
(2) relate the accelerations of the various masses by noting that the length of the 
string(s) doesn’t change, a fact that we call “conservation of string.” 


Example (Atwood’s machine): Consider the pulley system in Fig. 3.5, with 
masses mı and m2. The strings and pulleys are massless. What are the accelerations 
of the masses? What is the tension in the string? 


Solution: The first thing to note is that the tension T is the same everywhere 
throughout the massless string, because otherwise there would be an infinite accel- 
eration of some part of the string. It then follows that the tension in the short string 
connected to mz is 2T. This is true because there must be zero net force on the mass- 
less right pulley, because otherwise it would have infinite acceleration. The F = ma 


$ George Atwood (1746-1807) was a tutor at Cambridge University. He published the description 
of the first of his machines in Atwood (1784). For a history of Atwood’s machines, see Greenslade 
(1985). 


3.2 Free-body diagrams 


equations for the two masses are therefore (with upward taken to be positive) 


T—mg = m;a], 
(3.13) 
2T — mg = maz. 


We now have two equations in the three unknowns, a1, a2, and T. So we need one 
more equation. This is the “conservation of string” fact, which relates a, and ap. 
If we imagine moving mz and the right pulley up a distance d, then a length 2d 
of string has disappeared from the two parts of the string touching the right pulley. 
This string has to go somewhere, so it ends up in the part of the string touching m1 


(see Fig. 3.6). Therefore, mı goes down by a distance 2d. In other words, y} = 
—2y2, where yı and y2 are measured relative to the initial locations of the masses. 
Taking two time derivatives of this statement gives our desired relation between 


a, and ap, mi ' d id 
a, = —2ap. (3.14) an ! 
Combining this with Eq. (3.13), we can now solve for a1, a2, and T. The result is nee 
= 2m2 — 4m _ 2m, -m T= 3mı mg (3.15) Fig. 3.6 


=. > a? = O = . 
4m, + m2 4m, + m2 4m, + m2 


Remarks: There are all sorts of limits and special cases that we can check here. A couple 
are: (1) If mz = 2m, then Eq. (3.15) gives aj = az = 0, and T = mjg. Everything is 
at rest. (2) If mz >> mı, then Eq. (3.15) gives aj = 2g, a2 = —g, and T = 3myg. In 
this case, mp is essentially in free fall, while mı gets yanked up with acceleration 2g. The 
value of T is exactly what is needed to make the net force on mı equal to mı (2g), because 
T — mg = 3m,g — mg = m; (2g). You can check the case where m; >> mp. 

For the more general case where there are N masses instead of two, the “conservation 
of string” statement is a single equation that relates all N accelerations. It is most easily 
obtained by imagining moving N — 1 of the masses, each by an arbitrary amount, and then 
seeing what happens to the last mass. Note that these arbitrary motions undoubtedly do 
not correspond to the actual motions of the masses. This is fine; the single “conservation 
of string” equation has nothing to do with the N F = ma equations. The combination of 
all N + 1 equations is needed to constrain the motions down to a unique set. & 


In the problems and exercises for this chapter, you will encounter some strange 
Atwood’s setups. But no matter how complicated they get, there are only two 
things you need to do to solve them, as mentioned above: write down the F = ma 
equations for all the masses (which may involve relating the tensions in various 
strings), and then relate the accelerations of the masses, using “conservation of 
string.” 


It may seem, with the angst it can bring, 
That an Atwood’s machine’s a cruel thing. 
But you just need to say 

That F is ma, 

And use conservation of string! 
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3.3 Solving differential equations 


Let’s now consider the type of problem where we are given the force as a function 
of time, position, or velocity, and our task is to solve the F = ma = mx differ- 
ential equation to find the position, x(f), as a function of time.’ In what follows, 
we will develop a few techniques for solving differential equations. The ability 
to apply these techniques dramatically increases the number of systems we can 
understand. 

It’s also possible for the force F to be a function of higher derivatives of x, 
in addition to the quantities ¢, x, and v = x. But these cases don’t arise much, so 
we won’t worry about them. The F = ma differential equation we want to solve 
is therefore (we’ll just work in one dimension here) 


mx = F(t,x,v). (3.16) 


In general, this equation cannot be solved exactly for x(f).!° But for most of 
the problems we’ll deal with, it can be solved. The problems we’ll encounter 
will often fall into one of three special cases, namely, where F is a function of t 
only, or x only, or v only. In all of these cases, we must invoke the given initial 
conditions, x9 = x(fo) and vp = v(to), to obtain our final solutions. These initial 
conditions will appear in the limits of the integrals in the following discussion.!! 

Note: You may want to just skim the following page and a half, and then refer 
back as needed. Don’t try to memorize all the different steps. We present them 
only for completeness. The whole point here can basically be summarized by 
saying that sometimes you want to write x as dv/dt, and sometimes you want to 
write it as v du/dx (see Eq. (3.20)). Then you “simply” have to separate variables 
and integrate. We’ll go through the three special cases, and then we’ll do some 
examples. 


e F isa function of t only: F = F(t). 
Since a = d?x/dt*, we just need to integrate F = ma twice to obtain x(t). Let’s do this 
in a very systematic way, to get used to the general procedure. First, write F = ma as 


do pe 3.17 
mM : (3.17) 


° In some setups, such as in Problem 3.11, the force isn’t given, so you have to figure out what it is. 
But the main part of the problem is still solving the resulting differential equation. 

10 You can always solve for x(t) numerically, to any desired accuracy. This topic is discussed in 
Section 1.4. 

ll Tt is no coincidence that we need two initial conditions to completely specify the solution to 
our second-order (meaning the highest derivative of x that appears is the second one) F = mx 
differential equation. It is a general result (which we’ll just accept here) that the solution to an 
nth-order differential equation has n free parameters, which are determined by the initial conditions. 


3.3 Solving differential equations 


Then separate variables and integrate both sides to obtain !? 


v(t) t 
mf a= f F(t) dt’. (3.18) 
V t 


0 0 


We have put primes on the integration variables so that we don’t confuse them with the 
limits of integration, but in practice we usually don’t bother with them. The integral of 
dv’ is just v’, so Eq. (3.18) yields v as a function of t, that is, v(t). We can then separate 
variables in dx/dt = v(t) and integrate to obtain 


x(t) t 
f dx! = f u(t’) dt’. (3.19) 
XO to 


This yields x as a function of t, that is, x(t). This procedure might seem like a cumber- 
some way to simply integrate something twice. That’s because it is. But the technique 
proves more useful in the following case. 

e F is a function of x only: F = F(x). 
We will use 


=v (3.20) 


to write F = ma as 


dv 


muz = FQ). (3.21) 


Now separate variables and integrate both sides to obtain 


v(x) x 
mf v dv! = f F(x’) dx’. (3.22) 
VU XI 


0 X0 


The integral of v’ is v’ A /2, so the left-hand side involves the square of v(x). Taking 


the square root, this gives v as a function of x, that is, v(x). Separating variables in 


dx/dt = v(x) then yields 
x(t) dx! t 
f Z= f dt’. (3.23) 
x0 v ) to 


Assuming that we can do the integral on the left-hand side, this equation gives ¢ as a 
function of x. We can then (in principle) invert the result to obtain x as a function of t, 
that is, x(¢). The unfortunate thing about this case is that the integral in Eq. (3.23) might 
not be doable. And even if it is, it might not be possible to invert t(x) to produce x(t). 


2 iF you haven’t seen such a thing before, the act of multiplying both sides by the infinitesimal 
quantity dt might make you feel a bit uneasy. But it is in fact quite legal. If you wish, you can 
imagine working with the small (but not infinitesimal) quantities Av and At, for which it is 
certainly legal to multiply both sides by At. Then you can take a discrete sum over many At 
intervals, and then finally take the limit At —> 0, which results in the integral in Eq. (3.18). 
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e F is a function of v only: F = F(v). 
Write F = ma as 


d TN 3.24 
m= FO). (3.24) 


Separate variables and integrate both sides to obtain 


v(t) dv’ t 
mf — =f dt’. (3.25) 
vo F(v ) to 


Assuming that we can do this integral, it yields ¢ as a function of v, and hence (in 
principle) v as a function of f, that is, v(t). We can then integrate dx/dt = v(t) to obtain 
x(t) from 


x(t) t 
f dx = f v(t) dt’. (3.26) 
x ti 


0 0 


Note: In this F = F(v) case, if we want to find v as a function of x, v(x), then we 
should write a as v(dv/dx) and integrate 


v(x) ’ dy! x 
m f ve" = f dx. (8.27) 
vo Fv ) xO 


We can then obtain x(t) from Eq. (3.23), if desired. 


When dealing with the initial conditions, we have chosen to put them in the 
limits of integration above. If you wish, you can perform the integrals without 
any limits, and just tack on a constant of integration to your result. The constant 
is then determined by the initial conditions. 

Again, as mentioned above, you do not have to memorize the above three 
procedures, because there are variations, depending on what you’re given and 
what you want to solve for. All you have to remember is that x can be written 
as either dv/dt or v dv/dx. One of these will get the job done (namely, the one 
that makes only two of the three variables, ¢, x, and v, appear in your differential 
equation). And then be prepared to separate variables and integrate as many times 
as needed. !* 


ais dv by dt. 

Is this useful? There’s no guarantee. 

If it leads to “Oh, heck!” s, 

Take dv by dx, 

And then write down its product with v. 


'3 We want only two of the variables to appear in the differential equation because the goal is to 
separate variables and integrate, and because equations have only two sides. If equations were 
triangles, it would be a different story. 


3.3 Solving differential equations 


Example (Gravitational force): A particle of mass m is subject to a constant 
force F = —mg. The particle starts at rest at height A. Because this constant force 
falls into all of the above three categories, we should be able to solve for y(t) in two 


ways: 


(a) Find y(t) by writing a as dv/dt. 
(b) Find y(t) by writing a as udu/dy. 


Solution: 


(a) F = ma gives dv/dt = —g. Multiplying by dt and integrating yields v = 
—gt+ A, where A is a constant of integration. 14 The initial condition v(0) = 0 
gives A = 0. Therefore, dy/dt = —gt. Multiplying by dt and integrating yields 
y= -gt /2 + B. The initial condition y(0) = h gives B = h. Therefore, 


l 
y=h= sat (3.28) 


(b) F = ma gives vdv/dy = —g. Separating variables and integrating yields 
v*/2 = —gy + C. The initial condition v(h) = 0 gives v?/2 = —gy + gh. 
Therefore, v = dy/dt = —,/2g(h— y) . We have chosen the negative square 
root because the particle is falling. Separating variables gives 


a dt. (3.29) 
Fs] 

This yields 2,\/h — y = ./2g t, where we have used the initial condition y(0) = 
h. Hence, y = h — gt? /2, as in part (a). In part (b) here, we essentially derived 


conservation of energy, as we’ll see in Chapter 5. 


Example (Dropped ball): A beach ball is dropped from rest at height h. Assume 
that the drag force! from the air takes the form of Fa = —fv. Find the velocity and 
height as a function of time. 


Solution: For simplicity in future formulas, let’s write the drag force as Fg = 
—ßv = —mav (otherwise we’d have a bunch of 1/m’s floating around). Taking 
upward to be the positive y direction, the force on the ball is 


F = —mg — mav. (3.30) 


14 We’ll do this example by adding on constants of integration which are then determined by the 
initial conditions. We’ll do the following example by putting the initial conditions in the limits of 
integration. 

15 The drag force is roughly proportional to v as long as the speed is fairly small (say, less than 
10m/s). For large speeds (say, greater than 100 m/s), the drag force is roughly proportional to 
v. But these approximate cutoffs depend on various things, and in any event there is a messy 
transition region between the two cases. 
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Note that v is negative here, because the ball is falling, so the drag force points 
upward, as it should. Writing F = m dv/dt and separating variables gives 


v(t) dv’ t 
Í ——— = -f dt’. (3.31) 
0 gtav 0 
Integration yields In(1 + av/g) = —at. Exponentiation then gives 
E —at 
vi) =-=(l-e™). (3.32) 
a 


Writing dy/dt = v(t), and then separating variables and integrating to obtain y(t), 
yields 


y(t) t P 
f dy! = -Ef (1 ~e) a. (3.33) 
h a JO 


Therefore, 
1 
ror a (: = -(1 z a) (3.34) 
a a 


REMARKS: 


1. Let’s look at some limiting cases. If t is very small (more precisely, if æt « 1), then 
we can use e~* ~ 1 — x + x*/2 to make approximations to leading order in t. You 
can show that Eq. (3.32) gives u(t) ~ —gt. This makes sense, because the drag force 
is negligible at the start, so the ball is essentially in freefall. And likewise you can 
show that Eq. (3.34) gives y(t) ~ h — gt? /2, which is again the freefall result. 

We can also look at large ¢. In this case, e~* is essentially equal to zero, so 
Eq. (3.32) gives v(t) © —g/a. (This is the “terminal velocity.” Its value makes 
sense, because it is the velocity for which the total force, —mg — mav, vanishes.) 
And Eq. (3.34) gives y(t) © h — (g/a)t + g/a?. Interestingly, we see that for large 
t, g/a? is the distance our ball lags behind another ball that started out already at the 
terminal velocity, —g/a. 

2. You might think that the velocity in Eq. (3.32) doesn’t depend on m, since no m’s 
appear. However, there is an m hidden in œ. The quantity œ (which we introduced 
just to make our formulas look a little nicer) was defined by Fy = —Bv = —mav. 
But the quantity 6 = ma is roughly proportional to the cross-sectional area, A, of 
the ball. Therefore, œ x A/m. Two balls of the same size, one made of lead and one 
made of styrofoam, have the same A but different m’s. So their w’s are different, and 
they fall at different rates. 

If we have a solid ball with density p and radius r, then a « A/m x r?/(pr?) = 
1/pr. For large dense objects in a thin medium such as air, the quantity œ is small, 
so the drag effects are not very noticeable over short times (because if we include 
the next term in the expansion for v, we obtain u(t) ~ —gt + wgt?/2). Large dense 
objects therefore all fall at roughly the same rate, with an acceleration essentially 
equal to g. But if the air were much thicker, then all the a’s would be larger, and 
maybe it would have taken Galileo a bit longer to come to his conclusions. 


What would you have thought, Galileo, 
If instead you dropped cows and did say, “Oh! 


3.4 Projectile motion 


To lessen the sound 
Of the moos from the ground, 
They should fall not through air, but through mayo!”!© & 


3.4 Projectile motion 


Consider a ball thrown through the air, not necessarily vertically. We will neglect 
air resistance in the following discussion. Things get a bit more complicated 
when this is included, as Exercise 3.53 demonstrates. 

Let x and y be the horizontal and vertical positions, respectively. The force 
in the x direction is Fy = 0, and the force in the y direction is F, = —mg. So 
F = ma gives 


¥=0, and j=-g. (3.35) 


Note that these two equations are “decoupled.” That is, there is no mention 
of y in the equation for x, and vice versa. The motions in the x and y direc- 
tions are therefore completely independent. The classic demonstration of the 
independence of the x and y motions is the following. Fire a bullet horizon- 
tally (or, preferably, just imagine firing a bullet horizontally), and at the same 
time drop a bullet from the height of the gun. Which bullet will hit the ground 
first? (Neglect air resistance, the curvature of the earth, etc.) The answer is that 
they will hit the ground at the same time, because the effect of gravity on the 
two y motions is exactly the same, independent of what is going on in the x 
direction. 

If the initial position and velocity are (X, Y) and (Vx, Vy), then we can easily 
integrate Eq. (3.35) to obtain 


x(t) =Vy, and jit) = Vy gt. (3.36) 


Integrating again gives 
l 2 
xt) =X+Vyt, and yt) =Y¥+V,t— z2 ; (3.37) 


These equations for the speeds and positions are all you need to solve a projectile 
problem. 


'6 It’s actually much more likely that Galileo obtained his “all objects fall at the same rate in a 
vacuum” result by rolling balls down planes than by dropping balls from the Tower of Pisa; see 
Adler and Coulter (1978). So I suppose this limerick is relevant only in the approximation of the 
proverbial spherical cow. 
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Example (Throwing a ball): 


(a) For a given initial speed, at what inclination angle should a ball be thrown 
so that it travels the maximum horizontal distance by the time it returns to the 
ground? Assume that the ground is horizontal, and that the ball is released from 
ground level. 

(b) What is the optimal angle if the ground is sloped upward at an angle 8 (or 
downward, if 6 is negative)? 


Solution: 


(a) Let the inclination angle be 0, and let the initial speed be V. Then the horizontal 
speed is always Vy = V cos, and the initial vertical speed is V, = V sin 8. 
The first thing we need to do is find the time ¢ in the air. We know that the 
vertical speed is zero at time ¢/2, because the ball is moving horizontally at 
the highest point. So the second of Eqs. (3.36) gives Vy = g(t/2). Therefore, 
t = 2Vy/g. 17 The first of Eqs. (3.37) tells us that the horizontal distance 
traveled is d = Vyt. Using t = 2V,/g in this gives 


4 2VxVy V?(2sin0cos0) V*sin26 
g g g 


(3.38) 


The sin20 factor has a maximum at 0 = 7/4. The maximum horizontal 


distance traveled is then dmax = y2 /g. 


REMARK: For@ = 71/4, you can show that the maximum height achieved is V? /4g. 
This is half the maximum height of V? /2g (as you can show) if the ball is thrown 
straight up. Note that any possible distance you might want to find in this problem 
must be proportional to V*/g, by dimensional analysis. The only question is what 
the numerical factor is. & 


(b) As in part (a), the first thing we need to do is find the time ¢ in the air. If the 


ground is sloped at an angle £, then the equation for the line of the ground is 
y = (tan B)x. The path of the ball is given in terms of t by 


1 
x= (V cos0)t, and y= (V sin0)t— 520”, (3.39) 


where 0 is the angle of the throw, as measured with respect to the horizontal 
(not the ground). We must solve for the ¢ that makes y = (tan £)x, because this 
gives the time when the path of the ball intersects the line of the ground. Using 
Eq. (3.39), we find that y = (tan 6)x when 


2V 
t = — (sin 0 — tan £ cos 0). (3.40) 
g 


17 Alternatively, the time of flight can be found from the second of Eqs. (3.37), which says that the 
ball returns to the ground when Vt = gt? /2. We will have to use this second strategy in part (b), 
where the trajectory is not symmetric around the maximum. 


3.4 Projectile motion 


(There is, of course, also the solution t = 0.) Plugging this into the expression 
for x in Eq. (3.39) gives 


ay? 2 
x = ——(sin@ cos@ — tan f cos“@). (3.41) 
& 


We must now maximize this value for x, which is equivalent to maximizing 
the distance along the slope. Setting the derivative with respect to 0 equal to 
zero, and using the double-angle formulas, sin 20 = 2 sin @ cos 0 and cos 20 = 
cos — sin?6, we find tan 6 = —cot20. This can be rewritten as tan B = 
— tan(x /2 — 20). Therefore, 8 = — (7/2 — 20), so we have 


p= ; (6 if 3) . 8.42) 


In other words, the throwing angle should bisect the angle between the ground 
and the vertical. 


REMARKS: 


1. For B ~ 2/2, we have 0 ~ z/2, as should be the case. For 6 = 0, we have 
0 = 77/4, as we found in part (a). And for B ~ —z/2, we have 6 ~ 0, which makes 
sense. 

2. Aquicker method of obtaining the time in Eq. (3.40) is the following. Consider the set 
of tilted axes parallel and perpendicular to the ground; let these be the x’ and y’ axes, 
respectively. The initial velocity in the y’ direction is V sin(8— £), and the acceleration 
in this direction is g cos £. The time in the air is twice the time it takes the ball to reach 
the maximum “height” above the ground (measured in the y’ direction), which occurs 
when the velocity in the y’ direction is instantaneously zero. The total time is therefore 
2V sin(@ — B)/(g cos £), which you can show is equivalent to the time in Eq. (3.40). 
Note that the g sin £ acceleration in the x’ direction is irrelevant in calculating this 
time. In the present example, using these tilted axes doesn’t save a huge amount of 
time, but in some situations (see Exercise 3.50) the tilted axes can save you a lot 
of grief. 

3. An interesting fact about the motion of the ball in the maximum-distance case is that 
the initial and final velocities are perpendicular to each other. The demonstration of 
this is the task of Problem 3.16. 

4. Substituting the value of 0 from Eq. (3.42) into Eq. (3.41), you can show (after a bit 
of algebra) that the maximum distance traveled along the tilted ground is 


y2 
pae a (8.43) 
cos B 1 + sin £ 


Solving for V, we have V? = g(d + d sin B). This can be interpreted as saying that 
the minimum speed at which a ball must be thrown in order to pass over a wall of 
height A, at a distance L away on level ground, is given by V? = g(VL? + h2 + h). 
This checks in the limits of h > 0 and L — 0. 

5. A compilation of many other projectile results can be found in Buckmaster 
(1985). & 
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Fig. 3.7 


Using F=ma 


Along with the bullet example mentioned above, another classic example of 
the independence of the x and y motions is the “hunter and monkey” problem. 
In it, a hunter aims an arrow (a toy one, of course) at a monkey hanging from 
a branch in a tree. The monkey, thinking he’s being clever, tries to avoid the 
arrow by letting go of the branch right when he sees the arrow released. The 
unfortunate consequence of this action is that he in fact will get hit, because 
gravity acts on both him and the arrow in the same way; they both fall the same 
distance relative to where they would have been if there were no gravity. And 
the monkey would get hit in such a case, because the arrow is initially aimed at 
him. You can work this out in Exercise 3.44, in a more peaceful setting involving 
fruit. 


If a monkey lets go of a tree, 

The arrow will hit him, you see, 
Because both heights are pared 

By a half gr? 

From what they would be with no g. 


3.5 Motion in a plane, polar coordinates 


When dealing with problems where the motion lies in a plane, it is often conve- 
nient to work with polar coordinates, r and 0. These are related to the Cartesian 
coordinates by (see Fig. 3.7) 


x=rcosd, and y=rsinð. (3.44) 


Depending on the problem, either Cartesian or polar coordinates are easier to 
use. It is usually clear from the setup which is better. For example, if the problem 
involves circular motion, then polar coordinates are a good bet. But to use polar 
coordinates, we need to know what Newton’s second law looks like when written 
in terms of them. Therefore, the goal of the present section is to determine what 
F = ma = mř looks like when written in terms of polar coordinates. 

At a given position r in the plane, the basis vectors in polar coordinates are 
f, which is a unit vector pointing in the radial direction; and 6, which is a unit 
vector pointing in the counterclockwise tangential direction. In polar coordinates, 
a general vector may be written as 


r=rr. (3.45) 


Since the goal of this section is to find r, we must, in view of Eq. (3.45), get 
a handle on the time derivative of f. And we’ll eventually need the derivative 
of a, too. In contrast with the fixed Cartesian basis vectors (X and y), the polar 
basis vectors (f and 0) change as a point moves around in the plane. We can 


3.5 Motion in a plane, polar coordinates 


find Ë and å in the following way. In terms of the Cartesian basis, Fig. 3.8 
shows that 


f = cos Ê + sinb y, 


n (3.46) 
0 = — sin 0 È + cos 6 ĵ. 
Taking the time derivative of these equations gives 
È = — sin 0 6% + cos0 69, 
F ; ; (3.47) 
0 = — cos 6 6x — sind 6y. 
Using Eq. (3.46), we arrive at the nice clean expressions, 
+= 60, and 6=—6F. (3.48) 


These relations are fairly evident if you look at what happens to the basis vectors 
as r moves a tiny distance in the tangential direction. Note that the basis vectors 
do not change as r moves in the radial direction. We can now start differentiating 
Eq. (3.45). One derivative gives (yes, the product rule works fine here) 


i= ifrit =i? +760. (3.49) 
This makes sense, because + is the velocity in the radial direction, and rË is the 


velocity in the tangential direction, often written as ræ (where œ = Ê is the 
angular velocity, or “angular frequency”).'* Differentiating Eq. (3.49) then gives 


i —7#+74+766 +766 +700 
= 7# +7-(66) +766 + röð + r6(—6P) 


(i — ró? + (rb +. 276)6. (3.50) 


Finally, equating mf with F = F,f + Fo gives the radial and tangential 
forces as 
is 32 
F,.=m(r—ré), 


a . (3.51) 
Fo = m(r6 + 276). 


(See Exercise 3.67 for a slightly different derivation of these equations.) Let’s 
look at each of the four terms on the right-hand sides of Eqs. (3.51). 


'8 For r6 to be the tangential velocity, we must measure @ in radians and not degrees. Then r0 is by 
definition the position along the circumference, so rÊ is the velocity along the circumference. 
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Using F=ma 


e The mř term is quite intuitive. For radial motion, it simply states that F = ma along the 


radial direction. 


e The mré term is also quite intuitive. For circular motion, it states that F = ma along 


the tangential direction, because ré is the second derivative of the distance r8 along the 
circumference. 


e The —mr6” term is also fairly clear. For circular motion, it says that the radial force 


is —m(r6)? /r= —mv* /r, which is the familiar force that causes the centripetal accel- 
eration, v? /r. See Problem 3.20 for an alternate (and quicker) derivation of this v? /r 
result. 


e The 2mi6 term isn’t so obvious. It is associated with the Coriolis force. There are 


various ways to look at this term. One is that it exists in order to keep angular momentum 


conserved. We’ll have a great deal to say about the Coriolis force in Chapter 10. 


Example (Circular pendulum): A mass hangs from a massless string of length 
£. Conditions have been set up so that the mass swings around in a horizontal circle, 
with the string making a constant angle 6 with the vertical (see Fig. 3.9). What is the 
angular frequency, w, of this motion? 


Solution: The mass travels in a circle, so the horizontal radial force must be 

„ = mÊ? = mro? (with r = £sin £), directed radially inward. The forces on 
the mass are the tension in the string, T, and gravity, mg (see Fig. 3.10). There is no 
acceleration in the vertical direction, so F = ma in the vertical and radial directions 
gives, respectively, 


T cos B — mg = 0, 


(3.52) 
T sin B = m(£ sin B)w. 


oe 
o= ee (3.53) 


If B ~ 90°, then w — oo, which makes sense. And if £ ~ 0, then œ ~ /g/€, which 
happens to equal the frequency of a plane pendulum of length £. The task of Exercise 


Solving for w gives 


3.60 is to explain why. 


3.6 Problems 


Section 3.2: Free-body diagrams 


3.1. Atwood’s machine « 


A massless pulley hangs from a fixed support. A massless string con- 
necting two masses, mı and m2, hangs over the pulley (see Fig. 3.11). 
Find the acceleration of the masses and the tension in the string. 


3.2. 


3.3. 


3.4. 


SDs 


3.6. 


3.7. 


3.8. 


3.6 Problems 


Double Atwood’s machine xx 


A double Atwood’s machine is shown in Fig. 3.12, with masses mı, m2, 
and m3. Find the accelerations of the masses. 


Infinite Atwood’s machine ««« 


Consider the infinite Atwood’s machine shown in Fig. 3.13. A string 
passes over each pulley, with one end attached to a mass and the other 
end attached to another pulley. All the masses are equal to m, and all 
the pulleys and strings are massless. The masses are held fixed and then 
simultaneously released. What is the acceleration of the top mass? (You 
may define this infinite system as follows. Consider it to be made of 
N pulleys, with a nonzero mass replacing what would have been the 
(N + 1)th pulley. Then take the limit as N —> oo.) 


Line of pulleys x 


N +2 equal masses hang from a system of pulleys, as shown in Fig. 3.14. 
What are the accelerations of all the masses? 


Ring of pulleys «x 

Consider the system of pulleys shown in Fig. 3.15. The string (which 
is a loop with no ends) hangs over N fixed pulleys that circle around 
the underside of a ring. N masses, mı, m2, ... , my, are attached to 


N pulleys that hang on the string. What are the accelerations of all the 
masses? 


Sliding down a plane «x 


(a) A block starts at rest and slides down a frictionless plane inclined 
at an angle 0. What should 6 be so that the block travels a given 
horizontal distance in the minimum amount of time? 

(b) Same question, but now let there be a coefficient of kinetic friction 
u between the block and the plane. 


Sliding sideways on a plane xxx 


A block is placed on a plane inclined at an angle 0. The coefficient of 
friction between the block and the plane is u = tan 6. The block is given 
a kick so that it initially moves with speed V horizontally along the plane 
(that is, in the direction perpendicular to the direction pointing straight 
down the plane). What is the speed of the block after a very long time? 


Moving plane xxx 


A block of mass m is held motionless on a frictionless plane of mass 
M and angle of inclination 0 (see Fig. 3.16). The plane rests on a fric- 
tionless horizontal surface. The block is released. What is the horizontal 
acceleration of the plane? 
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Section 3.3: Solving differential equations 


3.9. 


3.10. 


3.11; 


3.12. 


3.13. 


Exponential force x 


A particle of mass m is subject to a force F(t) = mage~”’. The initial 
position and speed are zero. Find x(t). 


—kx force xx 


A particle of mass m is subject to a force F(x) = —kx, with k > 0. The 
initial position is xo, and the initial speed is zero. Find x(t). 


Falling chain «x 


A chain with length £ is held stretched out on a frictionless horizontal 
table, with a length yo hanging down through a hole in the table. The 
chain is released. As a function of time, find the length that hangs down 
through the hole (don’t bother with ¢ after the chain loses contact with 
the table). Also, find the speed of the chain right when it loses contact 
with the table.!? 


Throwing a beach ball xxx 


A beach ball is thrown upward with initial speed vp. Assume that the 
drag force from the air is Fg = —mav. What is the speed of the ball, 
vf, right before it hits the ground? (An implicit equation is sufficient.) 
Does the ball spend more time or less time in the air than it would if it 
were thrown in vacuum? 


Balancing a pencil x«« 


Consider a pencil that stands upright on its tip and then falls over. Let’s 
idealize the pencil as a mass m sitting at the end of a massless rod of 
length ¢.°° 


(a) Assume that the pencil makes an initial (small) angle 69 with the 
vertical, and that its initial angular speed is wo. The angle will 
eventually become large, but while it is small (so that sin@ ~ 0), 
what is 8 as a function of time? 

(b) You might think that it should be possible (theoretically, at least) 
to make the pencil balance for an arbitrarily long time, by mak- 
ing the initial 0) and wo sufficiently small. However, it turns out 
that due to Heisenberg’s uncertainty principle (which puts a con- 
straint on how well we can know the position and momentum of 


19 Assume that the hole is actually a short frictionless tube bent into a gradual right angle, so that 


the chain’s horizontal momentum doesn’t cause it to overshoot the hole. For a description of what 


happens in a similar problem when this constraint is removed, see Calkin (1989). 


20 Tt actually involves only a trivial modification to do the problem correctly using the moment of 


inertia and the torque. But the point-mass version is quite sufficient for the present purposes. 


3.6 Problems 


a particle), it is impossible to balance the pencil for more than a 
certain amount of time. The point is that you can’t be sure that 
the pencil is initially both at the top and at rest. The goal of this 
problem is to be quantitative about this. The time limit is sure to 
surprise you. 

Without getting into quantum mechanics, let’s just say that 
the uncertainty principle says (up to factors of order 1) that 
AxAp > h, where h = 1.05 - 107°4 Js is Planck’s constant. 
The implications of this are somewhat vague, but we’ ll just take 
it to mean that the initial conditions satisfy (€0))(mlao) > A. 
With this constraint, your task is to find the maximum time it 
can take your @(f) solution in part (a) to become of order 1. In 
other words, determine (roughly) the maximum time the pencil 
can balance. Assume m = 0.01 kg, and £ = 0.1 m. 


Section 3.4: Projectile motion 


3.14. 


3.15. 


3.16. 


3.17. 


Maximum trajectory area « 


A ball is thrown at speed v from zero height on level ground. At 
what angle should it be thrown so that the area under the trajectory 
is maximum? 


Bouncing ball x 


A ball is thrown straight upward so that it reaches a height A. It falls 
down and bounces repeatedly. After each bounce, it returns to a certain 
fraction f of its previous height. Find the total distance traveled, and 
also the total time, before it comes to rest. What is its average speed? 


Perpendicular velocities «+ 


In the maximum-distance case in part (b) of the example in Section 
3.4, show that the initial and final velocities are perpendicular to each 
other.”! 


Throwing a ball from a cliff +» 


A ball is thrown with speed v from the edge of a cliff of height h. At 
what inclination angle should it be thrown so that it travels the maximum 
horizontal distance? What is this maximum distance? Assume that the 
ground below the cliff is horizontal. 


21 You can grind through this problem and explicitly find the final angle, but there’s a quicker way. 


This quicker method makes use of the conservation-of-energy statement that the difference in the 


squares of the initial and final speeds depends only on the change in height (the relation happens 


to be v? = ve = 2gh, but you don’t need this actual expression). Hint: Consider the reverse path. 
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3.18. 


3.19. 


Redirected motion «« 


A ball is dropped from rest at height A above level ground, and it bounces 
off a surface at height y (with no loss in speed). The surface is inclined 
so that the ball bounces off at an angle 0 with respect to the horizontal. 
What should y and 6 be so that the ball travels the maximum horizontal 
distance by the time it hits the ground? 


Maximum trajectory length x«« 


A ball is thrown at speed v from zero height on level ground. Let 09 be 
the angle at which the ball should be thrown so that the length of the 
trajectory is maximum. Show that 69 satisfies 


1+ sing 
sin 6o In (=55) =1. (3.54) 


You can show numerically that 6) ~ 56.5°. 


Section 3.5: Motion in a plane, polar coordinates 


3.20. 


S21 


3.22. 


Centripetal acceleration x 


Show that the magnitude of the acceleration of a particle moving in a cir- 
cle at constant speed is v? /r. Do this by drawing the position and velocity 
vectors at two nearby times, and then making use of some similar 
triangles. 


Vertical acceleration +» 


A bead rests at the top of a fixed frictionless hoop of radius R that lies in 
a vertical plane. The bead is given a tiny push so that it slides down and 
around the hoop. At what points on the hoop is the bead’s acceleration 
vertical??? What is this vertical acceleration? Note: We haven’t studied 
conservation of energy yet, but use the fact that the bead’s speed after it 
has fallen a height / is given by v = /2gh. 


Circling around a pole +x 


A mass, which is free to move on a horizontal frictionless surface, is 
attached to one end of a massless string that wraps partially around a 
frictionless vertical pole of radius r (see the top view in Fig. 3.17). You 
hold on to the other end of the string. At t = 0, the mass has speed vp in 
the tangential direction along the dotted circle of radius R shown. Your 
task is to pull on the string so that the mass keeps moving along the 


22 One such point is the bottom of the hoop. Another point is technically the top, where a ~ 0. Find 


the other two more interesting points (one on each side). 


3.23. 


3.24. 


3.7 Exercises 


dotted circle. You are required to do this in such a way that the string 
remains in contact with the pole at all times. (You will have to move 
your hand around the pole, of course.) What is the speed of the mass as 
a function of time? There is a special value of the time; what is it and 
why is it special? 


A force Fy = mre xx 


Consider a particle that feels an angular force only, of the form Fg = 
mr6. Show that 7 = /Alnr +B, where A and B are constants of 
integration, determined by the initial conditions. (There’s nothing all 
that physical about this force. It simply makes the F = ma equations 
solvable.) 


Free particle xxx 


Consider a free particle in a plane. With Cartesian coordinates, it is easy 
to use F = ma to show that the particle moves in a straight line. The 
task of this problem is to demonstrate this result in a much more cum- 
bersome way, using polar coordinates and Eq. (3.51). More precisely, 
show that cos@ = ro/r for a free particle, where ro is the radius at 
closest approach to the origin, and 0 is measured with respect to this 
radius. 


3.7 Exercises 


Section 3.2: Free-body diagrams 


3:235. 


3.26. 


3.27. 


A peculiar Atwood’s machine 

(a) The Atwood’s machine in Fig. 3.18 consists of n masses, m, m/2, 
m/4,...,m/2"—'. All the pulleys and strings are massless. Put a 
mass m/2”—! at the free end of the bottom string. What are the 
accelerations of all the masses? 

(b) Remove the mass m/2”—! (which was arbitrarily small, for very 
large n) that was attached in part (a). What are the accelerations 
of all the masses, now that you’ve removed this infinitesimal 
piece? 

Keeping the mass still x 

In the Atwood’s machine in Fig. 3.19, what should M be, in terms of 
mı and mz, so that it doesn’t move? 

Atwood’s 1 « 


Consider the Atwood’s machine in Fig. 3.20. It consists of three pulleys, 
a short piece of string connecting one mass to the bottom pulley, and 
a continuous long piece of string that wraps twice around the bottom 


75 


m/2 
m/4 ` 
m/2™! Q ? 
Fig. 3.18 
M 
m my 
Fig. 3.19 
m 
2m 


Fig. 3.20 


76 

m 

m 

Fig. 3.21 

m 3m 

2m 
Fig. 3.22 
m 
m 

Fig. 3.23 

& 
Fig. 3.24 


Using F=ma 


3.29. 


3.30. 


3.34. 


side of the bottom pulley, and once around the top side of the top two 
pulleys. The two masses are m and 2m. Assume that the parts of the string 
connecting the pulleys are essentially vertical. Find the accelerations of 
the masses. 


. Atwood’s2 x 


Consider the Atwood’s machine in Fig. 3.21, with two masses m. The 
axle of the bottom pulley has two string ends attached to it, as shown. 
Find the accelerations of the masses. 


Atwood’s3 x 
Consider the Atwood’s machine in Fig. 3.22, with masses m, 2m, and 
3m. Find the accelerations of the masses. 


Atwood’s 4 xx 


Consider the Atwood’s machine in Fig. 3.23 (and also on the front cover). 
If the number of pulleys that have string passing beneath them is M 
instead of the 3 shown, find the accelerations of the masses. 


. Atwood’s 5 xx 


Consider the Atwood’s machine in Fig. 3.24. The two shaded pulleys 
have mass m, and the string slides frictionlessly along all of the pulleys 
(so you don’t have to worry about any rotational motion). Find the 
accelerations of the two shaded pulleys. 


. Atwood’s 6 +x 


Consider the Atwood’s machine in Fig. 3.25. Find the accelerations of 
the masses. (This is a strange one.) 


. Accelerating plane +x 


A block of mass m rests on a plane inclined at an angle 0. The coefficient 
of static friction between the block and the plane is u. The plane is 
accelerated to the right with acceleration a (which may be negative); 
see Fig. 3.26. For what range of a does the block remain at rest with 
respect to the plane? In terms of u, there are two special values of 6; 
what are they, and why are they special? 


Accelerating cylinders «« 


Three identical cylinders are arranged in a triangle as shown in Fig. 3.27, 
with the bottom two lying on the ground. The ground and the cylinders 
are frictionless. You apply a constant horizontal force (directed to the 
right) on the left cylinder. Let a be the acceleration you give to the 
system. For what range of a will all three cylinders remain in contact 
with each other? 


3.7 Exercises 


3.35. Leaving the sphere «x 


A small mass rests on top of a fixed sphere of radius R. The coefficient 
of friction is u. The mass is given a sideways kick that produces an 
initial angular speed wo. Let 0 be the angle down from the top of the 
sphere. In terms of 0 and its derivatives, what is the tangential F = ma 
equation? Depending on the value of wo, the mass either comes to rest 
on the sphere or flies off it. If g = 10m/s*, R = 1m, and u = 1, 
write a program to numerically determine the minimum wọ for which 
the mass leaves the sphere. For this cutoff case, give the angle at which 
the mass loses contact, and describe (roughly) what the plot of 6 versus 
6 looks like. See Prior and Mele (2007) for the exact solution for Ô in 
terms of 0. 


Fig. 3.25 


3.36. Comparing the times ««« 


A block of mass m is projected up along the surface of a plane inclined 
at an angle 0. The initial speed is vo, and the coefficients of static and 
kinetic friction are both equal to u. The block reaches a highest point 
and then slides back down to the starting point. 


(a) Show that for the block to in fact slide back down instead of 
remaining at rest at the highest point, tan @ must be greater than u. 

(b) Assuming that tan@ > u, is the total up and down time longer 
or shorter than the total time it would take if the plane were 
frictionless? Or does the answer depend on what 6 and p are? 

(c) Assuming that tan 0 > u, show that for a given 0, the value of u 
that yields the minimum total time is given by u © (0.397) tan 0. 
(You will need to solve something numerically.) This minimum 
time turns out to be about 90% of the time it would take if the 
plane were frictionless. 


Fig. 3.26 


Section 3.3: Solving differential equations 


3.37. —bv? force « 


A particle of mass m is subject to a force F(v) = —bv?. The initial F 


position is zero, and the initial speed is vp. Find x(f). = 
3.38. kx force xx 
A particle of mass m is subject to a force F(x) = kx, with k > 0. The Fig. 3.27 


initial position is xo, and the initial speed is zero. Find x(t). 
Section 3.4: Projectile motion 


3.39. Equal distances x 


At what angle should a ball be thrown so that its maximum height equals 
the horizontal distance traveled? 
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3.40. 


3.41. 


3.43. 


3.44. 


Redirected motion « 


A ball is dropped from rest at height h. At height y, it bounces off a 
surface with no loss in speed. The surface is inclined at 45°, so the ball 
bounces off horizontally. What should y be so that the ball travels the 
maximum horizontal distance? What is this maximum distance? 


Throwing in the wind + 


A ball is thrown horizontally to the right, from the top of a vertical 
cliff of height A. A wind blows horizontally to the left, and assume 
(simplistically) that the effect of the wind is to provide a constant force 
to the left, equal in magnitude to the weight of the ball. How fast should 
the ball be thrown so that it lands at the foot of the cliff? 


. Throwing in the wind again « 


A ball is thrown eastward across level ground. A wind blows horizontally 
to the east, and assume (simplistically) that the effect of the wind is to 
provide a constant force to the east, equal in magnitude to the weight of 
the ball. At what angle 6 should the ball be thrown so that it travels the 
maximum horizontal distance? 


Increasing gravity « 


At t = 0 on the planet Gravitus Increasicus, a projectile is fired with 
speed vo at an angle 0 above the horizontal. This planet is a strange 
one, in that the acceleration due to gravity increases linearly with time, 
starting with a value of zero when the projectile is fired. In other words, 
g(t) = Bt, where £ is a given constant. What horizontal distance does 
the projectile travel? What should 6 be to maximize this distance? 


Newton’s apple « 


Newton is tired of apples falling on his head, so he decides to throw a 
rock at one of the larger and more formidable-looking apples positioned 
directly above his favorite sitting spot. Forgetting all about his work on 
gravitation, he aims the rock directly at the apple (see Fig. 3.28). To his 
surprise, the apple falls from the tree just as he releases the rock. Show, 
by calculating the rock’s height when it reaches the horizontal position 
of the apple, that the rock hits the apple.” 


. Colliding projectiles + 


Two balls are fired from ground level, a distance d apart. The right one is 
fired vertically with speed v (see Fig. 3.29). You wish to simultaneously 


23 This problem suggests a way in which William Tell and his son might survive their ordeal if they 


were plopped down, with no time to practice, on a planet with an unknown gravitational constant 


(provided that the son weren’t too short or that g weren’t too big). 


3.46. 


3.47. 


3.48. 


3.49. 


3.50. 


3.7 Exercises 


fire the left one at the appropriate velocity u so that it collides with the 
right ball when they reach their highest point. What should u be (give 
the horizontal and vertical components)? Given d, what should v be so 
that the speed u is minimum? 


Equal tilts « 


A plane tilts down at an angle 0 below the horizontal. On this plane, a 
projectile is fired with speed v at an angle 0 above the horizontal, as 
shown in Fig. 3.30. What is the distance, d, along the plane that the 
projectile travels? What is d in the limit © —> 90°? What 0 yields the 
maximum horizontal distance? 


Throwing at a wall « 


You throw a ball with speed vo at a vertical wall, a distance £ away. At 
what angle should you throw the ball so that it hits the wall as high as 
possible? Assume £ < vg /g (why?). 


Firing a cannon +» 


A cannon, when aimed vertically, is observed to fire a ball to a maximum 
height of L. Another ball is then fired with this same speed, but with the 
cannon aimed up along a plane of length L, inclined at an angle 0, as 
shown in Fig. 3.31. What should @ be so that the ball travels the largest 
horizontal distance, d, by the time it returns to the height of the top of 
the plane? 


Perpendicular and horizontal x» 


A plane is inclined at an angle 0 below the horizontal. A person throws 
a ball with speed vo from the surface of the plane. How far down along 
the plane does the ball hit, if the person throws the ball (a) perpendicular 
to the plane? (b) horizontally? 


Cart, ball, and plane +x 


A cart is held at rest on an inclined plane. A tube is positioned in the 
cart with its axis perpendicular to the plane. The cart is released, and 
at some later time a ball is fired from the tube. Will the ball even- 
tually land back in the tube? Hint: Choose your coordinate system 
wisely. 


. Perpendicular to plane xx 


A hill is sloped downward at an angle 8 with respect to the horizontal. 
A projectile is fired with an initial velocity perpendicular to the hill. 
When it eventually lands on the hill, let its velocity make an angle 0 
with respect to the horizontal. What is 0? What £ yields the minimum 
value of 0? What is this minimum 0? 


Fig. 3.30 
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3:32: 


3.53. 


Increasing distance x» 


(a) What is the maximum angle at which you can thrown a ball so 
that its distance from you never decreases during its flight? 

(b) This maximum angle equals the minimum 0 from Exercise 3.51. 
Explain why this is true. (It isn’t necessary to have done that 
exercise.) 


Projectile with drag +++ 


A ball is thrown with speed vo at an angle 6. Let the drag force from the 
air take the form Fg = —Bv = —mav. 

(a) Find x(t) and y(t). 

(b) Assume that the drag coefficient takes the value that makes the 
magnitude of the initial drag force equal to the weight of the 
ball. If your goal is to have x be as large as possible when 
y achieves its maximum value (you don’t care what this max- 
imum value actually is), show that 0 should satisfy sind = 
(/5 — 1)/2, which just happens to be the inverse of the golden 
ratio. 


Section 3.5: Motion in a plane, polar coordinates 


3.54. 


3.56. 


Fig. 3.32 


Low-orbit satellite 


What is the speed of a satellite whose orbit is just above the earth’s 
surface? Give the numerical value. 


. Weight at the equator x 


A person stands on a scale at the equator. If the earth somehow stopped 
spinning but kept its same shape, would the reading on the scale increase 
or decrease? By what fraction? 


Banking an airplane « 


An airplane flies at speed v in a horizontal circle of radius R. At what 
angle should the plane be banked so that you don’t feel like you are 
getting flung to the side in your seat? At this angle, what is your apparent 
weight (that is, what is the normal force from the seat)? 


. Rotating hoop « 


A bead lies on a frictionless hoop of radius R that rotates around a vertical 
diameter with constant angular frequency w, as shown in Fig. 3.32. What 
should w be so that the bead maintains the same position on the hoop, 
at an angle 0 with respect to the vertical? There is a special value of w; 
what is it, and why is it special? 


3.7 Exercises 81 


3.58. Swinging in circles « 


A large number of masses are attached by strings of various lengths to a 
point on the ceiling. All of the masses swing around in horizontal circles 
of various radii with the same frequency œw (one such circle is drawn 
in Fig. 3.33). If you take a picture (from the side) of the setup at an 
instant when all the masses lie in the plane of the paper (as shown for 
four masses), what does the “curve” formed by the masses look like? 


3.59. Swinging triangle « 


Two masses m are attached to two vertices of an equilateral triangle 
made of three massless rods of length £. A pivot is located at the third 


vertex, and the triangle is free to swing back and forth in a vertical plane, F'9. 3-33 
as shown in Fig. 3.34. If it is initially released from rest when one of the 
rods is vertical (as shown), find the tensions in all three rods (and specify pivot 
tension or compression), and also the accelerations of the masses, at the I 
instant right after it is released. 
3.60. Circular and plane pendulums « e] l m 


Consider the circular pendulum in the example in Section 3.5. Let the 

x-y plane be the horizontal plane of the circle. For small 6, what (approx- i 
imately) is the F, component of the force on the mass when it is at the ~ 
position (x,y) on the circle? 

Consider now a standard plane pendulum where the mass swings back 
and forth in the vertical plane containing the x axis. Let the maximum 
angle of the pendulum be the same small angle 6.7+ What (approxi- 
mately) is the Fy component of the force on the mass in terms of its x 
coordinate? 

Your two results should be the same, which means that the x motions 
of the two systems are the same, because when they are each at their 
maximum x value (£ sin £), they have the same x speed (zero), and we 
just showed that they always have the same x acceleration, independent 
of any y motion. So the frequencies of the two systems must be equal. 
(We’ll see in Section 4.2 that the frequency ofa plane pendulum is ./¢/€, 
in agreement with this observation.) 


3.61. Rolling wheel x 


If you paint a dot on the rim of a rolling wheel, the coordinates of the 
dot may be written as? 


(x,y) = (RO + Rsin 8, R + Rcos6). (3.55) 


24 It’s actually not required that this be the same angle, as long as it’s small. See Problem 3.10. 

25 This follows from writing (x,y) as (RO, R) + (R sin 0, R cos 0). The first term here is the position 
of the center of the wheel, and the second term is the position of the dot relative to the center, 
where @ is measured clockwise from the top. 


Fig. 3.35 


side point 


Fig. 3.36 
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The path of the dot is called a cycloid. Assume that the wheel is rolling 
at constant speed, which implies 0 = wt. 


(a) Find v(t) and a(t) of the dot. 

(b) At the instant the dot is at the top of the wheel, what is the radius 
of curvature of its path? The radius of curvature is defined to be 
the radius of the circle that matches up with the path locally at a 
given point. Hint: You know v and a. 


. Radius of curvature «« 


A projectile is fired at speed vg and angle 6. What is the radius of 
curvature (defined in Exercise 3.61) of the parabolic motion 


(a) at the top? 

(b) at the beginning? 

(c) At what angle should the projectile be fired so that the radius of 
curvature at the top equals half the maximum height, as shown in 
Fig. 3.35? 


. Driving on tilted ground ++ 


A driver encounters a large tilted parking lot, where the angle of the 
ground with respect to the horizontal is 9. The driver wishes to drive in 
acircle of radius R at constant speed. The coefficient of friction between 
the tires and the ground is u. 


(a) What is the largest speed the driver can have if he wants to avoid 
slipping? 

(b) What is the largest speed the driver can have, assuming he is 
concerned only with whether or not he slips at one of the “side” 
points on the circle (that is, halfway between the top and bottom 
points; see Fig. 3.36)? 


Car on a banked track «« 


A car travels around a circular banked track of radius R. The 
angle of the bank is 0, and the coefficient of friction between the 
tires and the track is u. For what range of speeds does the car 
not slip? 


. Horizontal acceleration +» 


A bead rests at the top of a fixed frictionless hoop of radius R that lies in 
a vertical plane. The bead is given a tiny push so that it slides down and 
around the hoop. At what points on the hoop is the bead’s acceleration 
horizontal? Note: We haven’t studied conservation of energy yet, but 
use the fact that the bead’s speed after it has fallen a height h is given 


by v = /2gh. 


3.66. 


3.67. 


3.68. 


3.69. 


3.70. 


3.7 Exercises 


Maximum horizontal force «« 


A bead rests at the top of a fixed frictionless hoop of radius R that 
lies in a vertical plane. The bead is given a tiny push so that it slides 
down and around the hoop. Consider the horizontal component of the 
force from the hoop on the bead. At what points on the hoop does this 
component achieve a local maximum or minimum? As in Exercise 3.65, 


use v = ,/2gh. 


Derivation of F, and Fg « 


In Cartesian coordinates, a general vector takes the form, 
r=xx+yy =r cosb  +rsinðĵ. (3.56) 


Derive Eq. (3.51) by taking two derivatives of this expression for r, and 
then using Eq. (3.46) to show that the result can be written in the form 
of Eq. (3.50). Note that unlike f and 6, the vectors x and y do not change 
with time. 


A force Fọ = 3mrO xx 


Consider a particle that feels an angular force only, of the form Fg = 
3mi'6. Show that > = +V/Ar4 +B, where A and B are constants of 
integration, determined by the initial conditions. Also, show that if the 
particle starts with Ô 4 0 andi > 0, it reaches r = œo ina finite time. 
(As in Problem 3.23, there’s nothing all that physical about this force. 
It simply makes the F = ma equations solvable.) 


A force Fg = 2mið ss 


Consider a particle that feels an angular force only, of the form Fọ = 
2mr6. Show that r = Ae? + Be~’, where A and B are constants of 
integration, determined by the initial conditions. (This force is actually 
a physical one. If you put a bead on a stick and swing the stick around 
one end at a constant rate, then the normal force from the stick happens 
to be 2mr6.?°) 


Stopping on a cone x» 


When viewed from the side, the cone in Fig. 3.37 subtends an angle 20 
at its tip. A block of mass m is connected to the tip by a massless string 
and moves in a horizontal circle of radius R around the surface. If the 


26 Depending on what is meant by “physical,” the forces in Exercise 3.68 and Problem 3.23 might 


also be considered to be physical. They correspond to putting a bead on a stick and swinging the 


stick around with angular speeds proportional to the bead’s r or 1/r, respectively (as is evident 
from the values of 6 in the solutions). This can also be deduced from t = dL/dt, but we won’t 


get to torque until Chapter 8. 
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initial speed is vo, and if the coefficient of kinetic friction between the 
block and the cone is u, how much time does it take the block to stop? 
(The answer is a little messy, but there are some limits you can check 
that will make you feel better about it.) 


Motorcycle circle x«* 


A motorcyclist wishes to travel in a circle of radius R on level ground. 
The coefficient of friction between the tires and the ground is u. The 
motorcycle starts at rest. What is the minimum distance it must travel in 
order to achieve its maximum allowable speed, that is, the speed above 
which it will skid out of the circular path???” Solve this in two ways: 


(a) Write down the radial and tangential F = ma equations (you'll 
want to write a as v du/dx), and then demand that the magnitude 
of the friction force equals umg in the optimal case. Take it from 
there. 

(b) Let the friction force make an angle 6(t) with respect to the tan- 
gential direction. Write down the radial and tangential F = ma 
equations (you’ll want to write a as dv/dt), and then take the 
derivative of the radial equation. Take it from there (this is the 
slick way). 


3.8 Solutions 


3.1. 


3.2, 


Atwood’s machine 


Let T be the tension in the string, and let a be the acceleration of mı (with upward 
taken to be positive). Then —a is the acceleration of mz. So the F = ma equations are 


T—mg=ma, and T—mg=mp2(—a). (3.57) 
Solving these two equations for a and T gives 


— 2 
a= a= m)g and fet. (3.58) 
m + mı m2 + my 


Remarks: As a double-check, a has the correct limits when m2 >> mı, mj > m, 
and m) = m; (namely a © g, a © —g, and a = 0, respectively). As far as T goes, if 
mı = m = m, then T = mg, as it should. And if m; < mz, then T © 2myg. This is 
correct, because it makes the net upward force on m; equal to m;g, which means that 
its acceleration is g upward, which is consistent with the fact that mz is essentially in 
free fall. & 


Double Atwood’s machine 


Let the tension in the lower string be T. Then the tension in the upper string is 2T (by 
balancing the forces on the bottom pulley). The three F = ma equations are therefore 
(with all the a’s taken to be positive upward) 


2T —mg=ma,, T — mg = ma, T- mg = m3a3. (3.59) 


27 This problem can be traced to an old edition of the Russian magazine Kvant. 


3.3. 


3.8 Solutions 


And conservation of string says that the acceleration of m is 


a --(5*). (3.60) 


This follows from the fact that the average position of m2 and m3 moves the same 
distance as the bottom pulley, which in turn moves the same distance (but in the 
opposite direction) as mı. We now have four equations in the four unknowns, a1, a2, 
a3, and T. With a little work, we can solve for the accelerations, 


4mzm3 — mı (m2 + m3) 


4 =8 > 
4mzm3 + mı (m2 + m3) 
4mm + 23 
nes mm3 + mı (m2 m3) (3.61) 
4mzm3 + mı (m2 + m3) 
4m2m3 + mı (m3 — 3m2) 
a3 = —g i 
4mım3 + mı (m2 + m3) 


REMARKS: There are many limits we can check here. A couple are: (1) If m2 = m3 = 
mı /2, then all the a’s are zero, which is correct. (2) If m3 is much less than both mı and 
mz, then aj = —g, a2 = —g, and a3 = 3g. To understand this 3g, convince yourself 
that if mı and mz go down by d, then m3 goes up by 3d. 

Note that a; can be written as 


4 4 
a =g mom mı / vema Hml. (3.62) 
m + m3 m + m3 


In view of the result for a in Eq. (3.58) in Problem 3.1, we see that as far as mı is 
concerned here, the m2, m3 pulley system acts just like a mass of 4m2m3/(m2 + m3). 
This has the expected properties of equaling zero when either m2 or m3 is zero, and 
equaling 2m if m) = m3 = m. & 


Infinite Atwood’s machine 


FIRST SOLUTION: If the strength of gravity on the earth were multiplied by a factor 
n, then the tension in all of the strings in the Atwood’s machine would likewise be 
multiplied by 7. This is true because the only way to produce a quantity with the units 
of tension (that is, force) is to multiply a mass by g. Conversely, if we put the Atwood’s 
machine on another planet and discover that all of the tensions are multiplied by n, 
then we know that the gravity there must be ng. 

Let the tension in the string above the first pulley be 7. Then the tension in the string 
above the second pulley is 7/2 (because the pulley is massless). Let the downward 
acceleration of the second pulley be a2. Then the second pulley effectively lives in a 
world where gravity has strength g — az. Consider the subsystem of all the pulleys 
except the top one. This infinite subsystem is identical to the original infinite system 
of all the pulleys. Therefore, by the arguments in the above paragraph, we must have 
4 = T/2 : (3.63) 

€ &-an 
which gives a2 = g/2. But az is also the acceleration of the top mass, so our answer 
is g/2. 


REMARKS: You can show that the relative acceleration of the second and third pulleys 
is g/4, and that of the third and fourth is g/8, etc. The acceleration of a mass far down 
in the system therefore equals g(1/2 + 1/4 + 1/8+.---) = g, which makes intuitive 
sense. 

Note that T = 0 also makes Eq. (3.63) true. But this corresponds to putting a mass 
of zero at the end of a finite pulley system (see the following solution). & 
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SECOND SOLUTION: Consider the following auxiliary problem. 


Problem: Two setups are shown in Fig. 3.38. The first contains a hanging mass m. 
The second contains two masses, mı and mz, hanging over a pulley. Let both supports 
have acceleration as downward. What should m be, in terms of mı and mz, so that the 
tension T in the top string is the same in both cases? 


Answer: In the first case, we have 
mg — T = mas. (3.64) 


In the second case, let a be the acceleration of mz relative to the support (with downward 
taken to be positive). Then we have 


T T 
mg-5 5 mı(as— a), and mg— in mz (as + a). (3.65) 
If we define g’ = g — ag, then we may write the above three equations as 
f t T / T 
mg =T, mg — z7 —mja, mg — J ma. (3.66) 


Eliminating a from the last two of these equations gives T = 4mım2g'/(mı + m2). 
Using this value of T in the first equation then gives 


4mm 
m= ——. (3.67) 
mı + m2 
Note that the value of as is irrelevant. We effectively have a fixed support in a world 
where the acceleration due to gravity is g’ (see Eq. (3.66)), and the desired m can’t 
depend on g’, by dimensional analysis. This auxiliary problem shows that for any as 
the two-mass system in the second case can equivalently be treated like a mass m, 
given by Eq. (3.67), as far as the upper string is concerned. W 


Now let’s look at our infinite Atwood’s machine. Assume that the system has N pulleys, 
where N — oo. Let the bottom mass be x. Then the auxiliary problem shows that the 
bottom two masses, m and x, can be treated like an effective mass f (x), where 
4mx 4x 

m+x 1+@&/m) ` 

We can then treat the combination of the mass f (x) and the next m as an effective 
mass f (f(x)). These iterations can be repeated, until we finally have a mass m and a 
mass fOD (x) hanging over the top pulley. So we must determine the behavior of 


FQ) = 


(3.68) 


f(x), as N — oo. This behavior is clear if we look at the plot of f (x) in Fig. 3.39. 


We see that x = 3m is a fixed point of f(x). That is, f (3m) = 3m. This plot shows 
that no matter what x we start with, the iterations approach 3m (unless we start at 
x = 0, in which case we remain there). These iterations are shown graphically by the 
directed lines in the plot. After reaching the value f(x) on the curve, the line moves 
horizontally to the x value off (x), and then vertically to the value f (f (x)) on the curve, 
and so on. Therefore, since f N(x) + 3m as N — oo, our infinite Atwood’s machine 
is equivalent to (as far as the top mass is concerned) just two masses, m and 3m. You 
can then quickly show that the acceleration of the top mass is g/2. Note that as far 
as the support is concerned, the whole apparatus is equivalent to a mass 3m. So 3mg 
is the upward force exerted by the support. 


Line of pulleys 

Let m be the common mass, and let T be the tension in the string. Let a be the 
acceleration of the end masses, and let a’ be the acceleration of the other N masses, 
with upward taken to be positive. These N accelerations are indeed all equal, because 
the same net force acts on all of the internal N masses, namely 27 upwards and mg 
downwards. The F = ma equations for the end and internal masses are, respectively, 


T—mg=ma, and 2T —mg=nnd'. (3.69) 


3.8 Solutions 


4m 


3m 


2m 


m 2m 3m 4m 5m 


Fig. 3.39 Problem 3.3, second solution 


3,5. 


But the string has fixed length. Therefore, 
N(2a')+a+a=0. (3.70) 


The “2” here comes from the fact that if one of the inside masses goes up by a distance 

d, then a length 2d of string has disappeared and must therefore appear somewhere 

else (namely, in the outer two segments). Eliminating T from Eq. (3.69) gives a’ = 

2a+ g. Combining this with Eq. (3.70) then gives 
Ng 


cranes mee and d = i (3.71) 


Remarks: For N = 1, we have a = —g/3 anda’ = g/3. For larger N, a increases 
in magnitude and approaches —g/2 as N — oo, and a’ decreases in magnitude and 
approaches zero as N —> oo. The signs of a and a’ in Eq. (3.71) may be surprising. 
You might think that if, say, NV = 100, then these 100 masses will “win” out over the 
two end masses, so that the N masses will fall. But this is not correct, because there 
are many (2N, in fact) tensions acting up on the NV masses. They do not behave like 
a mass Nm hanging below one pulley. In fact, two masses of m/2 on the ends will 
balance any number N of masses m in the interior (with the help of the upward forces 
from the top row of pulleys). & 


Ring of pulleys 
Let T be the tension in the string. Then F = ma for m; gives 


2T — mg = maj, (3.72) 


with upward taken to be positive. The a;’s are related by the fact that the string has 
fixed length, which implies that the sum of the displacements of all the masses is zero. 
In other words, 


Pee EEE ee | (3.73) 
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If we divide Eq. (3.72) by m;i, and then add the N such equations together and use 
Eq. (3.73), we find that T is given by 


1 1 1 
ar ( } paeng ) Ng = 0. (3.74) 
m m my 
Therefore, 
NM, 1 1 1 1 
T= E where = 1 Fe (3.75) 
M mı m2 MN 


is the so-called reduced mass of the system. Substituting this value for T into (3.72) 


gives 
NM 
aj =e( — i) ; (3.76) 


REMARK: A few special cases are: If all the masses are equal, then all the a; = 0. 
If mg = 0 (and all the others are not zero), then a; = (N — 1)g, and all the other 
ai = —g. If N — 1 of the masses are equal and much smaller than the remaining one, 
mx, then mg © —g, and all the other a; ~ g/(N — 1). & 


3.6. Sliding down a plane 


(a) The component of gravity along the plane is gsin@. The acceleration in the 
horizontal direction is therefore ay = (g sin 0) cos @. Our goal is to maximize 
axy. By taking the derivative, or by noting that sin 6 cos 0 = (sin 20)/2, we obtain 
6 = 7/4. The maximum a, is then g/2. 

(b) The normal force from the plane is mg cos, so the kinetic friction force is 
umg cos 0. The acceleration along the plane is therefore g(sin 0 — u cos @), and 
so the acceleration in the horizontal direction is ay = g(sin@ — u cos 0) cos@. 
We want to maximize this. Setting the derivative equal to zero gives 


(cos*@ — sin?@) + 2u sin cosé = 0 cos 20 + u sin 20 = 0 
1 
=> tan20 = ——. (3.77) 
u 


For u — 0, this gives the 2/4 result in part (a). For u —> œo, we obtain 0 ~ 7/2, 
which makes sense. 


REMARKS: The time to travel a horizontal distance d is obtained from axt? /2 = 
d. In part (a), this gives a minimum time of 2,/d/g. In part (b), you can show 


that the maximum ax is (g/2) (v 1+? -— n), which then leads to a minimum 


time of 2/d/g(y 1+? + p). This has the correct u — 0 limit, and it 
behaves like 2/2ud/g for u > œ. & 


3.7. Sliding sideways on a plane 


The normal force from the plane is N = mg cos 0. Therefore, the friction force on the 
block is uN = (tan 6)(mg cos 0) = mg sin 8. This force acts in the direction opposite 
to the motion. The block also feels the gravitational force of mg sin 6 pointing down 
the plane. 

Because the magnitudes of the friction force and the gravitational force along the 
plane are equal, the acceleration along the direction of motion equals the negative of 
the acceleration in the direction down the plane. Therefore, in a small increment of 
time, the speed that the block loses along its direction of motion exactly equals the 
speed that it gains in the direction down the plane. Letting v be the total speed of the 


3.8. 


3.8 Solutions 


block, and letting v, be the component of the velocity in the direction down the plane, 
we therefore have 


uty =C, (3.78) 


where C is a constant. C is given by its initial value, which is V + 0 = V. The final 
value of C is Vy + Vr = 2Vy (where Vy is the final speed of the block), because the 
block is essentially moving straight down the plane after a very long time. Therefore, 


(3.79) 


2V =V V; = —. 
f Ia 

Moving plane 

Let N be the normal force between the block and the plane. Note that we cannot assume 

that N = mg cos 0, because the plane recoils. We can see that N = mg cos @ is in fact 

incorrect, because in the limiting case where M = 0, we have no normal force at all. 

The various F = ma equations (vertical and horizontal for the block, and horizontal 
for the plane) are 


mg — Ncos@ = may, 
N sin = max, (3.80) 
N sin 0 = MAy, 


where we have chosen the positive directions for ay, ax, and A, to be downward, 
rightward, and leftward, respectively. There are four unknowns here: ax, ay, Ax, and 
N, so we need one more equation. This fourth equation is the constraint that the block 
remains in contact with the plane. The horizontal distance between the block and its 
starting point on the plane is (ax + Ay)? /2, and the vertical distance is at? /2. The 
ratio of these distances must equal tan @ if the block is to remain on the plane (imagine 
looking at things in the frame of the plane). Therefore, we must have 
ay 
ax + Ax 


Using Eq. (3.80) to solve for a,, ax, and A; in terms of N, and then plugging the results 
into Eq. (3.81), gives 


= tand. (3.81) 


g — cos ; 1 1 cos@\~! 
N- W = tan 0 N =g | sinf tan | — + — }] + : 
z 5in0 + 47 sind m M m 

(3.82) 


(In the limit M — ov, this reduces to N = mg cos 0, as it should.) Having found N, 
the third of Eqs. (3.80) gives Ax, which may be written as 


_ Nsin@ _ mgsin@cosé 
M M+msin76 ` 


(3.83) 


x 


REMARKS: 

1. For given M and m, you can show that the angle 6 that maximizes A, is tan 6) = 
/M/(M +m). IfM <« m, then 6 ~ 0; this makes sense, because the plane gets 
squeezed out very fast. If M >> m, then 9 ~ 7/4; this is consistent with the 7/4 
result from Problem 3.6(a). 

2. In the limit M « m, Eq. (3.83) gives Ay © g/ tan 0. This makes sense, because 
m falls essentially straight down with acceleration g, and the plane gets squeezed 
out to the left. 

3. In the limit M >> m, Eq. (3.83) gives Ay © g(m/M) sin 8 cos @. The correctness 
of this is more transparent if we instead look at ay = (M /m)Ax © g sin 0 cos0. 
Since the plane is essentially at rest in this limit, this value of a, implies that the 
acceleration of m along the plane equals a,/cos@ ~ gsin@, as expected. & 
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3.9. 


3.10. 


Exponential force 

F = ma gives ¥ = aoe. Integrating this with respect to time gives v(t) = 
—age—"'/b + A. Integrating again gives x(t) = age /b? + At + B. The initial condi- 
tion v(0) = 0 gives —ap/b+A = 0 A = ao/b. And the initial condition x(0) = 0 
gives ag /b* +B=0=>58B8= —ag/b?. Therefore, 


et t 1 
| mite z) (3.84) 


For tf + oo (more precisely, for bt —> 00), v approaches ao/b, and x approaches 
ao(t/b—1/b?). We see that the particle eventually lags a distance ap /b* behind another 
particle that starts at the same position but moves with the constant speed v = ag/b. 
For £ ~ 0 (more precisely, for bt œ~ 0), we can expand e~” in its Taylor series to 
obtain x(t) © aot? /2. This makes sense, because the exponential factor in the force is 
essentially equal to 1, so we essentially have a constant force with constant acceleration. 


—kx force 


This is simply a Hooke’s-law spring force, which we’ll see much more of in Chapter 4. 
F = ma gives —kx = mv dv/dx. Separating variables and integrating yields 


— kx dx = mv dv kx kx mv. (3.85) 
xo 0 2 2 


Solving for v = dx/dt and then separating variables and integrating again gives 


f dx Ta Ca (8.86) 
xo [xi = x2 D 0 m i i 


You can look up this integral, or you can solve it with a trig substitution. Letting 
x = xo cos 0 gives dx = —xọ sin 0 d0, and so we have 


8 . 
—x 0 d0 k k 
/ = Ht o= oe (3.87) 
0 xo sin @ m m 


From the definition of 8, the solution for x(t) is therefore 


x(t) = x9 cos (=) R (3.88) 
m 


We see that the particle oscillates back and forth sinusoidally. It completes a full 
oscillation when the argument of the cosine increases by 27. So the period of the 
motion is T = 27 ,/m/k, which interestingly is independent of xo. It increases with m 
and decreases with k, as expected. 


. Falling chain 


Let the density of the chain be p, and let y(t) be the length hanging down through the 
hole at time ¢. Then the total mass is p£, and the mass hanging below the hole is py. 
The net downward force on the chain is (py)g, so F = ma gives 

ld 


p = (peyy y= 7: (3.89) 


At this point, there are two ways we can proceed: 


First METHOD: Since we have a function whose second derivative is proportional to 
itself, a good bet for the solution is an exponential function. And indeed, a quick check 
shows that the solution is 

g 


y(t) = de“ +Be™, where a= ic (3.90) 


3.12. 


3.8 Solutions 


Taking the derivative of this to obtain (4), and using the given information that (0) = 
0, we find A = B. Using y(0) = yo, we then find 4 = B = yo/2. So the length that 
hangs below the hole is 


y(t) = A (e! +e) = yo cosh(at). (3.91) 
And the speed is 
y= = (ce — ee") = ay sinh (ot). (3.92) 


The time T that satisfies y(T) = £ is given by £ = yo cosh(«T). Using sinhx = 
v cosh” x — 1, we find that the speed of the chain right when it loses contact with the 


table is 
JT) = ayo sinh(@T) = a,/ l? — yg = /gt,/1— nå, (3.93) 


where yo = yo/£ is the initial fraction hanging below the hole. If no ~ 0, then the 
speed at time T is ./g@ (this quickly follows from conservation of energy, which is 
the subject of Chapter 5). Also, you can show that Eq. (3.91) implies that T goes to 
infinity logarithmically as no > 0. 


SECOND METHOD: Write ý as vdv/dy in Eq. (3.89), and then separate variables and 
integrate to obtain 


v dd 
/ vdv =a | yd => v= y y), (3.94) 
0 vo 


where œ = /g/€. Now write v as dy/dt and separate variables again to obtain 


f — =a fa 
vo y2 -y 0 


The integral on the left-hand side is cosh! (v/vo), so we arrive at y(t) = yo cosh (æt), 
in agreement with Eq. (3.91). The solution then proceeds as above. However, an easier 
way to obtain the final speed with this method is to simply use the result for v in 
Eq. (3.94). This tells us that the speed of the chain when it leaves the table (that is, 


when y = £) is v = œ, / 02 — y, in agreement with Eq. (3.93). 
Throwing a beach ball 


(3.95) 


On both the way up and the way down, the total force on the ball is 
F = —mg — mav. (3.96) 


On the way up, v is positive, so the drag force points downward, as it should. And on 
the way down, v is negative, so the drag force points upward. Our strategy for finding 
vf will be to produce two different expressions for the maximum height /, and then 
equate them. We’ll find these two expressions by considering the upward and then 
the downward motion of the ball. In doing so, we will need to write the acceleration 
of the ball as a = v dv/dy. For the upward motion, F = ma gives 


d h 0 d 
-mg —mav =m = f =- f HELAN 6.97) 
dy 0 v0 gtav 


where we have taken advantage of the fact that the speed of the ball at the top is zero. 
Writing v/(g + av) as [1 — g/(g + av)]/a, the integral yields 


ja Fie (1 2), (3.98) 
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Using F=ma 


Now consider the downward motion. Let vf be the final speed, which is a positive 
quantity. The final velocity is then the negative quantity, —ve. Using F = ma, we 


obtain 
0 = vdv 
dy = — : (3.99) 
h o gtav 
Performing the integration (or just replacing the vo in Eq. (3.98) with —vf) gives 
(Sea 32 in(1 ex), (3.100) 
a ga? g 


Equating the expressions for h in Eqs. (3.98) and (3.100) gives an implicit equation 
for ve in terms of vo, 


usage =e (=). (3.101) 
a g— ave 


Remarks: In the limit of small a (more precisely, in the limit avp/g « 1), we can 
use In(1 +x) = x —x?/2+--- to obtain approximate values for A in Eqs. (3.98) and 
(3.100). The results are, as expected, 


2 2 
Vv Ur 


xI x 
he 2g’ and h TA (3.102) 
We can also make approximations for large œ (or large avo/g). In this limit, the 
log term in Eq. (3.98) is negligible, so we obtain h © vo/æ. And Eq. (3.100) gives 
vf © g/a, because the argument of the log must be very small in order to give a very 
large negative number, which is needed to produce a positive / on the left-hand side. 
There is no way to relate vf and A in this limit, because the ball quickly reaches the 
terminal velocity of —g/a (which is the velocity that makes the net force equal to zero), 
independent of h. & 


Let’s now find the times it takes for the ball to go up and to go down. We’ll present 
two methods for doing this. 


First METHOD: Let T; be the time for the upward path. If we write the acceleration 
of the ball as a = dv/dt, then F = ma gives —mg — mav = mdv/dt. Separating 
variables and integrating yields 


Ti 0 dv 1 avo 
dt Ti In{ 14 ; (3.103) 
0 vo g + av a g 
In a similar manner, we find that the time T> for the downward path is 
1 
hsz G =), (3.104) 
a g€ 
Therefore, 


1 
Tah in( i (3.105) 
a 


g+ =) vo + ve 
g— avf g 
where we have used Eq. (3.101). This result is shorter than the time in vacuum (namely 


2vo/g) because vf < vo. 


SECOND METHOD: The very simple form of Eq. (3.105) suggests that there is a cleaner 
way of deriving it. And indeed, if we integrate m dv/dt = —mg — mav with respect to 
time on the way up, we obtain —vo = —gT; — ah (because f vdt = h). Likewise, if 
we integrate m dv/dt = —mg — mav with respect to time on the way down, we obtain 
—vf = —gT2+ah (because f vdt = —h). Adding these two results gives Eq. (3.105). 
Note that this procedure works only because the drag force is proportional to v. 


3.8 Solutions 


Remark: The fact that the time here is shorter than the time in vacuum isn’t obvious. 
On one hand, the ball doesn’t travel as high in air as it would in vacuum, so you might 
think that Tı + T2 < 2vo/g. But on the other hand, the ball moves slower in air on 
the way down, so you might think that T} + T2 > 2vo/g. It isn’t obvious which effect 
wins, without doing a calculation.*® For any a, you can use Eq. (3.103) to show that 
Tı < vo/g. But T is harder to get a handle on, because it is given in terms of vf. But in 
the limit of large œ, the ball quickly reaches terminal velocity, so we have T ~ h/ve. 
Using the results from the previous remark, this becomes To ~ (vo/a)/(g/a) = 
vo/g. Interestingly, this equals the downward (and upward) time for a ball thrown 
in vacuum. é& 


3.13. Balancing a pencil 


(a) The component of gravity in the tangential direction is mg sin 0 ~ mg@. There- 
fore, the tangential F = ma equation is mg@ = mé@, which may be written as 
6 = (g/€)0. The general solution to this equation is”’ 


O(t) = Ael + Be", where t = lg. (3.106) 
The constants A and B are found from the initial conditions, 
0 (0) = 6o A+B = 6, 
Å (0) = wo (A—B)/t = wp. (3.107) 


Solving for A and B, and then plugging the results into Eq. (3.106) gives 
1 1 
A(t) = 5 (+ aot) eT + 5 (60 — aot)e!/*, (3.108) 


(b) The constants A and B will turn out to be small (they will each be of order V/A). 
Therefore, by the time the positive exponential has increased enough to make 
0 of order 1, the negative exponential will have become negligible. We will 
therefore ignore this term from here on. In other words, 


1 
O(t) © 5 (0) + wot) e. (3.109) 


The goal is to keep @ small for as long as possible. Hence, we want to minimize 
the coefficient of the exponential, subject to the uncertainty-principle constraint, 
(€69)(m£a9) > h. This constraint gives wọ > h/(me769). Therefore, 


1 ht 
6(t) > : (+ + am) el, (3.110) 


Taking the derivative with respect to 69 to minimize the coefficient, we find that 
the minimum value occurs at 6) = ,/ht/mé2. Substituting this back into Eq. 


(3.110) gives 
hi 
A(t) > yin el, (3.111) 


Setting 0 ~ 1, and then solving for t gives (using tT = ./€/g) 


1/2 mn’ Og 
t< 7 —In eG (3.112) 
g 


28 For a similar setup where it again isn’t obvious (for good reason) which effect wins, see 
Exercise 3.36. 
29 TF you want, you can derive this by separating variables and integrating. The solution is essentially 


the same as in the second method presented in the solution to Problem 3.11. 
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With the given values, m = 0.01 kg and £ = 0.1 m, along with g = 10 m/s? and 
h = 1.06 - 10-34 Js, we obtain 


1 
t< z(01s) mO. 10°!) = 3.5s. (3.113) 


No matter how clever you are, and no matter how much money you spend on 
the newest cutting-edge pencil balancing equipment, you can never get a pencil 
to balance for more than about four seconds. 


REMARKS: 

1. The smallness of this answer is quite amazing. It is remarkable that a 
quantum effect on a macroscopic object can produce an everyday value 
for a time scale. Basically, the point is that the fast exponential growth of 
0 (which gives rise to the log in the final result for t) wins out over the 
smallness of A, and produces a result for t of order 1. When push comes to 
shove, exponential effects always win. 

2. The above value for t depends strongly on £ and g, through the ./@/g term. 
But the dependence on m, £, and g in the log term is very weak, because h 
is so small. If m is increased by a factor of 1000, for example, the result for 
t increases by only about 10%. This implies that any factors of order 1 that 
we neglected throughout this problem are completely irrelevant. They will 
appear in the argument of the log term, and will therefore have negligible 
effect. 

3. Note that dimensional analysis, which is generally a very powerful tool, 
won’t get you too far in this problem. The quantity ./@/g has dimensions 
of time, and the quantity n = m? g/h? is dimensionless (it is the only 
such quantity), so the balancing time must take the form, 


L 
ta [Ero (3.114) 
g 


where f is some function. If the leading term in f were a power (even, for 
example, a square root), then ¢ would essentially be infinite (¢ ~ 103° s ~ 
10?? years for the square root). But f in fact turns out to be a log (which 
you can’t know without solving the problem), which completely cancels 
out the smallness of h, reducing an essentially infinite time down to a few 
seconds. & 


3.14. Maximum trajectory area 


3.15. 


Let 0 be the angle at which the ball is thrown. Then the coordinates are given by 
x = (vcos@)t and y = (vsin6)t — gt? /2. The total time in the air is 2(v sin 0)/g, so 
the area under the trajectory, A = f y dx, is 


Xmax 2v sin 0/g 12 2v4 
f yax= f (vsin ĝ)t — = (vcos 6 dt) = <" sin? 6 cosd. 
0 0 2 3g? 
(3.115) 


Taking the derivative of this, we find that the maximum occurs when tan@ = V3, 
that is, when 6 = 60°. The maximum area is then Amax = /3v* /8g. Note that by 
dimensional analysis we know that the area, which has dimensions of distance squared, 
must be proportional to v*/g?. 


Bouncing ball 


The ball travels 2h during the first up-and-down journey. It travels 2hf during the 
second, then 2hf* during the third, and so on. Therefore, the total distance traveled is 


D=2(14 ft PtP (3.116) 


D 


3.16. 


3.17. 


3.8 Solutions 


The time it takes to fall down during the first up-and-down is obtained from 
h = gt?/2. Therefore, the time for the first up-and-down equals 2t = 2,/2h/g. 


Likewise, the time for the second up-and-down equals 2,/2(hf)/g. Each successive 
up-and-down time decreases by a factor of Whi , so the total time is 


2h 2h 1 
T=2 14 fe spe preg...) =2 i . 3.117 
T e ae: (3.117) 


The average speed is therefore 


T 14+ /f° 


D Neue (3.118) 


REMARK: The average speed forf ~ 1 is roughly half of the average speed for f ~ 0. 
This may seem counterintuitive, because in the f ~ 0 case the ball slows down far 
more quickly than in the f ~ 1 case. But the f ~ 0 case consists of essentially only 
one bounce, and the average speed for that one bounce is the largest of any bounce. 
Both D and T are smaller for f ~ 0 than for f ~ 1, but T is smaller by a larger 
factor. & 


Perpendicular velocities 
In the maximum-distance case, let v; be the initial speed of the ball, and let vf be the 


final speed right before it hits the plane (so vf = ,/ v? — 2gh, where h is the final height 
of the ball). Let the parabolic path be labeled P, and let the beginning and ending points 
be A and B, as shown in Fig. 3.40. 

Consider the question, “Given an initial speed vr, at what inclination angle should a 
ball be thrown down the plane from point B so that it travels the maximum distance?” 
The answer is that it should be thrown along the same path P, tracing out the path 
backward. This is certainly a possible physical trajectory (reversing time still leads to 
a solution to F = ma for projectile motion), and we claim that it does indeed yield the 
maximum distance. This can be seen in the following way. 

Assume (in search of a contradiction) that with initial speed vf, the maximum dis- 
tance down the plane is obtained via some other path P’ that lands farther down the 
plane, as shown in Fig. 3.40. Then if we decrease the initial speed vf by an appropriate 
amount, we can have the ball land at point A via some path P” (not shown, lest the 
figure get too cluttered). From the conservation-of-energy result, the final speed at A 
in this case is less than vi. But if we simply reverse the motion along path P”, we see 
that we can get from A to B by using an initial speed that is less than vi. So if we then 
increase the speed up to vi, we can hit a point above B on the plane via some other path 
P", contradicting the fact that B was the endpoint of the maximum-distance trajectory 
starting at A with initial speed vi. This contradiction shows that the maximum distance 
down the plane, starting at B with speed vf, must in fact be attained via the path P. 

Now, from the example in Section 3.4, we know that in the maximum-distance case, 
the throwing angle bisects the angle between the ground and the vertical. We therefore 
have the situation shown in Fig. 3.41, because the same path gives the maximum 
distance for both the upward and downward throws. Since 2a + 2y = 180°, we see 
that æ + y = 90°, as we wanted to show. 


Throwing a ball from a cliff 


Let the inclination angle be 6. Then the horizontal speed is vy = v cos 0, and the initial 
vertical speed is v, = vsin@. The time it takes for the ball to hit the ground is given 
by h + (vsin 8)t — gt? /2 = 0. Therefore, 


v . an) 2gh 
t= sinô + ysinð + |, where = —. (3.119) 
& 
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3.18. 


(The “—” solution for ¢ from the quadratic formula corresponds to the ball being thrown 
backward down through the cliff.) The horizontal distance traveled is d = (v cos 8)t, 


which gives 
2 
d= “cost (sino + ysin? +B). (3.120) 
g 


We want to maximize this function of 6. Taking the derivative, multiplying through 


by ,/sin?6 + 6, and setting the result equal to zero, gives 
(cos”@ — sin’),/ sin?6 + B = sin 0 (8 — (cos? — sin*6)). (3.121) 


Using cos? = 1 — sin?9, and then squaring and simplifying this equation, gives an 
optimal angle of 


1 1 
VZFB 24 2gh/v2 
Plugging this into Eq. (3.120), and simplifying, gives a maximum distance of 


v? wv 2gh 
dmx = VTF 8 = 14 E. (3.123) 
E g v 


ò 


sin Êmax = 


(3.122) 


Ifh = 0, then Omax = 7/4 and dmax = v? / g, in agreement with the example in Section 
3.4. If h > œ or v > 0, then Omax © 0, which makes sense. 


REMARK: Ifwe make use of conservation of energy (discussed in Chapter 5), it turns 
out that the final speed of the ball when it hits the ground is vf = yv? + 2gh. The 
maximum distance in Eq. (3.123) can therefore be written as (with vj = v being the 
initial speed) 
UjUE 
dmax = . (3.124) 
g 

Note that this is symmetric in v; and vf, as it must be, because we could imagine 
the trajectory running backward. Also, it equals zero if vj is zero, and it reduces to 
v?/g on level ground, as it should. We can also write the angle @ in Eq. (3.122) 
in terms of v¢ (instead of A). You can show that the result is tan = vj;/v¢. This 
implies that the initial and final velocities are perpendicular to each other, because 
running the trajectory backward interchanges v; and vp, which means that the product 
of the tangents of the two angles equals 1. This is basically the same result as in 
Problem 3.16. & 


Redirected motion 


FIRST SOLUTION: We will use the results of Problem 3.17, namely Eqs. (3.123) and 
(3.122), which say that an object projected from height y at speed v travels a maximum 
horizontal distance of 


vw 2 
Guage i, (3.125) 
g v 
and the optimal angle yielding this distance is 
1 
sin ð = ————————— (3.126) 


V2 +2gy/v? 
In the problem at hand, the object is dropped from height A, so the kinematic relation 
v? = v? + 2ad gives the speed at height y as v = „y2g(h — y). Plugging this into 
Eq. (3.125) tells us that the maximum horizontal distance, as a function of y, is 


dmax) = 2y h(h — y). (3.127) 


3.19. 


3.8 Solutions 


This is maximum when y = 0 (assuming we can’t have negative y), in which case the 
distance is dmax = 2h. Equation (3.126) gives the associated optimal angle as 0 = 45°. 


SECOND SOLUTION: Assume that the greatest distance dp is obtained when the surface 
is at y = yo. We will show that yo must be 0. We will do this by assuming yo 4 0 and 
explicitly constructing a situation that yields a greater distance. 

Let P be the point where the ball finally hits the ground after it bounces off the 
surface at height yo. Consider a second scenario where a ball is dropped from height 
h directly above P. The speed of this ball at P will be the same as the speed of the 
original ball at P. This follows from conservation of energy. (Or, since we haven’t 
covered energy yet, you can use the kinematic relation v? = v? + 2ad for the y speeds 
of the two balls; apply it in two steps for the first ball.) 

Now imagine putting a surface at P at the appropriate angle so that the second ball 
bounces off in the direction from which the first ball came. Then the second ball will 
travel backward along the parabolic trajectory of the first one. But this means that 
after the second ball gets to the location of the platform at yo (which we have now 
removed), it will cover more horizontal distance before it hits the ground. We have 
therefore constructed a setup in which the ball travels farther horizontally than in our 
proposed maximal case. So the optimal setup must have yọ = 0, in which case the 
example in Section 3.4 says that the optimal angle is 0 = 45°. If we want the ball to 
go even farther, we can simply dig a (wide enough) hole in the ground and have the 
ball bounce from the bottom of the hole. 


Maximum trajectory length 


Let @ be the angle at which the ball is thrown. Then the coordinates are given by 
x = (vcos6)t and y = (vsin@)t — gt? /2. The ball reaches its maximum height at 
t = vsin@/g, so the length of the trajectory is 


vsin0/g dx 2 dy 2 
kss =) a 
0 dt dt 


v sin d/g 
=) \ wcos8)? + (vsind — gt)? dt 
0 


v sin @/g gt 2 
=2v cose | 1+ (no = ) dt. (3.128) 
0 v cos 
Letting z = tan 0 — gt/v cos 0, we obtain 
2 2 26 0 
p= | Vi+22 dz. (3.129) 
E tan 6 


We can look up this integral, or we can derive it by making a z = sinha substitution. 
The result is 


2 2 26 1 tan@ 
ae, (z 1+22+In(z4 1 +2?) 
g 2 0 
ind+1 
ie (sine + cos?6 In (a). (3.130) 
g cos 0 


As an intermediate check, you can verify that L = 0 when @ = 0, and L = v? /g when 
0 = 90°. Taking the derivative of Eq. (3.130) to find the maximum, we obtain 
1+ sind ) 


0 = cos — 2 cos 0 sino in ( 
cos 


(3.131) 


2 ( cos 0 ) cos? + (1 + sin 0) sin 0 
+ cos*0 5 : 
cos+0 


i 1+sin@ 
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3.20. 


3.21. 


This reduces to 


(3.132) 


leant 
1 = singin (S), 


cos@ 


You can show numerically that the solution for 6 is 09 © 56.5°. 


Remark: A few possible trajectories are shown in Fig. 3.42. Using the standard result 
that 6 = 45° provides the maximum horizontal distance, it follows from the figure 
that the 6o that yields the maximum trajectory length must satisfy 0) > 45°. The exact 
angle, however, requires the above detailed calculation. & 


Centripetal acceleration 

The position and velocity vectors at two nearby times are shown in Fig. 3.43. Their 
differences, Ar = r2 — rj and Av = v2 — yj, are shown in Fig. 3.44. The angle 
between the v’s is the same as the angle between the r’s, because each v makes a right 
angle with the corresponding r. Therefore, the triangles in Fig. 3.44 are similar, so we 
have 


[Avi _ [Ari 


v F 


> (3.133) 
where r = |r| and v = |v|. Dividing through by At gives 
1 


v 


Av 
At 


1 
F 


Ar 
At 


ja] Ivl v? 
a l (3.134) 


v r r 


We have assumed that Aż is infinitesimal here, which allows us to get rid of the A’s 
in favor of instantaneous quantities. 
Vertical acceleration 
Let 6 be the angle down from the top of the hoop. The tangential acceleration is 
a, = g sin 0, and the radial acceleration is ar = v? /R = 2gh/R. But the height fallen 
is h = R — R cos 0, so we have 

2gR(1 — cos 0) 
~ R 


We want the total acceleration to be vertical, which means that we want the horizontal 
components of a; and a; in Fig. 3.45 to cancel. That is, a, cos 0 = a; sin 0. This gives 


ar 


= 2g(1 — cos 98). (3.135) 


(gsinf)cos0 = 2g(1—cos@)snd = > sind=0 or cos = 2/3. (3.136) 


The sin @ = 0 root corresponds to the top and bottom of the hoop (0 = 0 and 0 = 7). 
So we want the cos6 = 2/3 => 6 ~ +48.2° root. The vertical acceleration is the 
sum of the vertical components of a; and ar, so 


ay = asin + a, cos@ = (gsin@) sin 0 + 2g(1 — cos@) cos 8 
= g(sin?0 + 2.cos@ — 2 cos?0). (3.137) 


Using cos @ = 2/3, and hence sin @ = /5/3, we have 


aay eae ee (3.138) 
ay =g kee Sa =g. : 


Remark: The reason for this nice answer is the following. If there is no horizontal 
acceleration, then the normal force from the hoop must have no horizontal component. 
In other words, N sin = 0. Therefore, either sin@ = 0 (which gives the top and 
bottom solutions of 9 = 0 and 6 = zr), or N = 0, which means that there is no normal 
force, so the bead feels only gravity, so it’s in freefall with a, = g. 


3.22. 


3.23. 


3.24. 


3.8 Solutions 


If we want, we can use this N = 0 requirement as the starting point for a second 
solution. Using the a, from above, the radial F = ma equation is mgcos@ — N = 
2mg(1 — cos@), with positive N defined to point outward. Setting N = 0 gives the 
desired result, cos 0 = 2/3. & 


Circling around a pole 


Let F be the tension in the string. At the mass, the angle between the string and the 
radius of the dotted circle is 9 = sin”! (r/R). In terms of 6, the radial and tangential 
F = ma equations are 


2 


F cos0 = T, and Fsin@ = mi. (3.139) 
Dividing these two equations gives tan@ = (Rw)/v*. Separating variables and 
integrating gives 
f dv _ tané fa 1 1 _ (tan0)t 
D v2 R Jo vo v R 
1 (tana)t\7! 
v(t) = : (3.140) 
v0 R 
The speed v becomes infinite when 
R 
t=T= . 3.141 
vo tan 0 ( ) 


This means that you can keep the mass moving in the desired circle only up to time 
T. After that, it is impossible. (Of course, it will become impossible, for all practical 
purposes, long before v becomes infinite.) The total distance, d = f v dt, is infinite, 
because this integral diverges (barely, like a log) as t approaches T. 


A force Fy = mið 
With the given force, Eq. (3.51) becomes 


0=m(—r6"), and miò =m(rÖ +276). (3.142) 


The second of these equations gives —+Ê = rö. Therefore, 


6 i: ; . D 
[pa fia Ind =—Inr+C 6=—, (3.143) 
a r 


where D = ef isa constant of integration, determined by the initial conditions. Sub- 
stituting this value of @ into the first of Eqs. (3.142), and then multiplying through by 


7 and integrating, gives 


DY i a 
r=r(?) => fraz) -dt z =P lnr+E. (3.144) 
7 


r 


Therefore, 
i: = VJAlnr+B, (3.145) 
where A = 2D? and B = 2E. 
Free particle 
For zero force, Eq. (3.51) gives 
=r’, and ré = —276. (3.146) 


Separating variables in the second equation and integrating yields 


7) 27 : . D 
f -dt = | dt ln = —2lnr + C 0 = —, (3.147) 
8 r r2 
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CG 30 


where D = e* is a constant of integration, determined by the initial conditions. 
Substituting this value of @ into the first of Eqs. (3.146), and then multiplying through 
by 7 and integrating, gives 


N D 2 . j j 72 D2 
r=r 5 —- ri dt D 3 dt > 7 HE. (3.148) 
r 


2 2r- 


We want 7 = 0 when r = ro, which implies that £ = D? jar: Therefore, 


r2 
=V- 4, (3.149) 
1 


where V = D/ro. Separating variables and integrating gives 


i dt 
f zg [ra Jr R= Vt r=" + (Vt)? , 


(3.150) 


where the constant of integration is zero, because we have chosen t = 0 to correspond 
to r = ro. Plugging this value for r into the Ê = D/r? = Wo /r? result in Eq. (3.147) 
gives 


Vro dt = Vt ro 
d0 = 5 7 6 = tan => cos? = ——_.. 
ro + (Vt) ro r2 + (Vt)? 
(3.151) 


Finally, combining this with the result for r in Eq. (3.150) gives cos@ = ro/r, as 
desired. 


30 The statement that 26 is constant is simply the statement of conservation of angular momentum, 
because 126 = r(r6) = rug, where vp is the tangential velocity. More on this in Chapters 7 
and 8. 


Chapter 4 
Oscillations 


In this chapter we will discuss oscillatory motion. The simplest examples of such 
motion are a swinging pendulum and a mass on a spring, but it is possible to make 
a system more complicated by introducing a damping force and/or an external 
driving force. We will study all of these cases. 

We are interested in oscillatory motion for two reasons. First, we study it 
because we can study it. This is one of the few systems in physics where we can 
solve the motion exactly. There’s nothing wrong with looking under the lamppost 
every now and then. Second, oscillatory motion is ubiquitous in nature, for 
reasons that will become clear in Section 5.2. If there was ever a type of physical 
system worthy of study, this is it. We’ll start off by doing some necessary math 
in Section 4.1. And then in Section 4.2 we’ll show how the math is applied to the 
physics. 


4.1 Linear differential equations 


A linear differential equation is one in which x and its time derivatives enter only 
through their first powers. An example is 3x + 7x + x = 0. An example of a 
nonlinear differential equation is 3% + 74? + x = 0. If the right-hand side of the 
equation is zero, then we use the term homogeneous differential equation. If the 
right-hand side is some function of ź, as in the case of 3x — 4x = 97? — 5, then 
we use the term inhomogeneous differential equation. The goal of this chapter 
is to learn how to solve linear differential equations, both homogeneous and 
inhomogeneous. These come up again and again in physics, so we had better 
find a systematic method of solving them. 

The techniques that we will use are best learned through examples, so let’s 
solve a few differential equations, starting with some simple ones. Throughout 
this chapter, x is understood to be a function of t. Hence, a dot denotes time 
differentiation. 


Example 1 (x = ax): This is a very simple differential equation. There are two 
ways (at least) to solve it. 
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First method: Separate variables to obtain dx/x = adt, and then integrate to 
obtain In x = at +c, where c is a constant of integration. Then exponentiate to obtain 


x = Ae", (4.1) 


where A = e°. A is determined by the value of x at, say, t = 0. 


Second method: Guess an exponential solution, that is, one of the form x = 
Ae“, Substitution into x = ax immediately gives œ = a. Therefore, the solution is 
x = Ae“, Note that we can’t solve for A, due to the fact that our differential equation 
is homogeneous and linear in x (translation: A cancels out). A is determined by the 
initial condition. 

This method may seem a bit silly, and somewhat cheap. But as we will see below, 
guessing these exponential functions (or sums of them) is actually the most general 
thing we can try, so the method is indeed quite general. 


Remark: Using this method, you might be concerned that although we have found one 
solution, we might have missed another one. But the general theory of differential equations 
says that a first-order linear equation has only one independent solution (we'll just accept 
this fact here). So if we find one solution, then we know that we’ve found the whole 
thing. & 


Example 2 (x = ax): Ifa is negative, then we’ll see that this equation describes 
the oscillatory motion of, say, a spring. If a is positive, then it describes exponentially 
growing or decaying motion. There are two ways (at least) to solve this equation. 


First method: We can use the separation-of-variables method from Section 3.3 
here, because our system is one in which the force depends only on the position x. But 
this method is rather cumbersome, as you found if you did Problem 3.10 or Exercise 
3.38. It will certainly work, but in the case where our equation is linear in x, there is 
a much simpler method: 


Second method: As in the first example above, we can guess a solution of the 
form x(t) = Ae“ and then find out what œ must be. Again, we can’t solve for A, 
because it cancels out. Plugging de“ into ¥ = ax gives a = -+./a. We have therefore 
found two solutions. The most general solution is an arbitrary linear combination of 
these, 


x(t) = Aev! 4 Be~ V4", (4.2) 


which you can quickly check does indeed work. A and B are determined by the initial 
conditions. As in the first example above, you might be concerned that although 
we have found two solutions to the equation, we might have missed others. But the 
general theory of differential equations says that our second-order linear equation has 
only two independent solutions. Therefore, having found two independent solutions, 
we know that we’ve found them all. 


VERY IMPORTANT REMARK: The fact that the sum of two different solutions is again a solu- 
tion to our equation is a monumentally important property of linear differential equations. 


4.1 Linear differential equations 


This property does not hold for nonlinear differential equations, for example x7 = bx, 
because the act of squaring after adding the two solutions produces a cross term which 
destroys the equality, as you should check (see Problem 4.1). This property is called the 
principle of superposition. That is, superposing two solutions yields another solution. In 
other words, linearity leads to superposition. This fact makes theories that are governed by 
linear equations much easier to deal with than those that are governed by nonlinear ones. 
General Relativity, for example, is based on nonlinear equations, and solutions to most 
General Relativity systems are extremely difficult to come by. 


For equations with one main condition 
(Those linear), you have permission 
To take your solutions, 

With firm resolutions, 

And add them in superposition. & 


Let’s say a little more about the solution in Eq. (4.2). If a is negative, then it is 
helpful to define a = —w*, where w is a real number. The solution then becomes 
x(t) = Aet”! + Bei“, Using e? = cos@ + isin@, this can be written in terms of 
trig functions, if desired. Various ways of writing the solution are: 


x(t) = Aeivt + Bet, 


x(t) = C cos ot + Dsinat, 
(4.3) 
x(t) = Ecos(wt + ¢1), 


x(t) = F sin(wt + p2). 


Depending on the specifics of a given system, one of the above forms will work better 
than the others. The various constants in these expressions are related to each other. 
For example, C = E cos ¢; and D = —E sin ġ1, which follow from the cosine sum 
formula. Note that there are two free parameters in each of the above expressions for 
x(t). These parameters are determined by the initial conditions (say, the position and 
velocity at t = 0). In contrast with these free parameters, the quantity w is determined 
by the particular physical system we’re dealing with. For example, we’ll see that for 
a spring, œ = Jk/m, where k is the spring constant. w is independent of the initial 
conditions. 

If a is positive, then let’s define a = a, where æ is a real number. The solution in 
Eq. (4.2) then becomes x(t) = Ae®! + Be~@". Using e? = coshé + sinh@, this can 
be written in terms of hyperbolic trig functions, if desired. Various ways of writing 
the solution are: 


x(t) = Ae% + Be, 
x(t) = Ccoshat + Dsinhat, 
(4.4) 
x(t) = Ecosh(at + ¢1), 
x(t) = F sinh(at + ¢2). 


Again, the various constants are related to each other. If you are unfamiliar with the 
hyperbolic trig functions, a few facts are listed in Appendix A. 
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Although the solution in Eq. (4.2) is completely correct for both signs of a, it’s 
generally more illuminating to write the negative-a solutions in either the trig forms 
tiol exponential form where the i’s are explicit. 


or the e7 

The usefulness of our method of guessing exponential solutions cannot be overem- 
phasized. It may seem somewhat restrictive, but it works. The examples in the 
remainder of this chapter should convince you of this. 


This is our method, essential, 

For equations we solve, differential. 
It gets the job done, 

And it’s even quite fun. 

We just try a routine exponential. 


Example 3 (« +2yx + ax = 0): This will be our last mathematical example, and 
then we’ll start doing some physics. As we’ll see later, this example pertains to a 
damped harmonic oscillator. We’ve put a factor of 2 in the coefficient of x here to 
make some later formulas look nicer. The force in this example (if we switch from 
math to physics for a moment) is —2yx — ax (times m), which depends on both v 
and x. Our methods of Section 3.3 therefore don’t apply; we’re not going to be able 
to use separation of variables here. This leaves us with only our method of guessing 
an exponential solution, Ae’. So let’s see what it gives us. Plugging x(t) = Ae“ into 
the given equation, and canceling the nonzero factor of Ae’, gives 


a? +2ya+a=0. (4.5) 


The solutions for œ are —y + yy? —a. Call these œ] and œ2. Then the general 
solution to our equation is 


x(t) = Ae”! + Be! 
=g” (4e'v y*-a 4 Be-tWy?—a ). (4.6) 


If y? —a < 0, then we can write this in terms of sines and cosines, so we have 
oscillatory motion that decreases in time due to the e~” factor (or it increases, if 
y < 0, but this is rarely physical). If y? — a > 0, then we have exponential motion. 
We’ll talk more about these different possibilities in Section 4.3. 

In the first two examples above, the solutions were fairly clear. But in the present 
case, you ’re not apt to look at the above solution and say, “Oh, of course. It’s obvious!” 
So our method of trying solutions of the form 4e“ isn’t looking so silly anymore. 


In general, if we have an nth order homogeneous linear differential equation, 


d"x d"—1x 
— F ni 


dx 
dt” dt”! eel dt =O. 2) 


4.2 Simple harmonic motion 


then our strategy is to guess an exponential solution, x(t) = Ae®’, and to then (in 
theory) solve the resulting nth order equation, œ” + cn-1”7! +- - -+ c1œ + co = 
0, for a, to obtain the solutions a,...,@,. The general solution for x(t) is then 
the superposition, 


x(t) = Aye™! + Ape! + -F Aye", (4.8) 


where the A; are determined by the initial conditions. In practice, however, we 
will rarely encounter differential equations of degree higher than 2. (Note: if 
some of the a; happen to be equal, then Eq. (4.8) is not valid, so a modification 
is needed. We will encounter such a situation in Section 4.3.) 


4.2 Simple harmonic motion 


Let’s now do some real live physical problems. We’ll start with simple harmonic 
motion. This is the motion undergone by a particle subject to a force F (x) = —kx. 
The classic system that undergoes simple harmonic motion is a mass attached 
to a massless spring, on a frictionless table (see Fig. 4.1). A typical spring has 
a force of the form F(x) = —kx, where x is the displacement from equilibrium 
(see Section 5.2 for the reason behind this). This is “Hooke’s law,” and it holds as 
long as the spring isn’t stretched or compressed too far. Eventually this expression 
breaks down for any real spring. But if we assume a —kx force, then F = ma 
gives —kx = mx, or 


| k 
¥+o°x=0, where w=,/—. (4.9) 
m 


This is simply the equation we studied in Example 2 in the previous section. 
From Eq. (4.3), the solution may be written as 


x(t) = Acos(wt + ¢). (4.10) 


This trig solution shows that the system oscillates back and forth forever in time. 
w is the angular frequency. If t increases by 277/w, then the argument of the 
cosine increases by 27, so the position and velocity are back to what they were. 
The period (the time for one complete cycle) is therefore T = 27 /w = 27 m/k. 
The frequency in cycles per second (hertz) is v = 1/T = w/2z. The constant 
A (or rather the absolute value of A, if A is negative) is the amplitude, that is, 
the maximum distance the mass gets from the origin. Note that the velocity as a 
function of time is v(t) = x(t) = —Aw sin (wt + @). 

The constants A and ¢ are determined by the initial conditions. If, for example, 
x(0) = 0 and x(0) = v, then we must have Acos¢@ = 0 and —Awsing = v. 
Hence, ¢ = 7/2, and so A = —v/w (or ¢ = —z/2 and A = v/a, but this 
leads to the same solution). Therefore, we have x(t) = —(v/w) cos(wt + 1/2). 
This looks a little nicer if we write it as x(t) = (v/w) sin(wf). It turns out that 


105 


106 


Fig. 4.2 


Oscillations 


if the facts you’re given are the initial position and velocity, x9 and vo, then the 
x(t) = Ccos wt + Dsin wt expression in Eq. (4.3) usually works best, because 
(as you can verify) it yields the nice clean results, C = x9 and D = vuo/w. 
Problem 4.3 gives another setup that involves initial conditions. 


Example (Simple pendulum): Another classic system that undergoes (approx- 
imately) simple harmonic motion is the simple pendulum, that is, a mass that hangs 
on a massless string and swings in a vertical plane. Let £ be the length of the string, 
and let 0 (f) be the angle the string makes with the vertical (see Fig. 4.2). Then the 
gravitational force on the mass in the tangential direction is —mg sin 0. So F = ma 
in the tangential direction gives 


—mg sin 0 = m(£6). (4.11) 


The tension in the string combines with the radial component of gravity to produce the 
radial acceleration, so the radial F = ma equation serves only to tell us the tension, 
which we won’t need here. 

We will now enter the realm of approximations and assume that the amplitude of 
the oscillations is small. Without this approximation, the problem cannot be solved 
in closed form. Assuming 8 is small, we can use sin 0 ~ 0 in Eq. (4.11) to obtain 


6+076=0, where w= fz. (4.12) 
Therefore, 
O(t) = Acos(wt + $), (4.13) 


where A and ¢ are determined by the initial conditions. So the pendulum undergoes 
simple harmonic motion with a frequency of ./g/€. The period is therefore T = 
2n/w = 27 ./€/g. The true motion is arbitrarily close to this, for sufficiently small 
amplitudes. Exercise 4.23 deals with the higher-order corrections to the motion in the 
case where the amplitude is not small. 


There will be many occasions throughout your physics education where you 
will plow through a calculation and then end up with a simple equation of the 
form 7 + w?z = 0, where œ? is a positive quantity that depends on various 
parameters in the problem. When you encounter such an equation, you should 
jump for joy, because without any more effort you can simply write down the 
answer: the solution for z must be of the form z(t) = A cos(wt + ¢). No matter 
how complicated the system looks at first glance, if you end up with an equation 
that looks like 7 + w*z = 0, then you know that the system undergoes simple 
harmonic motion with a frequency equal to the square root of the coefficient of z, 
no matter what that coefficient is. If you end up with Z + (zucchini)z = 0, then 
the frequency is w = vy zucchini (well, as long as the zucchini is positive and has 
the dimensions of inverse time squared). 


4.3 Damped harmonic motion 


4.3 Damped harmonic motion 


Consider a mass m attached to the end of a spring with spring constant k. Let 
the mass be subject to a drag force proportional to its velocity, Ff = —bv (the 
subscript f here stands for “friction”; we’ll save the letter d for “driving” in the 
next section); see Fig. 4.3. Why do we study this Ff = —bv damping force? Two 
reasons: First, it is linear in x, which will allow us to solve for the motion. And 
second, it is a perfectly realistic force; an object moving at a slow speed through 
a fluid generally experiences a drag force proportional to its velocity. Note that 
this Ff = —bv force is not the force that a mass would feel if it were placed on 
a table with friction. In that case the drag force would be (roughly) constant. 

Our goal in this section is to solve for the position as a function of time. The 
total force on the mass is F = —bx — kx. So F = mx gives 


¥+2ye+o*x =0, (4.14) 


where 2y = b/m, and œ? = k/m. This is conveniently the same equation we 
already solved in Example 3 in Section 4.1 (with a —> w°). Now, however, we 
have the physical restrictions that y > 0 and œ? > 0. Letting Q? = y? — w? for 
simplicity, we may write the solution in Eq. (4.6) as 


x(t) =e” (de + Be“), where Q= yy? — o. (4.15) 
There are three cases to consider. 
Case 1: Underdamping (Q? < 0) 


If Q? < 0, then y < œw. Since Q is imaginary, let us define the real number 
= /a? — y?, so that Q = ið. Equation (4.15) then gives 


Sr 


x(t) = ev! (Aei + Bei?) 
=e ”'C cos(ðt + $). (4.16) 


These two forms are equivalent. Using e}? = cos + isin@, the constants in 
Eq. (4.16) are related by A + B = C cos ġ and A — B = iC sin @. Note that in a 
physical problem, x(t) is real, so we must have A* = B, where the star denotes 
complex conjugation. The two constants A and B, or the two constants C and ¢, 
are determined by the initial conditions. 

Depending on the given problem, one of the expressions in Eq. (4.16) will 
inevitably work better than the other. Or perhaps one of the other forms in Eq. (4.3) 
(times e7’) will be the most useful one. The cosine form makes it apparent that 
the motion is harmonic motion whose amplitude decreases in time, due to the 
e7! factor. A plot of such motion is shown in Fig. 4.4.! The frequency of the 


l To be precise, the amplitude doesn’t decrease exactly like Ce~Y’, as Eq. (4.16) suggests, because 
Ce—Y‘ describes the envelope of the motion, and not the curve that passes through the extremes of 
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motion, © = yœ? — y?, is smaller than the natural frequency w of the undamped 
oscillator. 


Remarks: Ify is very small (more precisely, if y « w), then ® © w, which makes sense 
because we almost have an undamped oscillator. If y is very close to w, then © ~ 0. So the 
oscillations are very slow (more precisely, © < w). Of course, for very small & it’s hard to even 
tell that the oscillations exist, because they will damp out on a time scale of order 1/y ~ 1/a, 
which is short compared with the long time scale of the oscillations, 1/@. & 


Case 2: Overdamping (Q? > 0) 
If Q? > 0, then y > w. Q is real (and taken to be positive), so Eq. (4.15) gives 


x(t) = Ae VT 44. Be VTE (4.17) 


There is no oscillatory motion in this case; see Fig. 4.5. Since y > Q = 
Vy? — w”, both of the exponents are negative, so the motion goes to zero for 
large t. This had better be the case, because a real spring is certainly not going 
to have the motion fly off to infinity. If we had obtained a positive exponent 
somehow, we’d know we had made a mistake. 


Remarks: If y is just slightly larger than wœ, then Q ~ 0, so the two terms in (4.17) are 
roughly equal, and we essentially have exponential decay, according to e~”. If y > œ 
(that is, strong damping), then Q ~ y, so the first term in (4.17) dominates (it has the less 
negative exponent), and we essentially have exponential decay according to e~"”—®)", We can 
be somewhat quantitative about this by approximating Q2 as 


22/7 o? = yy1 a/y? x y(1 — w /2y”). (4.18) 


Hence, the exponential behavior goes like e-@/2Y Because y > a, this is slow decay (that 
is, slow compared with t ~ 1/w), which makes sense if the damping is very strong. The mass 
slowly creeps back to the origin, as in the case of a weak spring immersed in molasses. & 


Case 3: Critical damping (Q? = 0) 


If Q? = 0, then y = w. Equation (4.14) therefore becomes ¥ + 2yx + y2x = 0. 
In this special case, we have to be careful in solving our differential equation. The 
solution in Eq. (4.15) is not valid, because in the procedure leading to Eq. (4.6), 
the roots œı and a2 are equal (to —y), so we have really found only one solution, 
e_Y', We’ll just invoke here the result from the theory of differential equations 
that says that in this special case, the other solution is of the form te~”’. 


Remark: You should check explicitly that te~”’ solves the equation ¥ + 2yx+ yx = 0. Or if 
you want to, you can derive it in the spirit of Problem 4.2. In the more general case where there 
are n identical roots in the procedure leading to Eq. (4.8) (call them all œ), the n independent 
solutions to the differential equation are tke, for 0 < k < (n— 1). But more often than not, 
there are no repeated roots, so you don’t have to worry about this. & 


the motion. You can show that the amplitude in fact decreases like Ce~Y* cos(tan7 ! (y/@)). This 
is the expression for the curve that passes through the extremes; see Castro (1986). But for small 
damping (y «X w), this is essentially equal to Ce~”". And in any event, it is proportional to e7¥*. 


4.4 Driven (and damped) harmonic motion 


Our solution is therefore of the form, 
x(t) = e" (A + Bt). (4.19) 


The exponential factor eventually wins out over the Bt term, so the motion goes 
to zero for large ¢ (see Fig. 4.6). 

If we are given a spring with a fixed w, and if we look at the system for 
different values of y, then critical damping (when y = œ) is the case where 
the motion converges to zero in the quickest way (which is like e~®’). This is 
true because in the underdamped case (y < w), the envelope of the oscillatory 
motion goes like e~”’, which goes to zero slower than e~®’, because y < œ. 
And in the overdamped case (y > w), the dominant piece is the e~”~2)” term. 
And as you can verify, if y > œ then y — Q = y — yy? — œ? < a, so this 
motion also goes to zero slower than e~’. Critical damping is very important in 
many real systems, such as screen doors and shock absorbers, where the goal is 
to have the system head to zero (without overshooting and bouncing around) as 
fast as possible. 


4.4 Driven (and damped) harmonic motion 


Before we examine driven harmonic motion, we must learn how to solve a new 
type of differential equation. How can we solve something of the form 


ž + 2yvx + ax = Coe", (4.20) 


where y, a, wo, and Co are given quantities? This is an inhomogeneous differential 
equation, due to the term on the right-hand side. It’s not very physical, because 
the right-hand side is complex, but let’s not worry about that for now. Equations 
of this sort come up again and again, and fortunately there’s a straightforward 
(although sometimes messy) method for solving them. As usual, the method 
involves making a reasonable guess, plugging it in, and seeing what condition 
comes out. Since we have the e!° sitting on the right-hand side of Eq. (4.20), 
let’s guess a solution of the form x(t) = Ae, 4 will depend on wo, among 
other things, as we will see. Plugging this guess into Eq. (4.20) and canceling the 
nonzero factor of e/’, we obtain 


(—ag)A + 2y (iwo)A + ad = Co. (4.21) 


Solving for A, we find that our solution for x is 


C 
x(t) = — elo, (4.22) 
—w + 2iy@o +a 


Note the differences between this technique and the one in Example 3 in 
Section 4.1. In that example, the goal was to determine the a in x(t) = Ae. 


x(t) = e” (4+BÀ 


Fig. 4.6 
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And there was no way to solve for A; the initial conditions determined A. But in 
the present technique, the wo in x(t) = Ae!” is a given quantity, and the goal 
is to solve for A in terms of the given constants. Therefore, in the solution in 
Eq. (4.22), there are no free constants to be determined by the initial conditions. 
We’ ve found one particular solution, and we’re stuck with it. The term particular 
solution is used for Eq. (4.22). 

With no freedom to adjust the solution in Eq. (4.22), how can we satisfy an 
arbitrary set of initial conditions? Fortunately, Eq. (4.22) does not represent the 
most general solution to Eq. (4.20). The most general solution is the sum of 
our particular solution in Eq. (4.22), plus the “homogeneous” solution we found 
in Eq. (4.6). This sum is certainly a solution, because the solution in Eq. (4.6) 
was explicitly constructed to yield zero when plugged into the left-hand side of 
Eq. (4.20). Therefore, tacking it on to our particular solution doesn’t change the 
equality in Eq. (4.20), because the left side is linear. The principle of superposition 
has saved the day. The complete solution to Eq. (4.20) is therefore 


x(t) = e” (4e'v y—a + Bev it) + 3 Co gk 
—@ + 2iywo +a 


(4.23) 


where A and B are determined by the initial conditions. 
With superposition in mind, it’s clear what the strategy should be if we have 
a slightly more general equation to solve, for example, 


X + 2yx + ax = Cie! 4+ Oe, (4.24) 


Simply solve the equation with only the first term on the right. Then solve the 
equation with only the second term on the right. Then add the two solutions. And 
then add on the homogeneous solution from Eq. (4.6). We are able to apply the 
principle of superposition because the left-hand side of Eq. (4.24) is linear. 

Finally, let’s look at the case where we have many such terms on the right-hand 
side, for example, 


N 
¥+2yk+ax => Cy", (4.25) 
n=1 
We need to solve N different equations, each with only one of the N terms on 
the right-hand side. Then we add up all the solutions, and then we add on the 
homogeneous solution from Eq. (4.6). If N is infinite, that’s fine; we just have 
to add up an infinite number of solutions. This is the principle of superposition 
at its best. 


REMARK: The previous paragraph, combined with a basic result from Fourier analysis, allows 
us to solve (in principle) any equation of the form 


ž+2yi+ax= fO). (4.26) 


4.4 Driven (and damped) harmonic motion 


Fourier analysis says that any (nice enough) function f (t) can be decomposed into its Fourier 
components, 


f= / glo)eda. (4.27) 


In this continuous sum, the functions g(@) (times dq) take the place of the coefficients Cn in 
Eq. (4.25). So if Sẹ (£) is the solution for x(t) when there is only the term e’’ on the right-hand 
side of Eq. (4.26) (that is, Se (t) is the solution given in Eq. (4.22), without the Co factor), then 
the principle of superposition tells us that the complete particular solution to Eq. (4.26) is 


x(t) = | g(w)Solt) do. (4.28) 


Finding the coefficients g (w) is the hard part (or rather, the messy part), but we won’t get into 
that here. We won’t do anything with Fourier analysis in this book, but it’s nevertheless good 
to know that it is possible to solve (4.26) for any function f(t). Most of the functions we’ll 
consider will be nice functions like cos wot, which has a very simple Fourier decomposition, 
namely cos wot = } (elo +e iol) yy 


Let’s now do a physical example. 


Example (Damped and driven spring): Consider a spring with spring constant 
k. A mass m at the end of the spring is subject to a drag force proportional to its 
velocity, Ff = —bv. The mass is also subject to a driving force, Fy(t) = Fy cos wgt 
(see Fig. 4.7). What is its position as a function of time? 


Solution: The force on the mass is F (x, x, t) = —bx—Ax+ F4 cos wgt. So F = ma 
gives 
¥4+2yk+ ox = F cos wat 
= i (cia + etal), (4.29) 
where 2y = b/m, wo = k/m, and F = F4/m. Note that there are two different 
frequencies here, w and wg, which need not have anything to do with each other. 


Equation (4.22), along with the principle of superposition, tells us that our particular 
solution is 


xp(t) = ( F/2 ja + ( F/2 jam (4.30) 


-o7 + 2iywd + o? -o7 — 2iywg + w? 


The complete solution is the sum of this particular solution and the homogeneous 
solution from Eq. (4.15). 

Let’s now eliminate the 7’s in Eq. (4.30) (which we had better be able to do, 
because x must be real), and write x in terms of sines and cosines. Getting the i’s 
out of the denominators (by multiplying both the numerator and the denominator by 
the complex conjugate of the denominator), and using et? = cos + i sin 8, we find, 


Fig. 4.7 
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after a little work, 


2 2 
ee) 2F yo 
cos wqt + 


Xp(t) = 7 7 
(o — wh) + 4204 (o — w3) + 4yo 


sin wat. 
(4.31) 


Remarks: If you want, you can solve Eq. (4.29) by taking the real part of the solution to 
Eq. (4.20) (with Co —> F), that is, the x(t) in Eq. (4.22). This is true because if we take 
the real part of Eq. (4.20), we obtain 


2 


d 
J2 (Re(x)) + 2y 


© (Ret) } a(Re(x)) = Re(Coe) 


= Cp cos(wof). (4.32) 


In other words, if x satisfies Eq. (4.20) with a Coe’ on the right-hand side, then Re(x) 
satisfies it with a Co cos(wof) on the right. At any rate, it’s clear that the real part of the 
solution in Eq. (4.22) (with Co — F) does indeed give the result in Eq. (4.31), because 
in Eq. (4.30) we simply took half of a quantity plus its complex conjugate, which is the 
real part. 

Ifyou don’t like using complex numbers, another way of solving Eq. (4.29) is to keep it in 
the form with the cos wat on the right, and guess a solution of the form A cos wgf+B sin wat, 
and then solve for A and B (this is the task of Problem 4.8). The result is Eq. (4.31). & 


We can now write Eq. (4.31) in a very simple form. If we define 


R= (2 = #2)” + yoa), (4.33) 


then we can rewrite Eq. (4.31) as 


2 2 
F [ao w 2ya, 
Xp(t) = — (2 COS Mgt + n d snow) 


F 
= R cos(wgt — ¢), (4.34) 
where @ (the phase) is defined by 


wo” — w 2 2 
“o ad sin = <V Od => tan — Yd 


cos = A ee 
eR R Pa 


(4.35) 


The triangle describing the angle @ is shown in Fig. 4.8. Note that 0 < @ < x, 
because the sin @ in Eq. (4.35) is greater than or equal to zero. See the end of this 
section for more discussion of @. 

Recalling the homogeneous solution in Eq. (4.15), we can write the complete 
solution to Eq. (4.29) as 


x(t) = a cos(wgt — p) + e7”! (4e% + Be) (4.36) 


4.4 Driven (and damped) harmonic motion 


The constants A and B are determined by the initial conditions. If there is any damping 
at all in the system (that is, if y > 0), then the homogeneous part of the solution 
goes to zero for large t, and we are left with only the particular solution. In other 
words, the system approaches a definite x(¢), namely xp (t), independent of the initial 
conditions. 


Resonance 
The amplitude of the motion given in Eq. (4.34) is proportional to 


1 1 
= (4.37) 


J (0? - 02) + Croa)? 


Given wg and y, this is maximum when w = wq. Given w and y, it is maximum 
when wg = yœ? — 2y2, as you can show in Exercise 4.29. But for weak damping 
(that is, y < œ, which is usually the case we are concerned with), this reduces 
to wq © w also. The term resonance is used to describe the situation where the 
amplitude of the oscillations is as large as possible. It is quite reasonable that 
this is achieved when the driving frequency equals the frequency of the spring. 
But what is the value of the phase ¢ at resonance? Using Eq. (4.35), we see that 
@ satisfies tan ġo ~ +00 when wq © w. Therefore, o = 2/2 (it is indeed 2/2, 
and not —z/2, because the sin ¢ in Eq. (4.35) is positive), and the motion of the 
particle lags the driving force by a quarter of a cycle at resonance. For example, 
when the particle moves rightward past the origin (which means it has a quarter 
of a cycle to go before it hits the maximum value of x), the force is already at its 
maximum. And when the particle makes it out to the maximum value of x, the 
force is already back to zero. 

The fact that the force is maximum when the particle is moving fastest makes 
sense from an energy point of view.” If you want the amplitude to become large, 
then you need to give the system as much energy as you can. That is, you must 
do as much work as possible on the system. And in order to do as much work 
as possible, you should have your force act over as large a distance as possible, 
which means that you should apply your force when the particle is moving fastest, 
that is, as it speeds past the origin. And similarly, you don’t want to waste your 
force when the particle is barely moving near the endpoints of its motion. In short, 
v is the derivative of x and therefore a quarter cycle ahead of x (which is a general 
property of a sinusoidal function, as you can show). Since we want the force to 
be in phase with v at resonance (by the above energy argument), we see that the 
force is also a quarter cycle ahead of x. 

Resonance has a large number of extremely important applications (both 
wanted and unwanted) in the real world. On the desirable side, resonance makes 


2 Energy is one of the topics of the next chapter, so you may want to come back and read this 
paragraph after reading that. 
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it possible to have a relaxing day at the beach at the Bay of Fundy, talking to 
a friend over your cell phone while pushing a child on a swing at low tide. On 
the undesirable side, your ride home on that newly discovered “washboard” dirt 
road will be annoyingly bumpy at a certain speed, and any attempt to take your 
mind off the discomfort by turning up the radio will result only in certain parts 
of your car rattling in perfect sync (well, actually 90° out of phase) with the bass 
line of your formerly favorite song.° 


The phase o 
Equation (4.35) gives the phase of the motion as 
2 
pigs. (4.38) 
a? — w$ 


where 0 < ¢ < z. Let’s look at a few cases for wa (not necessarily at resonance) 
and see what the resulting phase @ is. Using Eq. (4.38), we have: 


e Ifwg ~ 0 (or more precisely, if ywq K or = w4), then ø ~ 0. This means that the 
motion is in phase with the force. The mathematical reason for this is that if wg ~ 0, 
then both x and x are small, because they are proportional to oj and wg, respectively. 
Therefore, the first two terms in Eq. (4.29) are negligible, so we end up with x « cos wat. 
In other words, the phase is zero. 

The physical reason is that since there is essentially no acceleration, the net force is 
always essentially zero. This means that the driving force always essentially balances 
the spring force (that is, the two forces are 180° out of phase), because the damping 
force is negligible (since x x wg % 0). But the spring force is 180° out of phase with 
the motion (because of the minus sign in F = —kx). Therefore, the driving force is in 
phase with the motion. 

e Ifwg © a, then ¢ ~ 1/2. This is the case of resonance, discussed above. 

e Ifwg © œ (or more precisely, if yay K A — w*), then @ ~ x. The mathematical 
reason for this is that if wg * oo, then the ¥ term in Eq. (4.29) dominates, so we have 
X x cos wgt. Therefore, x is in phase with the force. But x is 180° out of phase with 
X (this is a general property of a sinusoidal function), so x is 180° out of phase with 
the force. 

The physical reason is that if wg œ% oo, then the mass hardly moves, because from 
Eq. (4.37) we see that the amplitude is proportional to 1 / 3. This amplitude then implies 
that the velocity is proportional to 1/wg. Therefore, both x and v are always small. But 
if x and v are always small, then the spring and damping forces can be ignored. So 
we basically have a mass that feels only one force, the driving force. But we already 
understand very well a situation where a mass is subject to only one oscillating force: 


a mass on a spring. The mass in our setup can’t tell if it’s being driven by an oscillating 


Some other examples of resonance that are often cited are in fact not actually examples of resonance, 
but rather of “negative damping” (also known as positive feedback). Musical instruments fall into 
this category, as does the well-known Tacoma Narrows bridge failure. For a detailed discussion of 
this issue, see Billah and Scanlan (1991) and also Green and Unruh (2006). 


4.5 Coupled oscillators 


driving force, or being pushed and pulled by an oscillating spring force. They both feel 
the same. Therefore, both phases must be the same. But in the spring case, the minus 
sign in F = —Ax tells us that the force is 180° out of phase with the motion. Hence, the 


same result holds in the wg © 00 case. 


Another special case for the phase occurs when y = 0 (no damping), for 
which we have tang = +0, depending on the sign of œ? — w4. So ¢ is either 
0 or x. The motion is therefore either exactly in phase or out of phase with the 
driving force, depending on which of w or wg is larger. 


4.5 Coupled oscillators 


In the previous sections, we dealt with only one function of time, x(t). What if 
we have two functions of time, say x(t) and y(t), that are related by a pair of 
“coupled” differential equations? For example, we might have 


2% + œ (5x — 3y) = 0, 
(4.39) 
2} + w* (5y — 3x) = 0. 
For now, let’s not worry about how these equations might arise. Let’s just try 
to solve them (we’ll do a physical example later in this section). We’ll assume 
w > 0 here, although this isn’t necessary. We’ll also assume there aren’t any 
damping or driving forces, although a few of the problems and exercises for this 
chapter deal with these additions. We call the above equations “coupled” because 
there are x’s and y’s in both of them, and it isn’t immediately obvious how to 
separate them to solve for x and y. There are two methods (at least) for solving 
these equations. 


First method: Sometimes it is easy, as in this case, to find certain linear combi- 
nations of the given equations for which nice things happen. If we take the sum, 
we find 


E+) +o +y) =0. (4.40) 


This equation involves x and y only in the combination of their sum, x + y. With 
z = x + y, Eq. (4.40) is just our old friend, 7 + w*z = 0. The solution is 


x +y =A, cos(wt+ ¢1), (4.41) 


where A, and ġı are determined by the initial conditions. We may also take the 
difference of Eqs. (4.39), which results in 


& —) + 4w*(x — y) = 0. (4.42) 


This equation involves x and y only in the combination of their difference, x — y. 
The solution is 


x — y = Á cos(2wt + h2). (4.43) 
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Taking the sum and difference of Eqs. (4.41) and (4.43), we find that x(t) and 
y(t) are given by 

x(t) = Bı cos(wt + $1) + B2 cos(2wt + ¢2), 

y(t) = Bı cos(wt + $1) — B2 cos(2wt + $2), 


(4.44) 


where the B;’s are half of the A;’s. The strategy of this solution was simply 
to fiddle around and try to form differential equations that involved only one 
combination of the variables. This allowed us to write down the familiar solution 
for these combinations, as we did in Eqs. (4.41) and (4.43). 

We’ve managed to solve our equations for x and y. However, it turns out that 
the more interesting thing we’ve done is produce the equations (4.41) and (4.43). 
The combinations (x + y) and (x — y) are called the normal coordinates of the 
system. These are the combinations that oscillate with one pure frequency. The 
motion of x and y will in general look complicated, and it may be difficult to tell 
that the motion is really made up of just the two frequencies in Eq. (4.44). But 
if you plot the values of (x + y) and (x — y) as time goes by, for any motion of 
the system, then you will find nice sinusoidal graphs, even if x and y are each 
behaving in a rather unpleasant manner. 


Second method: Inthe above method, it was fairly easy to guess which combi- 
nations of Eqs. (4.39) would produce equations involving only one combination 
of x and y. But surely there are problems in physics where the guessing isn’t so 
easy. What do we do then? Fortunately, there is a fail-safe method for solving for 
x and y. It proceeds as follows. 

In the spirit of Section 4.1, let’s try a solution of the form x = Ae’ and 
y = Bet% , which we will write, for convenience, as 


C) = (3) ae. (4.45) 


It isn’t obvious that there should exist solutions for x and y that have the same t 
dependence, but let’s try it and see what happens. We’ve explicitly put the 7 in 
the exponent, but there’s no loss of generality here. If œ happens to be imaginary, 
then the exponent is real. It’s personal preference whether or not you put the i in. 
Plugging our guess into Eqs. (4.39), and dividing through by e, we find 
2A(—a) + SAw* — 3Ba* = 0, 
(4.46) 
2B(—a) + SBw* — 34%? = 0, 


or equivalently, in matrix form, 


—2a7 + 5w? 30" A 0 
( 30" 2a? + su) (3) D (o) a 
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This homogeneous equation for A and B has a nontrivial solution (that is, one 
where A and B aren’t both 0) only if the matrix is not invertible. This is true 
because if it were invertible, then we could multiply through by the inverse to 
obtain (4, B) = (0,0). When is a matrix invertible? There is a straightforward 
(although tedious) method for finding the inverse. It involves taking cofactors, 
taking a transpose, and dividing by the determinant. The step that concerns 
us here is the division by the determinant, since this implies that the inverse 
exists if and only if the determinant is not zero. So we see that Eq. (4.47) has 
a nontrivial solution only if the determinant equals zero. Because we seek a 
nontrivial solution, we must therefore have 


0= —2a* + 5%? -3w 
a —30* —2a? + 5a? 
= 4af — 200707 + 160%. (4.48) 
This is a quadratic equation in a”, and the roots are a = +w anda = +2. 


We have therefore found four types of solutions. If a = +w, then we can plug 
this back into Eq. (4.47) to obtain A = B. (Both equations give this same result. 
This was essentially the point of setting the determinant equal to zero.) And if 
a = +2a, then Eq. (4.47) gives A = —B. (Again, the equations are redundant.) 
Note that we cannot solve specifically for A and B, but only for their ratio. Adding 
up our four solutions according to the principle of superposition, we see that x 
and y take the general form (written in vector form for the sake of simplicity and 


bookkeeping), 
x = Aj 1 e + Ao 1 e let 
1 1 
+ 43 (2) git Ay E :) ener (4.49) 


The four A; are determined by the initial conditions. We can rewrite Eq. (4.49) 
in a somewhat cleaner form. If the coordinates x and y describe the positions of 
particles, they must be real. Therefore, 4; and Az must be complex conjugates, 
and likewise for 43 and Aq. If we then define some @’s and B’s via A} = 4, = 
(By /2)e!9! and 4} = 43 = (B2/ 2)e'*2, we may rewrite our solution in the form, 
as you can verify, 


C) = Bi (i) cos(wt + ġ1) + Bo (1) cos(2wt + ¢2), (4.50) 


where the B; and ¢; are real (and are determined by the initial conditions). We 
have therefore reproduced the result in Eq. (4.44). 

It is clear from Eq. (4.50) that the combinations x + y and x — y (the normal 
coordinates) oscillate with the pure frequencies, w and 2m, respectively, because 
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the combination x + y makes the B2 terms disappear, and the combination x — y 
makes the Bı terms disappear. 

It is also clear that if By = 0, then x = y at all times, and they both oscillate 
with frequency w. And if Bı = 0, then x = —y at all times, and they both 
oscillate with frequency 2w. These two pure-frequency motions are called the 
normal modes. They are labeled by the vectors (1, 1) and (1, —1), respectively. 
In describing a normal mode, both the vector and the frequency should be stated. 
The significance of normal modes will become clear in the following example. 


Example (Two masses, three springs): Consider two masses m, connected to 
each other and to two walls by three springs, as shown in Fig. 4.9. The three springs 
have the same spring constant k. Find the most general solution for the positions 
of the masses as functions of time. What are the normal coordinates? What are the 
normal modes? 


Solution: Let x; (¢) and x2(f) be the positions of the left and right masses, respec- 
tively, relative to their equilibrium positions. Then the middle spring is stretched 
a distance x2 — xı compared with the stretch at equilibrium. Therefore, the net 
force on the left mass is —kx, + k(x2 — x1), and the net force on the right mass 
is —kxy — k(x2 — x1). It’s easy to make a mistake in the sign of the second term 
in these expressions, but you can check it by, say, looking at the force when x2 is 
very big. At any rate, the second term must have the opposite sign in the two expres- 
sions, by Newton’s third law. With these forces, F = ma on each mass gives, with 
o =k /m, 


2 


X1 +2%°x] — wx = 0, 


(4.51) 


2 


X2 + 20x — wx, = 0. 


These are rather friendly looking coupled equations, and we can see that the sum and 
difference are the useful combinations to take. The sum gives 


(iy + dp) + ©? (x1 +. x2) = 0, (4.52) 
and the difference gives 

(%1 — ¥2) + 3m? (xy — x2) = 0. (4.53) 
The solutions to these equations are the normal coordinates, 


xı +x = Á+ cos(wt + +), 


(4.54) 
x] —x2 = AL cos(V3at + p_). 
Taking the sum and difference of these normal coordinates, we have 
x1 (©) = B4 cos(wt + 64) + B- cos(V3at + o_), 
(4.55) 


x7(t) = By cos(w@t + b+) — BL cos(V3at + o_), 
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where the B’s are half of the A’s. Along with the #’s, they are determined by the initial 
conditions. 


REMARK: For practice, let’s also derive Eq. (4.55) by using the determinant method. 
Letting x; = Ae’ and x2 = Be'™ in Eq. (4.51), we see that for there to be a nontrivial 
solution for A and B, we must have 


0= —a? +20? —w 
>= -w —a? +20? 
= at — 4a + 307. (4.56) 
This is a quadratic equation in &?, and the roots are a = +w anda = +V3w. Ifa = +o, 


then Eq. (4.51) gives A = B. Ifa = +./3a, then Eq. (4.51) gives A = —B. The solutions 
for xı and x2 therefore take the general form 


v1) 1 iwt 1 —iwt 
(a) =a (i) ra (iJe 
+ A3 (2) ev diet +44 (4) e7 V3iot 


= B} (i) cos(wt + p+) + B- (2) cos(V3at + b_), (4.57) 


where the last line follows from the same substitutions that led to Eq. (4.50). This expression 
is equivalent to Eq. (4.55). & 


The normal modes are obtained by setting either B_ or B+ equal to zero in Eq. (4.55) 

or Eq. (4.57). Therefore, the normal modes are (1, 1) and (1, — 1). How do we visualize 

these? The mode (1, 1) oscillates with frequency œ. In this case (where B— = 0), we 

have xı (t) = x2(t) = B+ cos(wt + $+) at all times. So the masses simply oscillate 

back and forth in the same manner, as shown in Fig. 4.10. It is clear that such motion 

has frequency w, because as far as the masses are concerned, the middle spring is 

effectively not there, so each mass moves under the influence of only one spring, and 

therefore has frequency w. Fig. 4.10 
The mode (1, —1) oscillates with frequency /3w. In this case (where By = 0), 

we have x1 (t) = —x2 (t) = B_ cos(/3t + o_) at all times. So the masses oscillate 

back and forth with opposite displacements, as shown in Fig. 4.11. It is clear that this 

mode should have a frequency larger than that for the other mode, because the middle 

spring is stretched (or compressed), so the masses feel a larger force. But it takes a 

little thought to show that the frequency is /3w. * 


The normal mode (1, 1) above is associated with the normal coordinate x1 +x. 
They both involve the frequency w. However, this association is not due to the 
fact that the coefficients of both x; and x2 in this normal coordinate are equal to 1. Fig. 4.11 
4 Tf you want to obtain this /3w result without going through all of the above work, just note that 

the center of the middle spring doesn’t move. Therefore, it acts like two “half springs,” each with 
spring constant 2k (as you can verify). Hence, each mass is effectively attached to a “k” spring and 
a “2k” spring, yielding a total effective spring constant of 3k. Thus the /3. 
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Rather, it is due to the fact that the other normal mode, namely (x1, x2) « (1, —1), 
gives no contribution to the sum x; + x2. There are a few too many 1’s floating 
around in the above example, so it’s hard to see which results are meaningful and 
which results are coincidence. But the following example should clear things up. 
Let’s say we solved a problem using the determinant method, and we found the 
solution to be 


C) = B; G) cos(wıt + $1) + B2 a cos(@2t + %2). (4.58) 


Then 5x + y is the normal coordinate associated with the normal mode (3,2), 
which has frequency w1. (This is true because there is no cos(w2t + ¢2) depen- 
dence in the combination 5x +y.) And similarly, 2x — 3y is the normal coordinate 
associated with the normal mode (1, —5), which has frequency w2 (because there 
is no cos(@;t + 1) dependence in the combination 2x — 3y). 

Note the difference between the types of differential equations we solved in 
the previous chapter in Section 3.3, and the types we solved throughout this 
chapter. The former dealt with forces that did not have to be linear in x or x, but 
that had to depend on only x, or only x, or only ¢. The latter dealt with forces 
that could depend on all three of these quantities, but that had to be linear in 
x and x. 


4.6 Problems 


Section 4.1: Linear differential equations 


4.1. Superposition 


Let xı (£) and x(t) be solutions to ¥? = bx. Show that xı (t) + x2(t) is 
not a solution to this equation. 


4.2. A limiting case « 


Consider the equation x = ax. If a = 0, then the solution to x = 0 is 
simply x(t) = C + Dt. Show that in the limit a > 0, Eq. (4.2) reduces 
to this form. Note: a — 0 is a sloppy way of saying what we mean. 
What is the proper way to write this limit? 


Section 4.2: Simple harmonic motion 


4.3. Increasing the mass x« 


A mass m oscillates on a spring with spring constant k. The ampli- 
tude is d. At the moment (let this be £ = 0) when the mass is at 
position x = d/2 (and moving to the right), it collides and sticks to 
another mass m. The speed of the resulting mass 2m right after the col- 
lision is half the speed of the moving mass m right before the collision 


4.4. 


4.5. 


4.6 Problems 


(from momentum conservation, discussed in Chapter 5). What is the 
resulting x(t)? What is the amplitude of the new oscillation? 


Average tension +x 
Is the average (over time) tension in the string of a pendulum larger 
or smaller than mg? By how much? As usual, assume that the angular 
amplitude A is small. 


Walking east on a turntable xx 


A person walks at constant speed v eastward with respect to a turntable 
that rotates counterclockwise at constant frequency w. Find the general 
expression for the person’s coordinates with respect to the ground (with 
the x direction taken to be eastward). 


Section 4.3: Damped harmonic motion 


4.6. 


Maximum speed xx 


A mass on the end of a spring (with natural frequency w) is released 
from rest at position x9. The experiment is repeated, but now with the 
system immersed in a fluid that causes the motion to be overdamped 
(with damping coefficient y). Find the ratio of the maximum speed in 
the former case to that in the latter. What is the ratio in the limit of strong 
damping (y > œw)? In the limit of critical damping? 


Section 4.4: Driven (and damped) harmonic motion 


4.7. 


4.8. 


Exponential force x 

A particle of mass m is subject to a force F(t) = maọe™. The initial 
position and speed are both zero. Find x(t). (This problem was already 
given as Problem 3.9, but solve it here by guessing an exponential 
function, in the spirit of Section 4.4.) 


Driven oscillator « 


Derive Eq. (4.31) by guessing a solution of the form x(t) = A cos wat + 
B sin wat in Eq. (4.29). 


Section 4.5: Coupled oscillators 


4.9. 


4.10. 


Unequal masses +x 

Three identical springs and two masses, m and 2m, lie between two walls 
as shown in Fig. 4.12. Find the normal modes. 

Weakly coupled «x 


Three springs and two equal masses lie between two walls, as shown 
in Fig. 4.13. The spring constant, k, of the two outside springs is much 


m 


m 


m 
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4.11. 


4.12. 


larger than the spring constant, x, of the middle spring. Let x; and x2 be 
the positions of the left and right masses, respectively, relative to their 
equilibrium positions. If the initial positions are given by xı (0) = a and 
x2(0) = 0, and if both masses are released from rest, show that x; and 
x2 can be written as (assuming x < k) 


x1 (£) © acos ((w + €)t) cos(et), 
(4.59) 
x2(t) © asin ((w + €)t) sin(et), 


where œw = J/k/m and € = (k/2k)w. Explain qualitatively what the 
motion looks like. 


Driven mass on a circle xx 


Two identical masses m are constrained to move on a horizontal hoop. 
Two identical springs with spring constant k connect the masses and wrap 
around the hoop (see Fig. 4.14). One mass is subject to a driving force 
F4 cos wgt. Find the particular solution for the motion of the masses. 


Springs ona circle wxxw 


(a) Two identical masses m are constrained to move on a horizontal 
hoop. Two identical springs with spring constant k connect the 
masses and wrap around the hoop (see Fig. 4.15). Find the normal 
modes. 

(b) Three identical masses are constrained to move on a hoop. Three 
identical springs connect the masses and wrap around the hoop 
(see Fig. 4.16). Find the normal modes. 

(c) Now do the general case with N identical masses and N identical 
springs. 


4.7 Exercises 


Fig. 4.16 Section 4.1: Linear differential equations 


4.13. 


4.14. 


kx force « 


A particle of mass m is subject to a force F(x) = kx, with k > 0. What 
is the most general form of x(t)? If the particle starts out at x9, what is 
the one special value of the initial velocity for which the particle doesn’t 
eventually get far away from the origin? 


Rope on a pulley +x 


A rope with length Z and mass density o kg/m hangs over a massless 
pulley. Initially, the ends of the rope are a distance x9 above and below 
their average position. The rope is given an initial speed. If you want 


4.7 Exercises 


the rope to not eventually fall off the pulley, what should this initial 
speed be? (Don’t worry about the issue discussed in Calkin (1989).) 


Section 4.2: Simple harmonic motion 


4.15. Amplitude x 


Find the amplitude of the motion given by x(t) = C cos wt + D sin wt. 


4.16. Angled rails x 


4.17. 


4.18. 


4.19. 


4.20. 


Two particles of mass m are constrained to move along two horizontal 
frictionless rails that make an angle 26 with respect to each other. They 
are connected by a spring with spring constant k, whose relaxed length 
is at the position shown in Fig. 4.17. What is the frequency of oscilla- 
tions for the motion where the spring remains parallel to the position 
shown? 


Effective spring constant x 


(a) Two springs with spring constants kı and kz are connected in par- 
allel, as shown in Fig. 4.18. What is the effective spring constant, 
kefe? In other words, if the mass is displaced by x, find the keff 
for which the force equals F = —keff x. 

(b) Two springs with spring constants ky and kz are connected in 
series, as shown in Fig. 4.19. What is the effective spring 
constant, kere? 


Changing k +x 

Two springs each have spring constant k and equilibrium length £. They 
are both stretched a distance £ and attached to a mass m and two walls, 
as shown in Fig. 4.20. At a given instant, the right spring constant is 


somehow magically changed to 3% (the relaxed length remains £). What 
is the resulting x(t)? Take the initial position to be x = 0. 

Removing a spring «« 

The springs in Fig. 4.21 are at their equilibrium length. The mass oscil- 
lates along the line of the springs with amplitude d. At the moment (let 
this be t = 0) when the mass is at position x = d/2 (and moving to the 
right), the right spring is removed. What is the resulting x(t)? What is 
the amplitude of the new oscillation? 


Springs all over +» 


(a) A mass m is attached to two springs that have relaxed lengths of 
zero. The other ends of the springs are fixed at two points (see 
Fig. 4.22). The two spring constants are equal. The mass sits at 
its equilibrium position and is then given a kick in an arbitrary 


123 


Fig. 4.18 
| ky ky E 
Fig. 4.19 

21 m 21 

k k—3k 
Fig. 4.20 

k k 

m 

Fig. 4.21 

k m k 
WAQQLQHAVAON Cy 
Fig. 4.22 


124 Oscillations 


direction. Describe the resulting motion. (Ignore gravity, although 
you actually don’t need to.) 

(b) A mass m is attached to n springs that have relaxed lengths of 
zero. The other ends of the springs are fixed at various points in 
space (see Fig. 4.23). The spring constants are k1, k2, ..., kn. The 
mass sits at its equilibrium position and is then given a kick in an 
arbitrary direction. Describe the resulting motion. (Again, ignore 

Fig. 4.23 gravity, although you actually don’t need to.) 


4.21. Rising up ««« 

In Fig. 4.24, a mass hangs from a ceiling. A piece of paper is held 
Oo up to obscure three strings and two springs; all you see is two other 
sabasa strings protruding from behind the paper, as shown. How should the 
three strings and two springs be attached to each other and to the two 
? ' visible strings (different items can be attached only at their endpoints) 
i so that if you start with the system at its equilibrium position and then 

cut a certain one of the hidden strings, the mass will rise up?° 


4.22. Projectile on a spring xx» 


A projectile of mass m is fired from the origin at speed vo and angle 0. It 
Fig. 4.24 is attached to the origin by a spring with spring constant k and relaxed 
length zero. 

(a) Find x(t) and y(t). 

(b) Show that for small œ = Vk/m, the trajectory reduces to nor- 
mal projectile motion. And show that for large w, the trajectory 
reduces to simple harmonic motion, that is, oscillatory motion 
along a line (at least before the projectile smashes back into the 
ground). What are the more meaningful statements that should 
replace “small w” and “large w”? 

What value should w take so that the projectile hits the ground 
when it is moving straight downward? 


(c 


wm 


4.23. Corrections to the pendulum «xx 


(a) For small oscillations, the period of a pendulum is approximately 
T ~ 2x./€/g, independent of the amplitude, 9. For finite 
oscillations, use dt = dx/v to show that the exact expression 
for T is 


8e % do 
T= ; (4.60) 
g Jo /cos@ — cos 


5 Thanks to Paul Horowitz for this extremely cool problem. For more applications of the idea behind 
it, see Cohen and Horowitz (1991). 


4.7 Exercises 


(b) Find an approximation to this 7, up to second order in o, in the 
following way. Make use of the identity cos @ = 1 — 2 sin? (b /2) 
to write T in terms of sines (because it’s more convenient to work 
with quantities that go to zero as 0 — 0). Then make the change of 
variables, sinx = sin(0/2)/sin(09/2) (you’ll see why). Finally, 
expand your integrand in powers of 69, and perform the integrals 


to show that® 
£ 0% 
Trine (1+ i¢+-~ J. (4.61) 


Section 4.3: Damped harmonic motion 


4.24. 


4.25. 


4.26. 


4.27. 


4.28. 


Crossing the origin 


Show that an overdamped or critically damped oscillator can cross the 
origin at most once. 


Strong damping 

In the strong damping (y >> w) case discussed in the remark in the 
overdamping subsection, we saw that x(t) « e~®”"/2” for large t. Using 
the definitions of w and y, this can be written as x(t) « e~/>, where b 


is the coefficient of the damping force. By looking at the forces on the 
mass, explain why this makes sense. 


Maximum speed x 


A critically damped oscillator with natural frequency œw starts out at 
position x9 > 0. What is the maximum initial speed (directed toward 
the origin) it can have and not cross the origin? 


Another maximum speed x+ 


An overdamped oscillator with natural frequency w and damping coeffi- 
cient y starts out at position x9 > 0. What is the maximum initial speed 
(directed toward the origin) it can have and not cross the origin? 


Ratio of maxima «« 


A mass on the end of a spring is released from rest at position x9. The 
experiment is repeated, but now with the system immersed in a fluid 
that causes the motion to be critically damped. Show that the maximum 
speed of the mass in the first case is e times the maximum speed in the 
second case.’ 


é If you like this sort of thing, you can show that the next term in the parentheses is (11/ 3072)0¢ x 


But be careful, this fourth-order correction comes from two terms. 


7 The fact that the maximum speeds differ by a fixed numerical factor follows from dimensional 


analysis, which tells us that the maximum speed in the first case must be proportional to wxg. And 
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Section 4.4: Driven (and damped) harmonic motion 


4.29. 


4.30. 


Resonance 


Given w and y, show that the R in Eq. (4.33) is minimum when wq = 
yœ? — 2y? (unless this is imaginary, in which case the minimum occurs 
at wq = 0). 


No damping force x 


A particle of mass m is subject to a spring force, —kx, and also a driving 
force, Fg cos wat. But there is no damping force. Find the particular 
solution for x(t) by guessing x(t) = A cos wat + B sin wat. If you write 
this in the form C cos(wgt — @), where C > 0, what are C and @? Be 
careful about the phase (there are two cases to consider). 


Section 4.5: Coupled oscillators 


m m 


4.31. 


4.32. 


Fig. 4.26 


Fig. 4.27 


4.35. 


4.33. 


4.34. 


Springs and one wall «« 


Two identical springs and two identical masses are attached to a wall as 
shown in Fig. 4.25. Find the normal modes, and show that the frequencies 
can be written as k/m (V5 + 1)/2. This numerical factor is the golden 
ratio (and its inverse). 


Springs between walls xx 


Four identical springs and three identical masses lie between two walls 
(see Fig. 4.26). Find the normal modes. 


Beads on angled rails +» 


Two horizontal frictionless rails make an angle 6 with each other, as 
shown in Fig. 4.27. Each rail has a bead of mass m on it, and the beads 
are connected by a spring with spring constant k and relaxed length zero. 
Assume that one of the rails is positioned a tiny distance above the other, 
so that the beads can pass freely through the crossing. Find the normal 
modes. 


Coupled and damped «« 


The system in the example in Section 4.5 is modified by immersing it in 
a fluid so that both masses feel a damping force, Ff = —bv. Solve for 
x(t) and x2(t). Assume underdamping. 


Coupled and driven +» 


The system in the example in Section 4.5 is modified by subjecting the 
left mass to a driving force Fa cos(2œwt), and the right mass to a driving 


since y = w in the critical-damping case, the damping doesn’t introduce a new parameter, so the 


maximum speed has no choice but to again be proportional to wxo. But showing that the maximum 


speeds differ by the nice factor of e requires a calculation. 


4.8 Solutions 


force 2F4 cos(2wt), where w = ,/k/m. Find the particular solution for 
xı (t) and x2(t), and explain why your answer makes sense. 


4.8 Solutions 


4.1. 


Superposition 
The sum x, + x2 is a solution to ¥? = bx if 


2 2 
(Sr) ee, 


dt? 
< Gi +342)? = ba +22) 
< 5? + Qt. +3 = b +22). (4.62) 


But xf = bx, and i = bx2, by assumption. So we are left with the 2¥1¥2 term on the 
left-hand side, which destroys the equality. (Note that 2¥1¥2 can’t be zero, because if 
either ¥ı or X2 is identically zero, then either x; or x2 is also, so we didn’t really have 
a solution to begin with.) 


4.2. A limiting case 


4.3. 


The expression “a —> 0” is sloppy because a has units of inverse time squared, and the 
number 0 has no units. The proper statement is that Eq. (4.2) reduces to x(t) = C+ Dt 
when yat < 1, or equivalently when t < 1/,/a, which is now a comparison of 
quantities with the same units. The smaller a is, the larger ¢ can be. Therefore, if 
“a —> 0,” then ¢ can basically be anything. Assuming vat < 1, we can write 
etV4t ~ 1 + Jat, and Eq. (4.2) becomes 


x(t) ¥ AL +/at)+ BU — Jat) 
= (A + B) + Ja(A — B)t 
=C+Dt. (4.63) 


C is the initial position, and D is the speed of the particle. If these quantities are of 
order 1 in the units chosen, then if we solve for A and B, we see that they must be 
roughly negatives of each other, and both of order 1/./a. So if the speed and initial 
position are of order 1, then A and B actually diverge in the “a — 0” limit. Ifa is small 
but nonzero, then ¢ will eventually become large enough so that Vat < 1 won’t hold, 
in which case the linear form in Eq. (4.63) won’t be valid. 


Increasing the mass 

The first thing we must do is find the velocity of the mass right before the collision. The 
motion before the collision looks like x(t) = dcos(wt + @), where w = ,/k/m. The 
collision happens at t = 0 (although it actually doesn’t matter what time we plug in 
here), so we have d/2 = x(0) = d cos ġ, which gives ø = 77/3. The velocity right 
before the collision is therefore 


v(0) = (0) = —wd sin @ = —wd sin(+7/3) = =(V3/2)od. (4.64) 


We want the plus sign, because we are told that the mass is moving to the right. Finding 
the motion after the collision is now reduced to an initial conditions problem. We have 
a mass 2m on a spring with spring constant k, with initial position d/2 and initial 
velocity (/3/4)ad (half of the result above). In situations where we know the initial 
position and velocity, it turns out that the best form to use for x(¢) from the expressions 
in Eq. (4.3) is 


x(t) = Ccosa’t+ Dsina’t, (4.65) 
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because the initial position at £ = 0 is simply C, and the initial velocity at t = 0 
is w'D. The initial conditions are therefore easy to apply. We have put a prime on 
the frequency in Eq. (4.65) to remind us that it is different from the initial frequency, 
because the mass is now 2m. So we have w = \/k/2m = w/V2. The initial conditions 
therefore give 


x(0) = d/2 C=d/2, 


v0) = (V3/4od => w'D=(V3/4od => D=(V6/4)d. (4.66) 


C N j k 
sinw t, where œ =,/—. (4.67) 
2m 


To find the amplitude, we must calculate the maximum value of x(t). This is the task 
of Exercise 4.15, and the result is that the amplitude of the x(t) = C cos w't+ D sin œt 
motion is A = /C? + D2. So we have 


d2 6d2 5 
A= =,/-d. (4.68) 
4 ' 16 8 


This is smaller than the original amplitude d, because energy is lost to heat during the 
collision (but energy is one of the topics of the next chapter). 


Our solution for x(t) is therefore 


d , 
x(t) = JOSO 


4.4. Average tension 


4.5. 


Let the length of the pendulum be £. We know that the angle 6 depends on time 
according to 


0(t) = Acos(wt), (4.69) 


where w = ./g/¢. If T is the tension in the string, then the radial F = ma equation is 
T — mg cos 0 = m€67. Using Eq. (4.69), this becomes 


T = mg cos (A cos(wt)) + me( — wA sin(t))”. (4.70) 


Since A is small, we can use the small-angle approximation cos a ~ 1 — «7/2, which 
gives 


1 
T ~ mg (: = 4 cos?) + mlo? A? sin? (wt) 


1 
=mg+ mgA? (sin?on T3 coston) ; (4.71) 


where we have made use of œ? = g/l. The average value of both sin?0 and cos?0 
over one period is 1/2 (you can show this by doing the integrals, or you can just note 
that the averages are equal and they add up to 1), so the average value of T is 


mgA? 
Tavg = mg + £ ; (4.72) 


which is larger than mg, by mgA?/4. It makes sense that Tavg > mg, because the 
average value of the vertical component of T equals mg (because the pendulum has 
no net rise or fall over a long period of time), and there is some nonzero contribution 
to the magnitude of T from the horizontal component. 


Walking east on a turntable 


The velocity of the person with respect to the ground is the sum of vx and u, where 
u is the velocity (at the person’s position) of the turntable with respect to the ground. 


4.6. 


4.8 Solutions 


In terms of the angle 0 in Fig. 4.28, the velocity components with respect to the ground 
are 


x=v-—usiné, and ý= ucosð. (4.73) 
But u = ra. So we have, using r sin 0 = y andr cos 0 = x, 
x=v—oy, and ý= ox. (4.74) 


Taking the derivative of the first equation, and then plugging in ý from the second, 
gives ¥ = —wx. Therefore, x(t) = Acos(wt + $). The first equation then quickly 
gives y(t), and the result is that the general expression for the person’s position is 


(x,y) = (Acos(wt +), Asin(wt + 6) + v/a). (4.75) 


This describes a circle centered at the point (0,v/w). The constants A and @ are 
determined by the initial x and y values. You can show that 


A= Jx2+(y—v/o)?, and tang= w= vie (4.76) 
x 


0 


Remarks: It turns out that in the frame of the turntable, the person’s path is also a 
circle. This can be seen in the following way. Imagine a distant object (say, a star) 
located in the eastward direction. In the frame of the turntable, this star rotates clock- 
wise with frequency w. And in the frame of the turntable, the person’s velocity always 
points toward the star. Therefore, the person’s velocity rotates clockwise with fre- 
quency w. And since the magnitude of the velocity is constant, this means that the 
person travels clockwise in a circle in the frame of the turntable. From the usual 
expression v = ra, we see that this circle has a radius v/o. 

This result leads to another way of showing that the person’s path is a circle as 
viewed in the ground frame. In short, when the person’s clockwise circular motion at 
speed v with respect to the turntable is combined with the counterclockwise motion of 
the turntable with respect to the ground (with the same frequency œw, but in the opposite 
direction), the resulting motion of the person with respect to the ground is a circle with 
its center at the point (0, v/a). 

The situation is summarized in Fig. 4.29. From the above result (that the path 
is a circle in the turntable frame), we may characterize the person’s motion in the 
turntable frame by imagining her riding on a merry-go-round that rotates clockwise 
with frequency w with respect to the turntable. This merry-go-round is shown at five 
different times in the figure. The effect of the merry-go-round’s clockwise rotational 
motion is to cancel the counterclockwise rotation of the turntable, so that the merry- 
go-round ends up not rotating at all with respect to the ground. Therefore, if the person 
(the dot in the figure) starts at the top of the merry-go-round (which she in fact does, 
because this corresponds to walking eastward with respect to the turntable), then she 
always remains at the top. She therefore travels in a circle that is simply shifted upward 
by the vertical radius of the merry-go-round (the dotted lines shown, which have length 
v/q@), relative to the center of the turntable. This agrees with the original result. You 
can also see from the figure how the values of A and ¢ in Eq. (4.76) arise. For example, 
A is the length of the solid lines shown. & 


Maximum speed 
For the undamped case, the general form of x is x(t) = Ccos(@t + @). The initial 
condition v(0) = 0 tells us that @ = 0, and then the initial condition x(0) = x9 tells 
us that C = xo. Therefore, x(t) = xo cos(wt), and so v(t) = —wxg sin(wt). This has a 
maximum magnitude of wxo. 

Now consider the overdamped case. Equation (4.17) gives the position as 


x(t) = Ae YF 4 Be“ FOE, (4.77) 


Fig. 4.28 
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4.7. 


The initial conditions are 


x(0) = xo A+B=x0, 


(4.78) 
v(0) =0 Y-A- (y + 2)B=0. 


Solving these equations for A and B, and then plugging the results into Eq. (4.77), 
gives 


x(t) = = (ty +2) 9 — Gy — eH) (4.79) 


Taking the derivative to find v(t), and using y? — Q? = w”, gives 


Das 
vA = —w'xo fe oom = e Wem?) ‘ (4.80) 
2Q 


Taking the derivative again, we find that the maximum speed occurs at 


1 +2 
fmax = 5 In (5) (4.81) 


Plugging this into Eq. (4.80), and taking advantage of the logs in the exponentials, 
gives 


y= 2 y/22 
ro CS =) (4.82) 
yY 


The desired ratio, R, of the maximum speeds in the two scenarios is therefore 


Q\ 7/20 
gaT , (4.83) 
y-2 


In the limit of strong damping (y > @), we have Q = yy? — œ? ~ y — w /2y. 


So the ratio becomes 
2y SF 2 
RY oles, = ad í (4.84) 
w? /2y w 


In the limit of critical damping (y © w, Q ~ 0), we have, with Q/y = €, 


I 1/2« 
ra (1) x (1 +26) x e, (4.85) 


in agreement with the result of Exercise 4.28 (the solution to which is much quicker 
than the one above, since you don’t need to deal with all the Q’s). You can also show 
that in these two limits, fmax equals In(2y/w)/y and 1/y ~ 1/a, respectively. 


Exponential force 


F = ma gives ¥ = age—". Let’s guess a particular solution of the form x(t) = Ce". 
Plugging this in gives C = ay/b. And since the solution to the homogeneous equation 
X = 0 is x(t) = At + B, the complete solution for x is 


bt 


ey es (4.86) 


x(t) = 5 


4.8. 


4.9. 


4.8 Solutions 


The initial condition x(0) = 0 gives B = —ao/ b*. And the initial condition v(0) = 0 
applied to u(t) = —ape™™ /b + A gives A = ag/b. Therefore, 


et it 1 
x(t) = ao b2 T b z’ (4.87) 


in agreement with Problem 3.9. 


Driven oscillator 
Plugging x(t) = A cos wat + B sin wat into Eq. (4.29) gives 


— oA COS Wgt — oB sin wqt 
— 2yagA sin wat + 2y waB cos wat 
+ œA cos @at + ow Bsin wat = F cos wagt. (4.88) 


If this is true for all t, the coefficients of cos wgt on both sides must be equal. And 
likewise for sin wat. Therefore, 


oA + 2ymqB wA = Fs 
(4.89) 
wB 2yaqA w B= 0. 
Solving this system of equations for A and B gives 
F (@ — oå 2F 
A= e i o ge ra , (490) 
(P R a a E o 


in agreement with Eq. (4.31). 


Unequal masses 


Let xı and x2 be the positions of the left and right masses, respectively, relative to 
their equilibrium positions. The forces on the two masses are —Ax, + k(x2 — xı) and 
—kx2 — k(x2 — x1), respectively, so the F = ma equations are 


ži + 2m*x1 — wx) = 0, 


(4.91) 


2 


252 + 20x2 — wx) = 0. 


The appropriate linear combinations of these equations aren’t obvious, so we'll use 
the determinant method. Letting xı = 41e% and x2 = Aze'“', we see that for there to 
be a nontrivial solution for A and B, we must have 


d= —a + 2w? —w 
~ -W —2a7 + 20? 
= 2af — 607w + 30°. (4.92) 
The roots of this quadratic equation in a? are 
3 3 3—43 
a = +w = =ta,, and a=+w = = +a. (4.93) 


Ifa? = at, then the normal mode is proportional to (/3 + 1,—1). And if a? = a, 
then the normal mode is proportional to (v3 — 1, 1). So the normal modes are 


a = c ') cos(ajt+¢)), and 


Co = ea ') cos(azt + $2). 


(4.94) 
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4.10. 


Note that these two vectors are not orthogonal (there is no need for them to be). The 
normal coordinates associated with these normal modes are x, — (v3 — 1)x2 and 
xı + (v3 + 1)x2, respectively, because these are the combinations that make the a2 
and a frequencies disappear, respectively. 


Weakly coupled 


The magnitude of the force in the middle spring is « (x2 — x), so the F = ma equations 
are 


mx) = —kx; + kK — x1), 
(4.95) 
mš? = —kx2 — K(x2 — x1). 
Adding and subtracting these equations gives 
mı +¥2) = —k x1 + x2) xı +x2 = Acos(wt + $), 
. (4.96) 
mı — X2) = —(kK + 2k) — x2) => x1 —x2 = Bcos(ðt + ), 
where 
k k+2. 
wen. dik ga = (4.97) 
m m 


The initial conditions are x; (0) = a, x; (0) = 0, x2(0) = 0, and x2(0) = 0. The easiest 
way to apply these is to plug them into the normal coordinates in Eq. (4.96), before 
solving for x; (t) and x2(t). The velocity conditions quickly give ọ = $ = 0, and then 
the position conditions give A = B = a. Solving for xı (t) and x2 (t) yields 


t) = . t)+ : wt 
x(t) = z cos(w: 3 cos(at), 


P F (4.98) 
x2(t) = z cos(wt) — 5 cos(@t). 
Writing w and @ as 
+w o-w +w o-w 

= = p= —— + —— 4.99 
w 7 z’ and © 7 + 7 ( ) 

and using the identity cos(a + f) = cosa cos £ — sina sin B, gives 

O+o O-o 
(0) = acos ( 1) cos (== ‘), 

(4.100) 


ee to, . OO | 
x2(t) = asin 5) sin 5 s 


If we now approximate @ as 


R42 F 2 
on Ta Ji- <~o(14+ 7) 20426 (4.101) 
m m k k 


where € = (k/2k)@ = (k/2m),/m/k, we can write x; and x2 as 


xı (t) © acos ((@ + €)t) cos(et), 
(4.102) 
x2(t) © asin (Œw + €)t) sin(€f), 


as desired. To get an idea of what this motion looks like, let’s examine x1. Since € < w, 
the cos(e€f) oscillation is much slower than the cos((w + €)f) oscillation. Therefore, 
cos(€t) is essentially constant on the time scale of the cos((w + €)f) oscillations. This 
means that for the time span of a few of these oscillations, xı essentially oscillates with 
frequency w + € © œw and amplitude a cos(et). This a cos(et) term is the “envelope” 
of the oscillation, as shown in Fig. 4.30, for €/w = 1/10. Initially, the amplitude of 
xı is a, but it decreases to zero when et = 2/2. By this time, the amplitude of the x2 


4.8 Solutions 


oscillation, which is a sin(€t), has grown to a. So at t = x /2e, the right mass has all 
the motion, and the left mass is at rest. This process keeps repeating. After each time 
period of x /2e€, the motion of one mass gets transferred to the other. The weaker the 
coupling (that is, the spring constant x) between the masses, the smaller the €, and so 
the longer the time period. 


Remarks: The above reasoning also holds for two pendulums connected by a weak 
spring. All the above steps carry through, with the only change being that k is replaced 
by mg/€, because the spring force, —kx, is replaced by the tangential gravitational 
force, —mg sin 0 ~ —mg(x/€). So after a time 


uo (= e ) zm 
t= = > ; (4.103) 
2€ 2\« Vm k Ve 
the pendulum that was initially oscillating is now momentarily at rest, and the other 
pendulum has all the motion. Since the time scale, 7,, of a single mass on the end of 
the weak spring is proportional to /m/«, and since the time scale, Tp, of a simple 
pendulum is proportional to ./2/g, we see that the above ż is proportional to T2/T, p- 
The existence of the “beats” in Fig. 4.30 can be traced to the fact that the expressions 
in Eq. (4.98) are linear combinations of sinusoidal functions with two very close 
frequencies. The physics here is the same as the physics that produces the beats you 
hear when listening to two musical notes of nearly the same pitch, as when tuning 
a guitar.’ The time between the zeros of, say, xı in Fig. 4.30 is 7 /e, so the angular 
frequency of the beats is 27/(a/e) =2€. & 


4.11. Driven mass on a circle 
Label two diametrically opposite points as the equilibrium positions. Let the positions 
of the masses relative to these points be x; and x2, measured counterclockwise. If the 
driving force acts on mass “1,” then the F = ma equations are 


mx, + 2k(x1 — x2) = Fa cos wat, 
(4.104) 


mš2 + 2k(x2 — x1) = 0. 


To solve these equations, we can treat the driving force as the real part of Fye!*! 
and try solutions of the form x) (t) = A;e!@«! and x2 (t) = A2e!«", and then solve for 
A, and 42. Or we can try some trig functions. If we take the latter route, we quickly 
find that the solutions can’t involve any sine terms (this is due to the fact that there 
are no first derivatives of the x’s in Eq. (4.104)). Therefore, the trig functions must 
look like xı (£) = Aj cos wat and x2(t) = A2 cos wat. Using either of the two methods, 
Eq. (4.104) becomes 


w341 + 2" (41 — 42) =F, 
(4.105) 


w442 + 20° (42 — A1) = 0, 


where w = „k/m and F = Fg/m. Solving for A; and A2, we find that the desired 
particular solution is 


—F (20° — w7) —2F w? 


a = — > i; t) = = 
OS Ga a) ON OT haa 


coswgt. (4.106) 


8 If the two frequencies involved aren’t too close to each other, then you can actually hear a faint 
note with a frequency equal to the difference of the original frequencies (and possibly some other 
notes too, involving various combinations of the frequencies). But this is a different phenomenon 
from the above beats; it is due to the nonlinear way in which the ear works. See Hall (1981) for 
more details. 
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The most general solution is the sum of this particular solution and the homogeneous 
solution found in Eq. (4.111) in the solution to Problem 4.12 below. 


REMARKS: 

1. If wq = 2%, the amplitudes of the motions go to infinity. This makes sense, 
considering that there is no damping, and that the natural frequency of the system 
(calculated in Problem 4.12) is 2. 

2. If wg = V2, then the mass that is being driven doesn’t move. The reason for 
this is that the driving force balances the force that the mass feels from the two 
springs due to the other mass’s motion. And indeed, you can show that /2 is 
the frequency that one mass moves at if the other mass is at rest (and thereby 
acts essentially like a brick wall). Note that ag = /2w is the cutoff between the 
masses moving in the same direction or in opposite directions. 

3. Ifwa — ov, then both motions go to zero. But x2 is fourth-order small, whereas 
xı is only second-order small. 

4. Ifwa — 0, then 4] © A2 © -F /2%, which is very large. The slowly changing 
driving force basically spins the masses around in one direction for a while, and 
then reverses and spins them around in the other direction. We essentially have the 
driving force acting on a mass 2m, and two integrations of Fy cos wat = (2m)x 
show that the amplitude of the motion is F / 203, as above. Equivalently, you can 
calculate the 4; — A2 difference in the wy — 0 limit to show that the springs 
stretch just the right amount to cause there to be a net force of (Fa/2) cos wat on 
each mass. This leads to the same F [203 amplitude. & 


4.12. Springs on a circle 


(a) Label two diametrically opposite points as the equilibrium positions. Let 
the positions of the masses relative to these points be xı and x2, measured 
counterclockwise. Then the F = ma equations are 


mx + 2k(x1 — x2) = 0, 
(4.107) 


mx = 2k (x2 = x1) = 0. 


The determinant method works here, but let’s just do it the easy way. Adding 
the equations gives 


x1 +52 = 0, (4.108) 
and subtracting them gives 


Gi — 2) + 4@* (x) — x2) = 0. (4.109) 


The normal coordinates are therefore 
xı +x2 = Át + B, 
(4.110) 
xı — X2 = Ccos(2a@t+ ¢). 


Solving these two equations for xı and x2, and writing the results in vector form, 


gives 
) 7 a ar+)+c(") cos(2et + $), (4.111) 


x2 


where the constants A, B, and C are defined to be half of what they were in 
Eq. (4.110). The normal modes are therefore 


X1 1 
a = (i) (At + B), and 
(z) =C G cos(2wt + ġ). 


(4.112) 
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The first mode has frequency zero. It corresponds to the masses sliding around 
the circle, equally spaced, at constant speed. The second mode has both masses 
moving to the left, then both to the right, back and forth. Each mass feels a force 
of 4kx (because there are two springs, and each one stretches by 2x), hence the 
V4 = 2 in the frequency. 

(b) Label three equally spaced points as the equilibrium positions. Let the posi- 
tions of the masses relative to these points be x1, x2, and x3, measured 
counterclockwise. Then the F = ma equations are, as you can show, 


mx, + k(x — x2) + kx — x3) = 0, 
mxX2 + k(x2 — x3) + k(x2 — x1) = 0, (4.113) 


mx3 + k(x3 — x1) + k(x3 — x2) = 0. 


The sum of all three of these equations definitely gives something nice. Also, 
differences between any two of the equations give something useful. But let’s 
use the determinant method to get some practice. Trying solutions of the form 
xı = Aye, xy = Ane! and x3 = A3e"’, we obtain the matrix equation, 


—a? + 2w? —w —w Ay 0 
—w —a? + 20? —w Ay} =]0]. (4.114) 
—w —w —a? + 2w* A3 0 


Setting the determinant equal to zero yields a cubic equation in a. But it’s a 
nice cubic equation, with œ? = 0 as a solution. The other solution is the double 
root a? = 3”. 

The a = 0 root corresponds to A; = Az = 43. That is, it corresponds to the 
vector (1, 1, 1). This œ = 0 case is the one case where our exponential solution 
isn’t really an exponential. But œ? equalling zero in Eq. (4.114) basically tells 
us that we’re dealing with a function whose second derivative is zero, that is, a 
linear function At + B. Therefore, the normal mode is 


X1 1 
x2 | = | 1 | (4At +B). (4.115) 
X3 1 


This mode has frequency zero. It corresponds to the masses sliding around the 
circle, equally spaced, at constant speed. 

The two g? = 3%? roots correspond to a two-dimensional subspace of normal 
modes. You can show that any vector of the form (a,b,c) with a +b + c = 0 
is a normal mode with frequency /3w. We will arbitrarily pick the vectors 
(0, 1,—1) and (1,0, —1) as basis vectors for this space. We can then write the 
normal modes as linear combinations of the vectors 


Xx] 0 
x)= Ci 1 cos(V3at + $1), 
X3 —1 
(4.116) 
x] 1 
x2} =Co] 0 cos(V/3at + 2). 
X3 —1 


Remarks: The a? = 3%? case is very similar to the example in Section 4.5 
with two masses and three springs oscillating between two walls. The way we’ve 
written the two modes in Eq. (4.116), the first one has the first mass stationary 
(so there could be a wall there, for all the other two masses know). Similarly for 
the second mode. Hence the V3% result here, as in the example. 
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(c) 


The normal coordinates in this problem are x; +x2+ 3 (obtained by adding the 
three equations in (4.113)), and also any combination of the form axı +bx2+cx3, 
where a+b+c = 0 (obtained by taking a times the first equation in Eq. (4.113), 
plus b times the second, plus c times the third). The three normal coordinates 
that correspond to the mode in Eq. (4.115) and the two modes we chose in Eq. 
(4.116) are, respectively, x, + x2 + x3, x1 — 2x2 + x3, and —2x1 + x2 + X3, 
because each of these combinations gets no contribution from the other two 
modes (demanding this is how you can derive the coefficients of the x;’s, up to 
an overall constant). & 


In part (b), when we set the determinant of the matrix in Eq. (4.114) equal to zero, 
we were essentially finding the eigenvectors and eigenvalues” of the matrix, 


2 =, =] f LY 
=] 2 -l}/=37=]1 1 1), (4.117) 
=l.-=1. 2 1 1 1 


where J is the identity matrix. We haven’t bothered writing the common factor 
w” here, because it doesn’t affect the eigenvectors. As an exercise, you can show 
that for the general case of N springs and N masses on a circle, the above matrix 
becomes the N x N matrix, 


1 1 0 0 1 
1 1 1 0 0 
0 1 1 1 0 
37J-| 0 0 1 1 0 | =3l-mM. (4.118) 
1 0 0 0 -:. 1 


In the matrix M, the three consecutive 1’s keep shifting to the right, and they 
wrap around cyclicly. We must now find the eigenvectors of M, which will 
require being a little clever. 

We can guess the eigenvectors and eigenvalues of M if we take a hint from 
its cyclic nature. A particular set of things that are rather cyclic are the Nth roots 
of 1. If £ is an Nth root of 1, you can verify that (1, B, B, x im pY is an 
eigenvector of M with eigenvalue 67! + 1 + £. (This general method works for 
any matrix where the entries keep shifting to the right. The entries don’t have 
to be equal.) The eigenvalues of the entire matrix in Eq. (4.118) are therefore 
3— (B7! +1+ 8) = 2-— p7! — p. There are N different Nth roots of 1, namely 
Bn = e?7in/N  for0 <n <N — 1. So the N eigenvalues are 


An =2— mn + guln) = 2 —2cos(27n/N) 
= Asin? (rn/N). (4.119) 
The corresponding eigenvectors are 
(UR ade), (4.120) 


Since the numbers n and N — n yield the same value for A, in Eq. (4.119), 
the eigenvalues come in pairs (except for n = 0, and n = N/2 if N is even). 
This is fortunate, because we can then form real linear combinations of the 


? An eigenvector v of a matrix M is a vector that gets taken into a multiple of itself when acted 


upon by M. That is, Mv = iv, where A is some number (the eigenvalue). This can be rewritten as 


(M — d1)v = 0, where J is the identity matrix. By our usual reasoning about invertible matrices, 


a nonzero vector v exists only if A satisfies det |M — AZ| = 0. 
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two corresponding complex eigenvectors given in Eq. (4.120). We see that the 


vectors 
1 
cos(27n/N) 
= a + Vin) = i (4.121) 
cos (2(N = Dan/N) 
and 
0 
sin(2xn/N) 
y= Lo, =P = Hii (4.122) 
sin (20N L Dan/N) 
both have eigenvalue àn = Ay—n» (as does any linear combination of these 


vectors). For the special case of n = 0, the eigenvector is Vo = (1,1, 1,..., 1) 
with eigenvalue Ao = 0. And for the special case of n = N/2 if N is even, the 
eigenvector is Vy;2 = (1,—1,1,...,—1) with eigenvalue Ay 2 = 4. 

Referring back to the N = 3 case in Eq. (4.114), we see that we must take the 
square root of the eigenvalues and then multiply by w to obtain the frequencies 
(because it was an a? that appeared in the matrix, and because we dropped the 
factor of œw). The frequency corresponding to the above two normal modes is 
therefore, using Eq. (4.119), 


On = Øy Àn = 2w sin(rn/N). (4.123) 


For even N, the largest value of the frequency is 2w, with the masses moving 
in alternating equal positive and negative displacements. But for odd N, it is 
slightly less than 2. 

To sum everything up, the M normal modes are the vectors in Eqs. (4.121) 
and (4.122), where n runs from 1 up to the greatest integer less than N /2. And 
then we have to add on the Vo vector, and also the Vjv/2 vector if N is even.!? 
The frequencies are given in Eq. (4.123). Each frequency is associated with two 
modes, except the Vo mode and the Vy /2 mode if N is even. 


REMARK: Let’s check our results for N = 2 and N = 3. For N = 2: The values 
of n are the two “special” cases of n = 0 and n = N/2 = 1. Ifn = 0, we have 
@ = 0 and Vo = (1,1). Ifn = 1, we have w = 2w and V; = (1, —1). These 
results agree with the two modes in Eq. (4.112). 

For N = 3: Ifn = 0, we have w) = 0 and Vọ = (1,1, 1), in agreement 
with Eq. (4.115). If 7 = 1, we have w = V3, and V? = (1, —1/2,—1/2) 
and V; = (0,1/2,—1/2). These two vectors span the same space we found in 
Eq. (4.116). And they have the same frequency as in Eq. (4.116). You can also 
find the vectors for N = 4. These are fairly intuitive, so try to write them down 
first without using the above results. & 


10 Tf you want, you can treat the n = 0 and n = N /2 cases the same as all the others. But in both of 
these cases, the V~ vector is the zero vector, so you can ignore it. So no matter what route you 
take, you will end up with exactly N nontrivial eigenvectors. 
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Chapter 5 


Conservation of energy and 
momentum 


Conservation laws are extremely important in physics. They are enormously 
helpful, both quantitatively and qualitatively, in figuring out what is going on in 
a physical system. When we say that something is “conserved,” we mean that 
it is constant over time. If a certain quantity is conserved, for example, while a 
ball rolls around in a valley, or while a group of particles interact, then the possi- 
ble final motions are greatly restricted. If we can write down enough conserved 
quantities (which we are generally able to do, at least for the systems we’ll be 
concerned with), then we can restrict the final motions down to just one possibil- 
ity, and so we have solved our problem. Conservation of energy and momentum 
are two of the main conservation laws in physics. A third, conservation of angular 
momentum, is discussed in Chapters 7—9. 

It should be noted that it isn’t necessary (in principle) to use conservation of 
energy and momentum when solving a problem. We’ll derive these conservation 
laws from Newton’s laws. Therefore, if you felt like it, you could always (in 
theory) simply start with first principles and use F = ma, etc. However, at best, 
you would soon grow weary of this approach. And at worst, you would throw 
in the towel after finding the problem completely intractable. For example, you 
would get nowhere trying to analyze the collision between two shopping carts 
(whose contents are free to shift around) by looking at the forces on all the various 
objects. But conservation of momentum can quickly give you a great deal of 
information. The point of conservation laws is that they make your calculations 
much easier, and they also provide a means for getting a good idea of the overall 
qualitative behavior of a system. 


5.1 Conservation of energy in one dimension 


Consider a force, in just one dimension for now, that depends only on position. 
That is, F = F (x). If the force acts on a particle of mass m, and if we write a as 
v dvu/dx, then F = ma becomes 


F(x) = ee i (5.1) 
dx 


5.1 Conservation of energy in one dimension 


We can separate variables here and integrate from a given point x9 where the 
velocity is vo to an arbitrary point x where the velocity is v. The result is 
I 9 


x Vv x 1 
/ F(x) dx = f mdy => ‘| F(x’) dx! = =mv? — -muĝ 
x0 vo x0 2 2 


1 X 
=> E= 3m?’ — / Fœ) dx, (5.2) 


x0 


where E = mua /2. E depends on vo, so it therefore also depends on the choice 
of xo, because a different x9 would yield (in general) a different v9. What we’ve 
done here is simply follow the procedure in Section 3.3, for a function that 
depends only on x. If we now define the potential energy, V (x), as 


Vœ) = -— / FO) de’, (5.3) 


0 


then Eq. (5.2) becomes 
l 2 
ae +V(QX)=E. (5.4) 


We define the first term here to be the kinetic energy. Since this equation is 
true at all points in the particle’s motion, the sum of the kinetic energy and 
potential energy is constant. In other words, the total energy is conserved. If a 
particle loses (or gains) potential energy, then its speed increases (or decreases). 
Common examples of potential energy are kx? /2 for a Hooke’s-law spring force 
(—kx), with xo chosen to be zero; and mgy for the gravitational force (—mg), with 
yo chosen to be zero. 


In Boston, lived Jack as did Jill, 

Who gained mgh on a hill. 

In their liquid pursuit, 

Jill exclaimed with a hoot, 

“T think we’ve just climbed a landfill!” 


While noting, “Oh, this is just grand,” 
Jack tripped on some trash in the sand. 
He changed his potential 

To kinetic, torrential, 


But not before grabbing Jill’s hand. 


So that’s what really happened on that hill. People don’t just magically come 
“tumbling after” for no reason, of course. 

For a particle undergoing a given motion, both E£ and V(x) depend on the 
arbitrary choice of xg in Eq. (5.3). This implies that Æ and V(x) have no real 
meaning by themselves. Only the difference between E and V (x) is relevant (and 
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it equals the kinetic energy); this difference is independent of the choice of xo. 
But in order to be concrete in a given setup, we need to pick an arbitrary xo, so we 
must remember to state which x9 we’ve chosen. For example, it makes no sense 
to simply say that the gravitational potential energy of an object at height y above 
the ground is — f F dy = — [(—mg) dy = mgy. We have to say that the potential 
energy is mgy with respect to the ground (assuming that our yo is at ground level). 
If we wanted to, we could say that the potential energy is mgy + mg(7m) with 
respect to a point 7 meters below the ground. This is perfectly legitimate, although 
a bit unconventional.! But no matter what reference point we pick, the difference 
in the potential energies at the points, say, y = 3 m and y = 5 m equals mg(2 m). 

Note that although we introduced the xg in Eq. (5.2) as a point where the 
particle was at some time, the particle in fact need not ever be at the point x9. For 
example, we can throw a ball up at 8 m/s from a height of 5 m while measuring the 
gravitational potential energy with respect to a point a kilometer high. The ball is 
certainly not going to reach a height of a kilometer, but that’s fine. All that matters 
is the difference between £ and V (x) (both of which are very much negative here) 
throughout the motion, so a constant shift is irrelevant. We’ve simply added on 
the same negative quantity to both £ and V (x) in Eq. (5.4), compared with the 
values someone would measure if the ground were the reference point. 

If we take the difference between Eq. (5.4) evaluated at two points, x; and x2 
(or if we just integrate Eq. (5.1) from x; to x2), then we obtain 


1 2 1 2 
l (x2) — ii (x1) = Vx) — V(x2) 


x2 
f F(x’) dx’ = (Work),, >x- (5.5) 
X] 
Here it is clear that only differences in potential energies matter. If we define the 
integral in this equation to be the work done on the particle as it moves from xı 
to x2, then we have produced the work—-energy theorem:* 


Theorem 5.1 The change in a particles kinetic energy between points x, 
and x equals the work done on the particle between x, and xp. 


If the force points in the same direction as the motion (that is, if the F (x) and 

the dx in Eq. (5.5) have the same sign), then the work is positive and the speed 
increases. If the force points in the direction opposite to the motion, then the 
work is negative and the speed decreases. 
' Tt gets to be a pain to keep repeating “with respect to the ground.” Therefore, whenever anyone talks 
about gravitational potential energy in a setup on the surface of the earth, it’s generally understood 
that the ground is the reference point. If, on the other hand, the experiment reaches out to distances 
far from the earth, then r = oo is understood to be the reference point, for reasons of convenience 
we will see in the first example below. 


N 


In the form stated here, this theorem holds only for a point particle with no internal structure. See 
the “Work vs. potential energy” subsection below for the general theorem. 


5.1 Conservation of energy in one dimension 


Referring back to Eq. (5.4), and assuming we’ve chosen a reference point 
xo for the potential energy (and perhaps we’ve added on a constant to V(x), 
just because we felt like it), let’s draw in Fig. 5.1 the V(x) curve and also the 
constant E line (which we can determine if we’re given, say, the initial position 
and speed). Then the difference between E and V (x) gives the kinetic energy. The 
places where V(x) > E are the regions where the particle cannot go. The places 
where V(x) = E are the “turning points” where the particle stops and changes 
direction. In the figure, the particle is trapped between x; and x2, and oscillates 
back and forth. The potential V (x) is extremely useful this way because it makes 
clear the general properties of the motion. 


REMARK: It may seem silly to introduce a specific xg as a reference point, considering that 
it’s only the differences in the potential (which are independent of xo) that have any meaning. 
It’s sort of like taking the difference between 17 and 8 by first finding their sizes relative to 
5, namely 12 and 3, and then subtracting 3 from 12 to obtain 9. However, since integrals are 
harder to do than simple subtractions, it’s advantageous to do the integral once and for all and 
thereby label all positions with a definite number V (x), and to then take differences between 
the V’s when needed. & 


The differential form of Eq. (5.3) is 


dV (x) 


TEn 


(5.6) 


Given V(x), it is easy to take its derivative to obtain F(x). But given F(x), it 
may be difficult (or impossible) to perform the integration in Eq. (5.3) and write 
V (x) in closed form. But this is not of much concern. The function V (x) is well 
defined (assuming that the force is a function of x only), and if needed it can be 
computed numerically to any desired accuracy. 


Example (Gravitational potential energy): Consider two point masses, M 
and m, separated by a distance r. Newton’s law of gravitation says that the force 
between them is attractive and has magnitude GMm/ r? (we'll talk more about gravity 
in Section 5.4.1). The potential energy of the system at separation, measured relative 
to separation ro, is therefore 


"—GMm _, —GMm GMm 
dr = — + 


yn dr! = < I 


V(r) — Vro) = I 


ro F TO 


where the minus sign in the integrand comes from the attractive nature of the force. A 
convenient choice for rp is co, because this makes the second term vanish. It will be 
understood from now on that this 79 = oo reference point has been chosen. Therefore 
(see Fig. 5.2), 


vn = <a : (5.8) 


V(x) 


Fig. 5.1 


V(r) 


Fig. 5.2 
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Example (Gravity near the earth): What is the gravitational potential energy 
of a mass m at height y, relative to the ground? We know, of course, that it is 
mgy, but let’s do it the hard way. If M is the mass of the earth and R is its radius, 
then Eq. (5.8) gives (assuming y < R) 


—GMi —GMi —GMi 1 
VR+y)- VR) = GMm GMm _ —G “( ) 


R+y R è R 


Q 
pena 
° ia, 
= 
= 
Say 
EI 
oS 
= 
— 


= , (5.9) 


where we have used the Taylor series approximation for 1/(1 + €) to obtain the second 
line. We have also used the fact that a sphere can be treated like a point mass, as far 
as gravity is concerned. We’ll prove this in Section 5.4.1. 

Using g = GM/R?, we see that the potential energy difference in Eq. (5.9) equals 
mgy. We have, of course, simply gone around in circles here. We integrated in 
Eq. (5.7), and then we basically differentiated in Eq. (5.9) by taking the difference 
between the forces at nearby points. But it’s good to check that everything works out. 


A good way to visualize a potential V (x) is to imagine a ball sliding around ina 
valley or on a hill. For example, the potential of a typical spring is V (x) = kx?/2 
(this produces the Hooke’s-law force, F(x) = —dV/dx = —kx), and we can get 
a decent idea of what is going on if we imagine a valley with height given by 
y = x?/2. The gravitational potential of the ball is then mgy = mgx?/2. Choosing 
mg = k gives the desired potential. If we then look at the projection of the ball’s 
motion on the x axis, it seems like we have constructed a setup identical to the 
original spring. 

However, although this analogy helps in visualizing the basic properties of 
the motion, the two setups are not the same. The details of this fact are left 
for Problem 5.7, but the following observation should convince you that they 
are indeed different. Let the ball be released from rest in both setups at a large 
value of x. Then the force kx due to the spring is very large. But the force in 
the x direction on the particle in the valley is only a fraction of mg, namely 
(mg sin 0) cos @, where @ is the angle of the valley at that point. The setups are 
approximately the same, however, for small oscillations near the bottom of the 
valley. See Problem 5.7 for more details. 


Conservative forces 

Given any force (it can depend on x, v, t, and/or whatever), the work it does on 
a particle is defined by W = f F dx. If the particle starts at x; and ends up at 
x2, then no matter how it gets there (it might speed up or slow down, or reverse 
direction a few times), we can calculate the total work done on it by all the forces 


5.1 Conservation of energy in one dimension 


in the setup, and then equate the result with the change in kinetic energy, via 


%2 2 vdv 1 1 
Wrotal = / Frotal dx = f m( d Ja = 5m = zmot. (5.10) 
x x 


1 1 


For some forces, the work done is independent of how the particle moves. 
A force that depends only on position (in one dimension) has this property, 
because the integral in Eq. (5.10) depends only on the endpoints. The W = fF dx 
integral is the (signed) area under the F vs. x graph, and this area is independent 
of how the particle goes from x; to x2. 

For other forces, the work done depends on how the particle moves. Such is 
the case for forces that depend on ¢ or v, because it then matters when or how 
quickly the particle goes from x; to x2. A common example of such a force is 
friction. If you slide a brick across a table from x; to x2, then the work done 
by friction equals — umg|A^x|. But if you slide the brick by wiggling it back 
and forth for an hour before you finally reach x2, then the amount of negative 
work done by friction is very large. Since friction always opposes the motion, the 
contributions to the W = fF dx integral are always negative, so there is never 
any cancellation. The result is therefore a large negative number. 

The issue with friction is that although the umg force looks like a constant 
force (which is a subset of position-only dependent forces), it actually isn’t. At a 
given location, the friction can point to the right or to the left, depending on 
which way the particle is moving. Friction is therefore a function of velocity. 
True, it’s a function only of the direction of the velocity, but that’s enough to ruin 
the position-only dependence. 

We now define a conservative force as one for which the work done on a 
particle between two given points is independent of how the particle makes the 
journey. From the preceding discussion, we know that a one-dimensional force 
is conservative if and only if it depends only on x (or is constant).? The point 
we’re leading up to here is that although we can calculate the work done by any 
force, it makes sense to talk about the potential energy associated with a force 
only if the force is conservative. This is true because we want to be able to label 
each value of x with a unique number, V (x), given by V(x) = — EF dx. If this 
integral were dependent on how the particle goes from x9 to x, then it wouldn’t be 
well defined, so we wouldn’t know what number to assign to V (x). We therefore 
talk about potential energies only if they are associated with conservative forces. 
In particular, it makes no sense to talk about the potential energy associated with 
a friction force. 

A useful fact about the gravitational potential energy, mgz, is that it doesn’t 
depend on the path the particle takes, even in two or three dimensions. This 
is true because even if the particle moves in a complicated direction, only the 


3 In two or three dimensions, however, we will see in Section 7 that a conservative force must satisfy 
another requirement, in addition to being dependent only on position. 
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vertical z component of the displacement is relevant in calculating the work done 
by gravity. If we break the path up into many little pieces, the total work done 
by gravity is obtained by adding up the many little —mg (dz) terms. But the sum 
of all the dz’s is always equal to the total z, independent of the path. Therefore, 
no matter what the particle is doing in the two horizontal directions, the change 
in gravitational potential energy is always just mgz. So the gravitational force is 
a conservative force in three dimensions. We’ll see in Section 5.3 that this is a 
special case of a more general result. 


Example (Unwinding string): A mass is connected to one end of a massless 
string, the other end of which is connected to a very thin frictionless vertical pole. 
The string is initially wound completely around the pole, in a very large number of 
tiny horizontal circles, so that the mass touches the pole. The mass is released, and 
the string gradually unwinds. What angle does the string make with the pole at the 
moment it becomes completely unwound? 


Solution: Let £ be the length of the string, and let 6 be the final angle it makes with 
the pole. Then the final height of the mass is £ cos 0 below the starting point. So the 
mass loses a potential energy of mg (£ cos 6). Conservation of energy therefore gives 
(picking the initial height as y = 0, although this doesn’t matter) 


1 
K,+V,=Ke+Ve => 0+0 = 5m? — mgt cos9 => u? = 2glcosd. 
(5.11) 


There are two unknowns here, v and 0, so we need one more equation. This will be the 
radial F = ma equation for the (essentially) final circular motion. Because the pole is 
very thin, the motion can always be approximated by a horizontal circle, which very 
slowly lowers as time goes by. Because there is essentially no motion in the vertical 
direction, the total force in this direction is zero. Therefore, the vertical component 
of the tension is essentially mg. The horizontal component is then mg tan 0, so the 
F = ma equation for the final circular motion (which has a radius of £ sin 8) is 


mv? 


i v? = glsin6 tan. (5.12) 
€sin@ 


mg tan 0 = 


Equating our two expressions for v? gives tan@ = /2 => 9 ~ 54.7°. Interestingly, 
this angle is independent of £ and g. 


Work vs. potential energy 

When you drop a ball, does its speed increase because the gravitational force is 
doing work on it, or because its gravitational potential energy is decreasing? Well, 
both (or more precisely, either). Work and potential energy are two different ways 
of talking about the same thing (at least for conservative forces). Either method of 


5.1 Conservation of energy in one dimension 


reasoning gives the correct result. However, be careful not to use both reasonings 
and “double count” the effect of gravity on the ball. Which choice of terminology 
you use depends on what you call your “system.” Just as with F = ma and free- 
body diagrams, it is important to label your system when dealing with work and 
energy, as we’ll see in the example below. 

The work—energy theorem stated in Theorem 5.1 is relevant to one particle. 
What if we are dealing with the work done on a system that is composed of 
various parts? The general work—energy theorem states that the work done on a 
system by external forces equals the change in energy of the system. This energy 
may come in the form of (1) overall kinetic energy, (2) internal potential energy, 
or (3) internal kinetic energy (heat falls into this category, because it’s simply 
the random motion of molecules). So we can write the general work—energy 
theorem as 


Wexternal = AK+AV+ AK internal (5.13) 


For a point particle, there is no internal structure, so we have only the first of 
the three terms on the right-hand side, in agreement with Theorem 5.1. But to 
see what happens when a system has internal structure, consider the following 
example. 


Example (Raising a book): Assume that you lift a book up at constant speed, so 
there is no change in kinetic energy. Let’s see what the general work—energy theorem 
says for various choices of the system. 


e System = (book): Both you and gravity are external forces, and there is no 
change in the energy of the book as a system in itself. So the W-E theorem says 


Wyou + Weray =0 <= > mgh + (—mgh) = 0. (5.14) 


e System = (book + earth): Now you are the only external force. The gravitational 
force between the earth and the book is an internal force which produces an 
internal potential energy. So the W-E theorem says 


Wyou = AVearth-book <=> mgh = mgh. (5.15) 


e System = (book + earth + you): There is now no external force. The internal 
energy of the system changes because the earth—-book gravitational potential 
energy increases, and also because your potential energy decreases. In order to 
lift the book, you have to burn some calories from the dinner you ate. So the 
W-E theorem says 


0 = AVearth-book + AVyoun <=> 0 = mgh + (—mgh). (5.16) 
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Actually, a human body isn’t 100% efficient, so what really happens here is that 
your potential energy decreases by more than mgh, but heat is produced. The 
sum of these two changes in energy equals —mgh. So, including an amount n of 
energy in the form of heat, we have 


0 = AV earth_-book + AVyou + AKinternal 


<> 0=mgh+ (—mgh—n) +n. (5.17) 


Another contribution to this 7 heat term can come from, for example, the heat 
from friction if you slide the book up a rough wall as you lift it. 


The moral ofall this is that you can look at a setup in various ways, depending 
on what you pick as your system. Potential energy in one way might show up 
as work in another. In practice, it is usually more convenient to work in terms 
of potential energy. So for a dropped ball, people usually consider (consciously 
or not) gravity to be an internal force in the earth—ball system, as opposed to an 
external force on the ball system. In general, “conservation of energy,” commonly 
used in setups involving gravity and/or springs, is a straightforward principle to 
apply (and you’ll get plenty of practice with it in the problems and exercises for 
this chapter). So it turns out that you can usually ignore all these issues about 
work and about picking your system. 

But let’s look at one more example, just to make sure we’re on the same 
page. Consider a car that is braking (but not skidding). The friction force from 
the ground on the tires is what causes the car to slow down. But this force 
does no work on the car, because the ground isn’t moving; the force acts over 
zero distance. So the external work on the left side of Eq. (5.13) is zero. The 
right side is therefore also zero. That is, the total energy of the car doesn’t 
change. This is indeed true, because although the overall kinetic energy of the 
car decreases, there is an equal increase in internal kinetic energy in the form 
of heat in the brake pads and discs. In other words, AK = —AK internal, and the 
total energy remains constant. The unfortunate fact about this process is that the 
energy that goes into heat is lost and can’t be converted back into overall kinetic 
energy of the car. It makes much more sense to convert the overall kinetic energy 
into some form of internal potential energy (that is, AK = —AU), which can 
then be converted back into overall kinetic energy. Such is the case with hybrid 
cars which convert overall kinetic energy into chemical potential energy in a 
battery. 

Conversely, when a car accelerates, the friction force from the ground does 
no work (because the ground isn’t moving), so the total energy of the car remains 


4 In this case we have included the wall as a fourth object in our system, because it might contain 
some of the heat. If you want to instead consider the wall as an external object providing a force, 
then things get tricky; see Problem 5.6. 
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the same. The internal potential energy of the gasoline (or battery) is converted 
into overall kinetic energy (along with some heat and sound). A similar thing 
happens when you stand at rest and then start walking. The friction force from 
the ground does no work on you, so your total energy remains the same. You’re 
simply trading your breakfast for overall kinetic energy (plus some heat). For 
further discussion of work, see Mallinckrodt and Leff (1992). 


5.2 Small oscillations 


Consider an object in one dimension, subject to the potential V (x). Let the object 
initially be at rest at a local minimum of V (x), and then let it be given a small 
kick so that it moves back and forth around the equilibrium point. What can we 
say about this motion? Is it simple harmonic? Does the frequency depend on the 
amplitude? 

It turns out that for small amplitudes, the motion is indeed simple harmonic, 
and the frequency can easily be found, given V (x). To see this, expand V(x) in 
a Taylor series around the equilibrium point, xo, 


1 
V (x) = V (x0) + V' (x0) — xo) + J V" (xo) (x — x0)? 


1 m 3 

tar (xo) (x — xo) +---. (5.18) 
This looks like a bit ofa mess, but we can simplify it greatly. V (xo) is an irrelevant 
additive constant. We can ignore it because only differences in energy matter (or 
equivalently, because F = —dV/dx). And V'(xọ) = 0, by definition of the 
equilibrium point. So that leaves us with the V”(xọ) and higher-order terms. 
But for sufficiently small displacements, these higher-order terms are negligible 
compared with the V” (xo) term, because they are suppressed by additional powers 
of (x — xo). So we are left with? 


1 
V(x) & 5" COE — x0). (5.19) 


But this has exactly the same form as the Hooke’s-law potential, V(x) = 
(1/2)k(x — xo)*, provided that we let V” (xo) be our “spring constant” k. Equiv- 
alently, the force is F = —dV/dx = —V"(xo)(x — xo) = —k(x — xo). The 
frequency of small oscillations, œ = Jk/m, is therefore 


ee ai (5.20) 
m 


> Even if V” (xo) is much larger than V” (xo), we can always pick (x — xq) small enough so that 
the third-order term is negligible. The one case where this is not true is when V” (x9) = 0. But the 
result in Eq. (5.20) is still valid in this case. The frequency w just happens to be zero, in the limit 
of infinitesimal oscillations. 
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Example: A particle moves under the influence of the potential Vœ) = 
A/x* — B/x, where A, B > 0. Find the frequency of small oscillations around the 
equilibrium point. This potential is relevant to planetary motion, as we’ll see in 
Chapter 7. The rough shape is shown in Fig. 5.3. 


Solution: The first thing we need to do is calculate the equilibrium point xg. The 
minimum occurs where 


(a7 922 => x= =x (5.21) 
~ ~ p x2 Tp o l 


The second derivative of V (x) is 


64 2B 


'@ =z (5.22) 
x 


x3" 


Plugging in x9 = 24/B, we find 
y" BA 
ig) aE E E (5.23) 
m 8m4? 


Equation (5.20) is an important result, because any function V (x) looks basi- 
cally like a parabola (see Fig. 5.4) in a small enough region around a minimum 
(except in the special case where V” (xo) = 0). 


A potential may look quite erratic, 
And its study may seem problematic. 
But down near a min, 

You can say with a grin, 


“It behaves like a simple quadratic!” 


5.3 Conservation of energy in three dimensions 


The concepts of work and potential energy in three dimensions are somewhat 
more complicated than in one dimension, but the general ideas are the same.° As 
in the 1-D case, we’ll start with Newton’s second law, which now takes the vector 
form, F = ma. And as in the 1-D case, we’ll deal only with forces that depend 
only on position, that is, F = F(r), because these are the only ones that have 
any chance of being conservative. The F = ma vector equation is shorthand for 
three equations analogous to Eq. (5.1), namely mv, (dv, /dx) = Fy, and likewise 
for y and z. The F, here is a function of position, so we should really be writing 


é We’ll invoke a few results from vector calculus here. If you haven’t seen such material before, a 
brief review is given in Appendix B. 
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F\(x,y,z), but we’ll drop the arguments, lest our expressions get too cluttered. 
Multiplying through by dx, etc., in these three equations, and then adding them 
together gives 


Fx dx + Fy dy + Fz dz = m(vx dux + vy duy + vz dvz). (5.24) 


The left-hand side is the work done on the particle. With dr = (dx, dy, dz), 
this work can be written as F - dr (see Appendix B for the definition of the “dot 
product”). Using Eq. (B.2), we see that the work can also be written as F'|dr| cos 0, 
where @ is the angle between F and dr. Grouping this as (F cos @)|dr| shows 
that the work equals the distance moved times the component of the force along 
the displacement. Alternatively, grouping it as F (|dr| cos 0) shows that the work 
also equals the magnitude of the force times the component of the displacement 
in the direction of the force. 
Integrating Eq. (5.24) from the point (xo, vo, Zo) to the point (x, y, z) yields’ 


x y Zz 1 1 
E +f F, dx’ +f F, dy’ +f F; dz = -m(v? + v? + v2) = =mv’, 
x0 Yo f Zo 2 2 


(5.25) 


where E is a constant of integration; it equals mve /2, where vo is the speed at 
(x0, y0,Z0). Note that the integrations on the left-hand side depend on the path 
in 3-D space that the particle takes in going from (xo, yo, zo) to (x,y,z), because 
the components of F are functions of position. We’ll address this issue below. 
In terms of the dot product, Eq. (5.25) can be written in the more compact form, 


1 F 
m? — / F(r’) - dr' =E. (5.26) 
2 a 


Therefore, if we define the potential energy V (r) as 


V(r) = - f Fe) -dr’, (5.27) 


0 


then we can write 

1 2 

z” +V(r) =E. (5.28) 
In other words, the sum of the kinetic energy and potential energy is constant. 


7 We’ve put primes on the integration variables so that we don’t confuse them with the limits of 
integration. And as mentioned above, Fy is really F(x’, y’, z’), etc. 
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Conservative forces in three dimensions 

For a force that depends only on position (as we have been assuming), there is 
one complication that arises in 3-D that we didn’t have to worry about in 1-D. 
In 1-D, there is only one route that goes from xo to x. The motion itself may 
involve speeding up or slowing down, or backtracking, but the path is always 
restricted to be along the line containing xo and x. But in 3-D, there is an infinite 
number of routes that go from ro to r. In order for the potential V(r) to have 
any meaning and to be of any use, it must be well defined. That is, it must be 
path-independent. As in the 1-D case, we call the force associated with such 
a potential a conservative force. Let’s now see what types of 3-D forces are 
conservative. 


Theorem 5.2 Given a force F(r), a necessary and sufficient condition for the 
potential, 


r 
Vir) = -f Fc’) - dr’, (5.29) 
ro 
to be well defined (that is, to be path-independent) is that the curl of F is zero 
everywhere (that is, V x F = 0; see Appendix B for the definition of the curl).® 


Proof: First, let us show that V x F = 0 is a necessary condition for 
path-independence. In other words, “If V(r) is path-independent, then V x 
F=0.” This follows quickly from the discussion of the curl in Appendix B. 
We show in Eq. (B.24) that the integral f F - dr (that is, the work done) around a 
little rectangle in the x-y plane equals the z component of the curl times the area. 
If the work done in going from corner A to corner B in Fig. 5.5 is the same for 
paths “1” and “2” (as we are assuming), then the round-trip integral around the 
rectangle is zero, because one of the paths is being traced out backwards, so it 
cancels the contribution from the other path. So path-independence implies that 
the round-trip integral f F - dr is zero for any arbitrary rectangle in the x-y plane. 
Equation (B.24) therefore says that the z component of the curl must be zero 
everywhere. Likewise for the y and x components. We have therefore shown that 
V x F = 0 is a necessary condition for path independence. 

Now let us show that it is sufficient. In other words, “If V x F = 0, then 
V(r) is path-independent.” The proof of sufficiency follows immediately from 
Stokes’ theorem, which is stated in Eq. (B.25). This theorem implies that if 
V x F = 0 everywhere, then fọ F - dr = 0 for any closed curve. But Fig. 5.6 
shows that traversing the loop C counterclockwise entails traversing path “1” in 
the “forward” direction, and then traversing path “2” in the “backward” direction. 
Therefore, from the same reasoning as in the previous paragraph, the integrals 


8 If the force is infinite at any point, then the proof of sufficiency below (which is based on Stokes’ 
theorem) isn’t valid, and it turns out that a second condition is required; see Feng (1969). But we 
won’t worry about that here. 
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from ro to r along paths “1” and “2” are equal. This holds for arbitrary points ro 
and r, and arbitrary curves C, so V(r) is path-independent. a 


Remark: Another way to show that V x F = 0 is a necessary condition for path-independence 
(that is, “If V(r) is path-independent, then V x F = 0”) is the following. If V(r) is path- 
independent (and therefore well defined), then it is legal to write down the differential form of 
Eq. (5.27), namely 


dV (r) = —F(r) - dr = —(Fy dx + F, dy + F; dz). (5.30) 
But another expression for dV is 


av av av 
dV (r) = dx +4 dy 4 dz. (5.31) 
ox ay * az 


These two expressions must be equivalent for arbitrary dx, dy, and dz. So we have 


ƏV ƏV ƏV 
(Fx, Fy, Fz) (: Nae ar ) Fir) = -VV (r). (5.32) 
f əx dy az 


In other words, the force is the gradient of the potential. Therefore, 


VxF=-Vx VV(r) = 0, (5.33) 


because the curl of a gradient is identically zero, as you can verify by using the definition of the 
curl in Eq. (B.20) and the fact that partial differentiation is commutative (that is, ©? V/dx 3y = 
a°V/dy dx). æ 


Example (Central force): A central force is defined to be a force that points 
radially and whose magnitude depends only on r. That is, F(r) = F (r)f. Show that 
a central force is conservative by explicitly showing that V x F = 0. 


Solution: The force F may be written as 
x x yz 
F(x,y,2) = F(t = F@) (2,2,5). (5.34) 
| cae See a 


Now, as you can verify, 


or 3V x2 + y2 +22 _ x 


ax ax r 


> (5.35) 
and similarly for y and z. Therefore, the z component of V x F equals (writing F 
for F (r), and F’ for dF (r) /dr, and making use of the chain rule) 


OFy OFy ƏQF/r)  d(xF/r) 
ox oy Ox oy 


Y pt OF ae z) (Erz Fl z) 


ll 
~~ 
`~ 
2% 
a 


r2 Ox roy r2 oy 
yxF'  yxF xyF’ xF 
a ( 2 A ) ( Z a= 0. (5.36) 


Likewise for the x and y components. 
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5.4 Gravity 

5.4.1 Newton’s universal law of gravitation 

The gravitational force on a point mass m, located a distance r from a point mass 
M, is given by Newton’s law of gravitation, 


—GMm 


FO=—, 


(5.37) 


where the minus sign indicates an attractive force. The numerical value of G 
is 6.67 - 107!! m3/(kgs?). We’ll show how this value is obtained in Section 
5.4.2 below. 

What is the force if we replace the point mass M by a sphere of radius R and 
mass M? The answer (assuming that the sphere is spherically symmetric, that 
is, the density is a function only of r) is that it is still -GMm/r?. A sphere acts 
just like a point mass at its center, for the purposes of gravity (as long as we’re 
considering a mass m outside the sphere). This is an extremely pleasing result, 
to say the least. If it were not the case, then the universe would be a far more 
complicated place than it is. In particular, the motion of planets and such things 
would be much harder to describe. 

To demonstrate that spheres behave like points, as far as gravity is concerned, 
it turns out to be much easier to calculate the potential energy due to a sphere, and 
to then take the derivative to obtain the force, rather than to calculate the force 
explicitly.” So this is the route we will take. It will suffice to demonstrate the 
result for a thin spherical shell, because a sphere is the sum of many such shells. 
Our strategy for calculating the potential energy at a point P, due to a spherical 
shell, will be to slice the shell into rings as shown in Fig. 5.7. Let the radius of 
the shell be R, let P be a distance r from the center of the shell, and let the ring 
make the angle 0 shown. The distance, £, from P to the ring is a function of R, 
r, and 0. It may be found as follows. In Fig. 5.8, segment AB has length R sin 6, 
and segment BP has length r — R cos 6. So the length £ in triangle ABP is 


l = J (Rsin0)? + (r — Roos6)2 = V R2 + r? — 2rR cos 0. (5.38) 


What we’ve done here is just prove the law of cosines. 

The area of a ring between 0 and 6 + d@ is its width (which is R d@) times its 
circumference (which is 27 R sin 0). Letting o = M/(4z R?) be the mass density 
per unit area of the shell, we see that the potential energy of a mass m at P due to 
a thin ring is — Gmo (Rd6)(27R sin 0)/£. This is true because the gravitational 
potential energy, 


-G 
ro = A, (5.39) 


? The reason for this is that the potential energy is a scalar quantity (just a number), whereas the 
force is a vector. If we tried to calculate the force, we would have to worry about forces pointing 
in all sorts of directions. With the potential energy, we simply have to add up a bunch of numbers. 


5.4 Gravity 


is a scalar quantity, so the contributions from the little mass pieces simply add. 
Every piece of the ring is the same distance from P, and this distance is all that 
matters; the direction from P is irrelevant (unlike it would be with the force). 
The total potential energy at P is therefore 


™ IxoGR-msin6 dé 
VR2 + r2 — 2rR 0s 0 


2noGR 
= RLY (Re a ee 
: 


V(r) = 


T 


(5.40) 


0 
The sin @ in the numerator is what makes this integral nice and doable. We must 


now consider two cases. Ifr > R, then we have 


2 
EE ERT R) p G(4r Ro )m s GMm ! 


r r 
(5.41) 


27o GRm 
V(r)= -2 


which is the potential due to a point mass M located at the center of the shell, as 
desired. Ifr < R, then we have 

G(4r R? GM 
(+R) (R »)= (4 Rio )m _ m 


UAS R R 
(5.42) 


27o GRm 
r 


which is independent of r. Having found V (r), we can now find F (r) by taking 
the negative of the gradient of V. The gradient is just f(d /dr) here, because V is 
a function only of r. Therefore, 


GM) 
Fr) =-->", ifr>R, 
r (5.43) 


F(r)=0, ifr<R. 


These forces are directed radially, of course. A solid sphere is the sum of many 
spherical shells, so if P is outside a given sphere, then the force at P is —GMm/r?, 
where M is the total mass of the sphere. This result holds even if the shells have 
different mass densities (but each one must have uniform density). Note that the 
gravitational force between two spheres is the same as if they were both replaced 
by point masses. This follows from two applications of our “point mass” result. 


Newton looked at the data, numerical, 
And then observations, empirical. 

He said, “But, of course, 

We get the same force 


From a point mass and something that’s spherical!” 


If P is inside a given sphere, then the only relevant material is the mass inside 
a concentric sphere through P, because all the shells outside this region give zero 
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force, from the second equation in Eq. (5.43). The material “outside” of P is, for 
the purposes of gravity, not there. 

It isn’t obvious that the force inside a spherical shell is zero. Consider the 
point P in Fig. 5.9. A piece of mass dm on the right side of the shell gives a larger 
force on P than a piece of mass dm on the left side, due to the 1/r? dependence. 
But from the figure, there is more mass on the left side than the right side. These 
two effects happen to exactly cancel, as you can show in Problem 5.10. 


Example (Supporting a tube): Imagine the following unrealistic undertaking. 
Drill a narrow tube, with cross-sectional area A, from the surface of the earth down 
to the center. Then line the cylindrical wall of the tube with a frictionless coating. 
Then fill the tube back up with the dirt (and magma, etc.) that you originally removed. 
What force is necessary at the bottom of the tube of dirt (that is, at the center of the 
earth) to hold it up? Let the earth’s radius be R, and assume (incorrectly) a uniform 
mass density p. 


Solution: The gravitational force on a mass dm at radius r is effectively due to the 
mass inside the radius r (call this M,). The mass outside r is effectively not there. 
The gravitational force is therefore 

GM, dm  G((4/3)xr? p) dm 


4 
Fin — = = -x Gpr dm, 5.44 
dm a 2 3 pram ( ) 


which we see increases linearly with r. The dirt in the tube between r and r + dr has 
volume A dr, so its mass is dm = pA dr. The total gravitational force on the entire 
tube is therefore 


R 4 4 R 
F= [Fin -f z7 Gor pd dr) = Snapa | rdr 
0 0 


4 R 2 
= 37GA: z= STGP AR. (5.45) 


The force at the bottom of the tube must be equal and opposite to this force. In terms 
of the mass of the earth, Mg = (4/3) R? p, and the total mass of the tube, M, = pAR, 
this result can be written as F = GMgM;/ 2R. So the required force is half of what 
a scale on the surface of the earth would read if all of the tube’s dirt sat in a lump on 
top of it. The reason for this is basically that the force in Eq. (5.44) is linear in r. 


An important subtopic of gravity is the tidal force, but because this is most 
easily discussed in the context of accelerating frames of reference and fictitious 
forces, we’ll postpone it until Chapter 10. 


5.4.2 The Cavendish experiment 


How do we determine the numerical value of G in Eq. (5.37)? If we can produce 
a setup in which we know the values F, M, m, and r, then we can determine G. 


5.4 Gravity 


The first strategy that comes to mind is to take advantage of the fact that the 
gravitational force on an object on the earth’s surface is known to be F = mg. 
Combining this with Eq. (5.37) gives g = GMg/R?. We know the values of g 
and R,!° so this tells us what the product GM; is. However, this information 
unfortunately doesn’t help us, because we don’t know what the mass of the earth 
is (without knowing G first, which we’re assuming we don’t know yet; see the last 
paragraph in this section). For all we know, the mass of the earth might be 10 times 
larger than what we think it is, with G being 10 times smaller. The only way to 
find G is to use a setup with two known masses. But this then presents the problem 
that the resulting force is extremely small. So the task boils down to figuring out 
a way to measure the tiny force between two known masses. Henry Cavendish 
solved this problem in 1798 by performing an extremely delicate experiment 
(which was devised a few years earlier by John Michell, but he died before being 
able to perform it).!' The basic idea behind the experiment is the following. 

Consider the setup in Fig. 5.10, which shows the top view. A dumbbell with 
two masses m on its ends hangs froma very thin wire. The dumbbell is free to twist, 
although if it twists, the wire will provide a tiny restoring torque.!* The dumbbell 
starts with no twist in the wire, and then two other masses M are placed (fixed) 
at the positions shown. These masses produce attractive forces on the dumbbell 
masses and cause the dumbbell to twist counterclockwise. The dumbbell will 
oscillate back and forth before finally settling down at some tiny angle 0 away 
from the initial position. 

The torque on the dumbbell that arises from the twist in the wire takes the 
form of t = —bé@ (with counterclockwise torque taken to be positive), where b 
is a constant that depends on the thickness and makeup of the wire. This linear 
relation between t and @ holds for small @ for all the same reasons that the 
F = —kx Hooke’s-law result in Section 5.2 holds. 

The gravitational force between each pair of masses is GMm/d*, where d 
is the separation between the centers of the masses in each pair. So the torque 
on the dumbbell due to the two gravitational forces is 2(GMm/ dE, where £ is 
half the length of the dumbbell. Demanding that the total torque on the dumb- 
bell be zero gives 


2GMm£ bod? 
—_— -b =0 = G= . 
d2 2Mm£ 


(5.46) 


10 The radius of the earth has been known (at least roughly) since the time of Eratosthenes, about 
250 BC. For an interesting way to measure it yourself, see Rawlins (1979). 

1L The purpose of the experiment, as intended by Michell and Cavendish, was actually to measure 
the density of the earth, and not G; see Clotfelter (1987). But as we’ll see below, this is equivalent 
to measuring G. 

12 We won’t talk about torque until Chapter 8, so you may want to come back and read this section 
after that. We’ll invoke some results about rotational dynamics here, but the general setup should 
be clear even if you’re not familiar with rotations. 
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We know all the parameters on the right-hand side except b, so if we can determine 
that, then we’re done. It’s very difficult to measure b directly with any reasonable 
accuracy, because the torque in the wire is so tiny. But fortunately there’s a sneaky 
way to determine b that involves taking a page from our playbook on oscillations. 
The bread-and-butter equation for rotations is T = J 6, where I is the moment of 
inertia (which we can calculate for the dumbbell); this is the rotational analog of 
Newton’s second law, F = mx. Now, if the torque takes the form of t = —b0 
for small 6, then t = JÖ becomes —b@ = IÖ. This is a good old simple- 
harmonic-oscillator equation, so we know that the frequency of the oscillations 
is œ = vb . Therefore, all we need to do is measure the period, T = 27/a, 
of the oscillations while the dumbbell is settling down, and we can determine b 
from b = Iœ? = I (27 /T). (The time T is large, because b is small on the scale 
of things, because otherwise there wouldn’t be any noticeable twist in the wire.) 
Plugging this value of b into Eq. (5.46) finally gives 


_ 4x7 10d? 


= 5.47 
2MmeT2 ( ) 


The Cavendish experiment is also known as the “weighing the earth” (or 
perhaps the “massing the earth”) experiment, because now that we know G (and 
also g and R), we can use g = GMg/R? to calculate the mass of the earth, Mg. The 
only possible way to determine Mg (without examining every cubic meter of the 
inside of the earth, which is obviously impossible) is to determine G first, as we 
have done.!? Interestingly, the resulting value of Mg, which is roughly 6- 104 kg, 
leads to an average density of the earth of about 5.5 g/cm?. This is larger than 
the density of the earth’s crust and mantle, so we conclude that there must be 
something very dense deep down inside the earth. So the Cavendish experiment, 
which involves masses hanging from a wire, amazingly tells us something about 
the earth’s core! '* 


5.5 Momentum 
5.5.1 Conservation of momentum 


Newton’s third law says that for every force there is an equal and opposite force. 
In other words, if Fz, is the force that particle a feels due to particle b, and if 
Fz, is the force that particle b feels due to particle a, then Fpa = —Fgp at all 
times. This law has important implications concerning momentum, p = mv. 
Consider two particles that interact over a period of time. Assume that they are 
isolated from outside forces. From Newton’s second law, F = dp/dt, we see by 
integrating this that the total change in a particle’s momentum equals the time 


B if you want to determine Mg without using g or R (but still using G, of course, because Mg appears 
only through the combination GMg), see Celnikier (1983). 
14 For a comprehensive discussion of the earth’s core, see Brush (1980). 
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integral of the force acting on it. That is, 


bh 


p(t) — p(t) = J Fat. (5.48) 


ti 


This integral is called the impulse. If we now invoke the third law, Fpa = —F3b, 
we find 


Palt) — Pa(ti) = f 


ti 


to b 
Fo di = — f Fpa dt = — (po (t2) — pa (t1)). (5.49) 
ti 
Therefore, 


Pa(t2) + Ph (f2) = Pa (tı) + Po (ti). (5.50) 


This is the statement that the total momentum of this isolated system of two 
particles is conserved; it does not depend on time. Note that Eq. (5.50) is a vector 
equation, so it is really three equations, namely conservation of py, py, and pz. 


REMARK: Newton’s third law makes a statement about forces. But force is related to momentum 
via F = dp/dt. So the third law essentially postulates conservation of momentum. (The “proof” 
above in Eq. (5.49) is hardly a proof. It involves one simple integration.) So you might wonder 
if momentum conservation is something you can prove, or if it’s something you have to assume, 
as we have basically done because we have simply accepted the third law. 

The difference between a postulate and a theorem is rather nebulous. One person’s postulate 
might be another person’s theorem, and vice versa. You have to start somewhere in your 
assumptions. We chose to start with the third law. In the Lagrangian formalism in Chapter 6, 
the starting point is different, and momentum conservation is deduced as a consequence of 
translational invariance (as we will see). So it looks more like a theorem in that formalism. 

But one thing is certain. Momentum conservation for two particles cannot be proved from 
scratch for arbitrary forces, because it is not necessarily true. For example, if two charged 
particles interact in a certain way through the magnetic fields they produce, then the total 
momentum of the two particles might not be conserved. Where is the missing momentum? It is 
carried off in the electromagnetic field. The total momentum of the system is indeed conserved, 
but the crucial point is that the system consists of the two particles plus the electromagnetic 
field. Said in another way, each particle actually interacts with the electromagnetic field, and 
not the other particle. Newton’s third law does not necessarily hold for particles subject to such 
a force. & 


Let’s now look at momentum conservation for a system of many particles. 
As above, let F;; be the force that particle i feels due to particle j. Then F; = —F;; 
at all times. Assume that the particles are isolated from outside forces. The change 
in momentum of the ith particle from ¢ to t2 is (we won’t bother writing all the 
t’s in the expressions below) 


av= f (£0) dt. (5.51) 
j 


Therefore, the change in the total momentum of all the particles is (switching the 
order of the integration and the sum over i on the right-hand side here) 


AP =) Apj= J (x DF) dt. (5.52) 
i i j 
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But by F; = 0 at all times, because for every term F4, there is a term Fpa, 
and Fap + Fpa = 0 (and also, Faa = 0). The forces all cancel in pairs. Therefore, 
the total momentum of an isolated system of particles is conserved. 


Example (Snow on a sled): You are riding on a sled that is given an initial push 
and slides across frictionless ice. Snow is falling vertically (in the frame of the ice) on 
the sled. Assume that the sled travels in tracks that constrain it to move in a straight 
line. Which of the following three strategies causes the sled to move the fastest? The 
slowest? 


A: You sweep the snow off the sled so that it leaves the sled in the direction 
perpendicular to the sled’s tracks, as seen by you in the frame of the sled. 

B: You sweep the snow off the sled so that it leaves the sled in the direction 
perpendicular to the sled’s tracks, as seen by someone in the frame of the ice. 

C: You do nothing. 


First solution: The sideways motion of the snow after you sweep it offis irrelevant, 
because although Newton’s third law says that the swept snow applies a sideways force 
on the sled, the normal force from the tracks keeps the sled from sliding off the tracks. 
Also, the vertical motion of the snow when it hits the sled is irrelevant, because the 
vertical normal force from the tracks keeps the sled from falling through the ground. 
We are therefore concerned only with the motion in the direction of the tracks. And 
since there are no external forces in this direction on the you/sled/snow system, the 
momentum in this direction is conserved. 

In general, the speed of the sled can (possibly) change due to two kinds of events: 
(1) new snow hitting the sled (and eventually coming to rest with respect to the sled), 
and (2) snow being swept off the sled. 

Let’s first compare A with C. In strategy A, your sweeping action doesn’t change 
the speed of the sled, because you are sweeping the snow directly sideways in your 
frame. Since the sideways motion of the snow is irrelevant as far as the forward 
momentum goes, you are essentially just reaching out and plopping a ball of snow 
on the ice. This snow then simply travels along next to the sled at the same speed; 
it might as well be connected to the sled (at least for a moment, until new snow hits 
the sled). In strategy C, your (non) action obviously don’t change the speed of the 
sled. So in comparing A with C, the departure of snow doesn’t differentiate them. We 
therefore need only consider what happens to the sled when new snow hits it. But 
this is easy to do: Since the sled is heavier in C than in A, a new snowflake slows it 
down less in C than in A. Therefore, C is faster than A. 

Now let’s compare B with C. B is faster than C because in B the snowflakes 
have zero forward momentum in the end, whereas in C they have nonzero forward 
momentum (because they are sitting on the moving sled). The momenta of the two 
systems must be the same (equal to the initial momentum of the sled plus you), so 
the sled must be moving faster in B. 

Therefore, B is faster than C, which is faster than 4. As a consistency check, it’s 
easy to see that B is faster than A, because in B you must push the snow backward with 
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respect to the sled. So by Newton’s third law, the swept snow in B exerts a forward 
force on the sled. 
Second solution: In the end, this solution is basically the same as the first one, but 
it’s a slightly more systematic way of looking at things: In B, all the snow is moving 
slower than the sled; in fact, it is all at rest (at least in the forward direction) with 
respect to the ice. In C, all the snow is on the sled. And in A, all the snow is moving 
faster than the sled; it is moving along with various forward speeds, ranging from the 
initial speed of the sled down to the present speed, depending on when it was swept off. 
By conservation of momentum, the total momentum of the sled (including you) 
plus the snow at any given time is the same in all three cases. The only consistent 
way to combine the facts in the previous paragraph with conservation of momentum 
is for the speed of the sled to satisfy B > C > A. This is true because if someone 
claimed that C > B, then the slowest object (namely everything, since all the snow 
is on the sled) in C would be going faster than the fastest object (the sled) in B; this 
would contradict the fact that the momenta of the two systems are equal. Therefore, 
we must have B > C. Likewise, if someone claimed that A > C, then the slowest 
object (the sled) in A would be going faster than the fastest object (everything) in C; 
this would again be a contradiction. Therefore, we must have C > A. Putting these 
together gives B > C > A. See Exercise 5.70 for a quantitative treatment of this setup. 


5.5.2 Rocket motion 


The quantitative application of momentum conservation can get a little tricky 
when the mass m is allowed to vary. Such is the case with rockets, because most 
of their mass consists of fuel which is eventually ejected. 

Let mass be ejected backward with (constant) speed u relative to the rocket.!° 
Since u is a speed here, it is defined to be positive. This means that the velocity of 
the ejected particles is obtained by subtracting u from the velocity of the rocket. 
Let the rocket have initial mass M, and let m be the (changing) mass at a later 
time. Then the rate of change of the rocket’s mass is dm/dt, which is negative. 
So mass is ejected at a rate |dm/dt| = —dm/dt, which is positive. In other words, 
during a small time dt, a negative mass dm gets added to the rocket, and a positive 
mass (—dm) gets shot out the back. (If you wanted, you could define dm to be 
positive, and then subtract it from the rocket’s mass, and have dm get shot out the 
back. Either way is fine.) It may sound silly, but the hardest thing about rocket 
motion is picking a sign for these quantities and sticking with it. 

Consider a moment when the rocket has mass m and speed v. Then at a time 
dt later (see Fig. 5.11), the rocket has mass m + dm and speed v + dv, while the 


15 Just to emphasize, u is the speed with respect to the rocket. It wouldn’t make sense to say “relative 
to the ground,” because the rocket’s engine shoots out the matter relative to itself, and the engine 
has no way of knowing how fast the rocket is moving with respect to the ground. 
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exhaust has mass (—dm) and speed v — u (which may be positive or negative, 
depending on the relative sizes of v and u). There are no external forces, so the 
total momentum at each of these times must be equal. Therefore, 


mv = (m+ dm)(v + dv) + (—dm)(v — u). (5.53) 
Ignoring the second-order term dm dv, this simplifies to m dv = —u dm. Dividing 
by m and integrating from t; to h gives 
v2 ™ dm mı 
/ dv=- | u— => wv =uln—. (5.54) 
vı mı m m2 


For the case where the initial mass is M and the initial speed is 0, we have 
v = uln(M/m). Note that we haven’t assumed anything about dm/dt in this 
derivation. There is no need for it to be constant; it can change in any way it 
wants. The only thing that matters (assuming that M and u are given) is the final 
mass m. In the special case where dm/dt happens to be constant (call it —7, where 
n is positive), we have v(t) = uln[M/(M — nft)). 

The log in the result in Eq. (5.54) is not very encouraging. If the mass of 
the metal in the rocket is m, and if the mass of the fuel is 9m, then the final 
speed is only uln 10 ~ (2.3)u. If the mass of the fuel is increased by a factor 
of 11 up to 99m (which is probably not even structurally possible, given the 
amount of metal required to hold it),!° then the final speed only doubles to 
uln 100 = 2(uln 10) ~ (4.6)u. How do you make a rocket go significantly 
faster? Exercise 5.69 deals with this question. 


REMARK: Ifyou want, you can solve this rocket problem by using force instead of conservation 
of momentum. If a chunk of mass (—dm) is ejected out the back, then its momentum changes 
by udm (which is negative). Therefore, because force equals the rate of change in momentum, 
the force on this chunk is u dm/dt. By Newton’s third law, the remaining part of the rocket then 
feels a force of —u dm /dt (which is positive). This force accelerates the remaining part of the 
rocket, so F = ma gives —udm/dt = m dv/dt," which is equivalent to the mdv = —u dm 
result above. 

We see that this rocket problem can be solved by using either force or conservation of 
momentum. In the end, these two strategies are really the same, because the latter was derived 
from F = dp/dt. But the philosophies behind the approaches are somewhat different. The 
choice of strategy depends on personal preference. In an isolated system such as a rocket, 
conservation of momentum is usually simpler. But in a problem involving an external force, 
you have to use F = dp/dt. You'll get lots of practice with F = dp/dt in the problems for this 
section and also in Section 5.8. Note that we used both F = dp/dt and F = ma in this second 
solution to the rocket problem. For further discussion of which expression to use in a given 
situation, see Appendix C. & 


'6 The space shuttle’s external fuel tank, just by itself, has a fuel-to-container mass ratio of only 
about 20. 

17 Whether we use m or m + dm here for the mass of the rocket doesn’t matter. Any differences are 
of second order. 
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5.6 The center of mass frame 
5.6.1 Definition 


When talking about momentum, it is understood that a certain frame of reference 
has been chosen. After all, the velocities of the particles have to be measured 
with respect to some coordinate system. Any inertial (that is, nonaccelerating) 
frame is legal to pick, but we will see that there is one particular reference frame 
that is often advantageous to use. 

Consider a frame S and another frame S’ that moves at constant velocity u 
with respect to S (see Fig. 5.12). Given a system of particles, the velocity of the 
ith particle in S is related to its velocity in S’ by 


vi =v +u. (5.55) 


This relation implies that if momentum is conserved during a collision in frame 
S’, then it is also conserved in frame S. This is true because both the initial and 
final momenta of the system in S are increased by the same amount, (X_ m;)u, 
compared with what they are in S’.!* 

Let us therefore consider the unique frame in which the total momentum of a 
system of particles is zero. This is called the center of mass frame, or CM frame. 
If the total momentum is P = }_ m;v; in frame S, then the CM frame is the frame 
S’ that moves with velocity 


P X miyi 
M M 


(5.56) 


with respect to S, where M = }_ m; is the total mass. This follows from using 
Eq. (5.55) to write 


P 
P= Xomiy; = So mi (« z x) =P—P=0. (5.57) 


The CM frame is extremely useful. Physical processes are generally much more 
symmetrical in this frame, and this makes the results more transparent. The CM 
frame is sometimes called the “zero momentum” frame. But the “center of mass” 
name is commonly used because the center of mass of the particles doesn’t move 
in the CM frame, for the following reason. The position of the center of mass is 
defined by 


_ mix; 


Rew = (5.58) 


'8 Alternatively, nowhere in our earlier derivation of momentum conservation did we say what frame 
we were using. We assumed only that the frame wasn’t accelerating. If it were accelerating, then 
F would not equal ma. We will see in Chapter 10 how F = ma is modified in a noninertial frame. 
But there’s no need to worry about that here. 


Fig. 5.12 
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This is the location of the pivot upon which a rigid system would balance, as 
we’ll see in Chapter 8. The fact that the CM doesn’t move with respect to the CM 
frame follows from the fact that the derivative of Rcy is the velocity of the CM 
frame in Eq. (5.56). The center of mass may therefore be chosen as the origin of 
the CM frame. 

If we take two derivatives of Eq. (5.58), we obtain 


Macy = J mia; = ) F; = Foot (5.59) 


So as far as the acceleration of the CM goes, we can treat the system of particles 
like a point mass at the CM, and then just apply F = ma to this point mass. Since 
the internal forces cancel in pairs, we need only consider the external forces when 
calculating Ftotal- 

Along with the CM frame, the other frame that people generally work with 
is the Jab frame. There is nothing at all special about this frame. It is simply 
the frame (assumed to be inertial) in which the conditions of the problem are 
given. Any inertial frame can be called the “lab frame.” Solving problems often 
involves switching back and forth between the lab and CM frames. For example, 
if the final answer is requested in the lab frame, then you may want to transform 
the given information from the lab frame to the CM frame where things are more 
obvious, and then transform back to the lab frame to give the answer. 


Example (Two masses in 1-D): A mass m with speed v approaches a stationary 
mass M (see Fig. 5.13). The masses bounce off each other without any loss in total 
energy. What are the final velocities of the particles? Assume that the motion takes 
place in 1-D. 


Solution: Doing this problem in the lab frame would require a potentially messy 
use of conservation of energy (see the example in Section 5.7.1). But if we work in 
the CM frame, things are much easier. The total momentum in the lab frame is mv, 
so the CM frame moves to the right with speed mv/(m + M) = u with respect to the 
lab frame. Therefore, in the CM frame, the velocities of the two masses are 


Mv mv 
, and vy=O0-u=— : 
m+M m+M 


Um = V — U = (5.60) 
As a double-check, the difference in the velocities is v, and the ratio of the speeds is 
M /m, which makes the total momentum zero. 

The important point to realize now is that in the CM frame, the two particles must 
simply reverse their velocities after the collision (assuming that they do indeed hit 
each other). This is true because the speeds must still be in the ratio M/m after the 
collision, in order for the total momentum to remain zero. Therefore, the speeds must 
either both increase or both decrease. But if they do either of these, then energy is not 
conserved. |° 


19 So we did have to use conservation of energy in this CM-frame solution. But it was far less messy 
than it would have been in the lab frame. 
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If we now go back to the lab frame by adding the CM velocity of mu/(m + M) 
to the two new velocities of —Mvu/(m + M) and mv/(m + M), we obtain final lab 
velocities of 


(5.61) 


Remark: Ifm = M, then the left mass stops, and the right mass picks up a velocity of v 
(this should be familiar to pool players). If M >> m, then the left mass bounces back with 
velocity ~ —v, and the right mass hardly moves (it’s essentially a brick wall). Ifm >> M, 
then the left mass keeps plowing along with velocity ~ v, and the right mass picks up a 
velocity of © 2v. This 2v is an interesting result (it is clearer if you consider things in the 
frame of the heavy mass m, which is essentially the CM frame), and it leads to some neat 
effects, such as in Problem 5.23. & 


5.6.2 Kinetic energy 


Given a system of particles, the relation between the total kinetic energy in two 
different frames is not very enlightening in general. But if one of the frames is 
the CM frame, then the relation turns out to be quite nice. Let S’ be the CM 
frame, which moves at constant velocity u with respect to another frame S. Then 
the velocities of the particles in the two frames are related by v; = vi + u. The 
kinetic energy in the CM frame is 


1 
Kou = 5 milwi. (5.62) 


And the kinetic energy in frame S is 
1 
Ks = 2 X mily; + ul? 


1 
= 5 mvv + 2vj-u+u-w) 
1 


1 
=5 An +u.: em) miv}) + slur Xm; 
1 
= Kem + zM, (5.63) 


where M is the total mass of the system, and where we have used }`; m;v; = 0, 
by definition of the CM frame. Therefore, the K in any frame equals the K in the 
CM frame, plus the K of the whole system treated like a point mass M located 
at the CM, which moves with velocity u. An immediate corollary of this fact is 
that if the K is conserved in a collision in one frame (which implies that Kem 
is conserved, because conservation of momentum says that the CM speed u is 
the same before and after the collision), then it is conserved in any other frame 
(because again, the u relevant to that frame is the same before and after the 
collision). 
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5.7 Collisions 


There are two basic types of collisions among particles, namely elastic ones (in 
which kinetic energy is conserved), and inelastic ones (in which kinetic energy 
is lost). In any collision, the total energy is conserved, but in inelastic collisions 
some of this energy goes into the form of heat (that is, relative motion of the 
molecules inside the particles) instead of showing up in the net translational 
motion of the particles.” 

We’ll deal mainly with elastic collisions here, although some situations are 
inherently inelastic, as we’ll see in Section 5.8. For inelastic collisions where 
it is stated that a certain fraction, say 20%, of the kinetic energy is lost, only a 
trivial modification to the procedure is required. To solve any elastic collision 
problem, we just have to write down the conservation of energy and momentum 
equations, and then solve for whatever variables we want to find. 


5.7.1 One-dimensional motion 


Let’s first look at one-dimensional motion. To see the general procedure, we’ll 
solve the example from Section 5.6.1 again. 


Example (Two masses in 1-D, again): A mass m with speed v approaches a 
stationary mass M (see Fig. 5.14). The masses bounce off each other elastically. What 
are the final velocities of the particles? Assume that the motion takes place in 1-D. 


Solution: Let vf and Vf be the final velocities of the masses. Then conservation 
of momentum and energy give, respectively, 


mv + 0 = mvs + MVF, 
Loa i a Ea (5.64) 
z” +0 = z” + zE- 

We must solve these two equations for the two unknowns vf and Vf. Solving for Vf 
in the first equation and substituting into the second gives 


m? (w — vp)? 
M2 
=> 0=(m+ Myw? — 2mvvp + (m — Mv’, 


mv = mož +M 


> 


=> 0= ((m + M)v¢ — (m — Mw) (vg — v). (5.65) 


One solution is vf = v, but this isn’t the one we’re concerned with. It is of course 
a solution, because the initial conditions certainly satisfy conservation of energy and 


20 We’ll use the terminology where “kinetic energy” refers to the overall translational energy of a 
particle. That is, we’ll exclude heat from this definition, even though heat is just the relative kinetic 
energy of molecules inside a particle. 
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momentum with the initial conditions (a fine tautology indeed). If you want, you can 
view vf = v as the solution where the particles miss each other. The fact that vg = v 
is always a root can often save you a lot of quadratic-formula trouble. 

The ve = v(m — M)/(m + M) root is the one we want. Plugging this vf back into 
the first of Eqs. (5.64) to obtain Vf gives 


(m — M)v 2mv 
Up = ee and Vp= (5.66) 
m 


in agreement with Eq. (5.61). 


This solution was somewhat of a pain, because it involved a quadratic 
equation. The following theorem is extremely useful because it offers a way to 
avoid the hassle of quadratic equations when dealing with 1-D elastic collisions. 


Theorem 5.3 Jna 1l-D elastic collision, the relative velocity of the two particles 
after the collision is the negative of the relative velocity before the collision. 


Proof: Let the masses be m and M. Let vi and V; be the initial velocities, and 
let vp and Vf be the final velocities. Conservation of momentum and energy give 


mvi + MV, = mvg + MVF, 
1 (5.67) 


1 1 1 
-mv? + sur = zmo + zMVẹ. 


2 


Rearranging these yields 


m(vi — vp) = M (Vf — Vi), 


(5.68) 
m? — vs) = M(VÈ — V°). 


Dividing the second equation by the first gives vi + vf = Vi + Vr. Therefore, 
vi — Vi = — (vf — Vt), (5.69) 


as we wanted to show. In taking the quotient of these two equations, we have lost 
the vf = v; and Vf = V; solution. But as stated in the above example, this is the 
trivial solution. E 


This is a splendid theorem. It has the quadratic energy-conservation statement 
built into it. Hence, using this theorem along with momentum conservation (both 
of which are linear equations and thus easy to deal with) gives the same infor- 
mation as the standard combination of Eqs. (5.67). Another quick proof is the 
following. It is fairly easy to see that the theorem is true in the CM frame (as 
we argued in the example in Section 5.6.1), so it is therefore true in any frame, 
because it involves only differences in velocities. 
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5.7.2 Two-dimensional motion 


Let’s now look at the more general case of two-dimensional motion. Three- 
dimensional motion is just more of the same, so we’ll confine ourselves to 2-D. 
Everything is basically the same as in 1-D, except that there is one more momen- 
tum equation, and one more variable to solve for. This is best seen through an 
example. 


Example (Billiards): A billiard ball with speed v approaches an identical stationary 
one. The balls bounce off each other elastically, in such a way that the incoming one 
gets deflected by an angle 0 (see Fig. 5.15). What are the final speeds of the balls? 
What is the angle ø at which the stationary ball is deflected? 


Solution: Let vf and Vp be the final speeds of the balls. Then conservation of py, 
py, and E give, respectively, 


mv = mvp cos 0 + mV cos ġ, 


0 = mvp sind — mV sin d, (5.70) 


1 1 1 
zm? = zmo + 5p 


We must solve these three equations for the three unknowns vf, Vf, and ġ. There are 
various ways to do this. Here’s one. Eliminate @ by adding the squares of the first 
two equations (after putting the vf terms on the left-hand side) to obtain 


u? — Quug cos + v? = VŽ. (5.71) 
Now eliminate Vf by combining this with the third equation to obtain”! 
Up = vcosé. (5.72) 
The third equation then yields 
Ve = vsinð. (5.73) 


The second equation then gives m(vcos@) sin@ = m(vsin@) sing, which implies 
cos@ = sin @ (or 8 = 0, which corresponds to no collision). Therefore, 


$ = 90° — 9. (5.74) 


In other words, the balls bounce off at right angles with respect to each other. This 
fact is well known to pool players. Problem 5.19 gives another (cleaner) way to 
demonstrate this result. Note that we needed to specify one of the four quantities, vf, 
Vf, 0, ġ (we chose 0), because we have only three equations. Intuitively, we can’t 
expect to solve for all four of these quantities, because we can imagine one ball hitting 


21 Another solution is up = 0. In this case, ¢ must equal zero, and @ is not well defined. This is 
simply the 1-D motion in the example in Section 5.6.1. 
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the other at various distances away from directly head-on, which will cause the balls 
to be deflected at various angles. 


As we saw in the 1-D example in Section 5.6.1, collisions are often much 
easier to deal with in the CM frame. Using the same reasoning (conservation 
of p and £) that we used in that example, we conclude that in 2-D (or 3-D) the 
final speeds of two elastically colliding particles must be the same as the initial 
speeds. The only degree of freedom in the CM frame is the angle of the line 
containing the final (oppositely directed) velocities. This simplicity in the CM 
frame invariably provides for a cleaner solution than the lab frame yields. A good 
example of this is Exercise 5.81, which gives yet another way to derive the above 
right-angle billiard result. 


5.8 Inherently inelastic processes 


There is a nice class of problems where the system has inherently inelastic prop- 
erties, even if it doesn’t appear so at first glance. In such a problem, no matter 
how you try to set it up, there will be inevitable kinetic energy loss that shows 
up in the form of heat. Total energy is conserved, of course, since heat is simply 
another form of energy. But the point is that if you try to write down a bunch of 
(1/2)mv’s and conserve their sum, then you’re going to get the wrong answer. 
The following example is the classic illustration of this type of problem. 


Example (Sand on conveyor belt): Sand drops vertically (from a negligible 
height) at a rate o kg/s onto a moving conveyor belt. 


(a) What force must you apply to the belt in order to keep it moving at a constant 
speed v? 

(b) How much kinetic energy does the sand gain per unit time? 

(c) How much work do you do per unit time? 

(d) How much energy is lost to heat per unit time? 


Solution: 


(a) Your force equals the rate of change in momentum. If we let m be the combined 
mass of the conveyor belt plus the sand on the belt, then 


dp d(mv) dv dm 
dt dt "H +t de” ois (5.75) 


where we have used the fact that v is constant. 


F= 


(b) The kinetic energy gained per unit time is 


d |m? dm [{ v2 ov 
= = : (5.76) 
dt 2 dt \ 2 2 
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(c) The work done by your force per unit time is 


d k F dx 
eo) 2 =Fy=ov', (5.77) 


dt dt 
where we have used Eq. (5.75). 
(d) If work is done at a rate ov’, and kinetic energy is gained at a rate ov /2, then 
the “missing” energy must be lost to heat at a rate ov? — ov? /2= ov /2. 


In this example, it turned out that exactly the same amount of energy was lost 
to heat as was converted into kinetic energy of the sand. There is an interesting 
and simple way to see why this is true. In the following explanation, we’ll just 
deal with one particle of mass M that falls onto the conveyor belt, for simplicity. 

In the lab frame, the mass gains a kinetic energy of Mv*/2 by the time it 
finally comes to rest with respect to the belt, because the belt moves at speed v. 
Now look at things in the conveyor belt’s reference frame. In this frame, the mass 
comes flying in with an initial kinetic energy of Mv*/2, and then it eventually 
slows down and comes to rest on the belt. Therefore, all of the Mv?/2 energy 
is converted to heat. And since the heat is the same in both frames, this is the 
amount of heat in the lab frame, too. 

We therefore see that in the lab frame, the equality of the heat loss and the 
gain in kinetic energy is a consequence of the obvious fact that the belt moves at 
the same rate with respect to the lab (namely v) as the lab moves with respect to 
the belt (also v). 

In the solution to the above example, we did not assume anything about the 
nature of the friction force between the belt and the sand. The loss of energy 
to heat is an unavoidable result. You might think that if the sand comes to rest 
on the belt very “gently” (over a long period of time), then you can avoid the 
heat loss. This is not the case. In that scenario, the smallness of the friction force 
is compensated by the fact that the force must act over a very large distance. 
Likewise, if the sand comes to rest on the belt very abruptly, then the largeness 
of the friction force is compensated by the smallness of the distance over which 
it acts. No matter how you set things up, the work done by the friction force is 
the same nonzero quantity. 

In other problems such as the following one, it is fairly clear that the process 
is inelastic. But the challenge is to correctly use F = dp/dt instead of F = ma, 
because F = ma will get you into trouble due to the changing mass. 


Example (Chain on a scale): An “idealized” (see the comments following this 
example) chain with length L and mass density ø kg/m is held such that it hangs 
vertically just above a scale. It is then released. What is the reading on the scale, as a 
function of the height of the top of the chain? 


5.8 Inherently inelastic processes 


First solution: Let y be the height of the top of the chain, and let F be the desired 
force applied by the scale. The net force on the entire chain is F — (o L)g, with upward 
taken to be positive. The momentum of the entire chain (which just comes from the 
moving part) is (oy)y. Note that this is negative, because y is negative. Equating the 
net force on the entire chain with the rate of change in its momentum gives 


_ d(oyy) 
~ dt 


= 0yĵ tos". (5.78) 


F-olg 


The part of the chain that is still above the scale is in free fall. Therefore, ýy = —g. 
And conservation of energy gives ý = ,/2g(L — y), because the chain has fallen a 
distance L — y. Plugging these into Eq. (5.78) gives 


F=olg—oyg+2o(L—y)g 
= 30 (L — y)g, (5.79) 


which happens to be three times the weight of the chain already on the scale. This 
answer for F has the expected property of equaling zero when y = L, and also the 
interesting property of equaling 3(o0L)g right before the last bit touches the scale. 
Once the chain is completely on the scale, the reading suddenly drops down to the 
weight of the chain, namely (oL)g. 

If you used conservation of energy to do this problem and assumed that all of the 
lost potential energy goes into the kinetic energy of the moving part of the chain, then 
you would obtain a speed of infinity for the last infinitesimal part of the chain to hit 
the scale. This is certainly incorrect, and the reason is that there is inevitable heat loss 
that arises when the pieces of the chain inelastically smash into the scale. 


Second solution: The normal force from the scale is responsible for doing two 
things. It holds up the part of the chain that already lies on the scale, and it also 
changes the momentum of the atoms that are suddenly brought to rest when they hit 
the scale. The first of these two parts of the force is simply the weight of the chain 
already on the scale, which is Fweignt = o (Æ — y)g. 

To find the second part of the force, we need to find the change in momentum, dp, 
of the part of the chain that hits the scale during a given time dt. The amount of mass 


that hits the scale in a time dt is dm = o |dy| = o |ý| dt = —oy dt, since ý is negative. 
This mass initially has velocity ý, and then it is abruptly brought to rest. Therefore, 
the change in its momentum is dp = 0—(dm)y = oy dt, which is positive. The force 
required to cause this change in momentum is 


dp : 
Fuplat = Gp = OF" (5.80) 


But as in the first solution, we have ý = „/2g(L — y). Therefore, the total force from 
the scale is 


F = Fyeignt + Fapsar = 0 (L — y)g + 20 (L — yg 
= 30(L—y)g. (5.81) 
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Note that Fap/dt = 2F weight (until the chain is completely on the scale), independent 
of y. 


In this example, we assumed that the chain was “ideal,” in the sense that it was 
completely flexible, infinitesimally thin, and unstretchable. The simplest model 
that satisfies these criteria is a series of point masses connected by short massless 
strings. But in the above example, the strings actually don’t even matter. You 
could instead start with many little unconnected point masses held in a vertical 
line, with the bottom one just above the scale. If you then dropped all of them 
simultaneously, they would successively smash into the scale in the same manner 
as if they were attached by little strings; the tension in the strings would all be 
zero. However, even though the strings aren’t necessary in this chain and scale 
example, there are many setups involving idealized chains where they are in fact 
necessary, because a tension is required in them. This is evident in many of the 
problems and exercises for this chapter, as you will see. 

An interesting fact is that even with the above definition of an ideal 
chain, there are some setups (in contrast with the one above) for which it 
is impossible to specify how the system behaves, without being given more 
information. This information involves the relative size of two specific length 
scales, as we'll see below. To illustrate this, consider the two following 
scenarios for the setup in Problem 5.28 (see Fig. 5.16), where a vertical 
ideal chain is dropped with its bottom end attached to the underneath of a 
support. 


e FIRST SCENARIO (ENERGY NONCONSERVING): Let the spacing between the point masses in 
our ideal chain be large compared with the horizontal span of the bend in the chain at 
its bottom; see Fig. 5.17. Then the system is for all practical purposes one dimensional. 
Each of the masses stops abruptly when it reaches the bend. This stoppage is a completely 
inelastic collision in the same way it was in the above example with the chain falling on 
the scale. Note that at any point in time, the bend consists of a massless piece of string 
folded back along itself (or perhaps it consists of one of the masses, if we happen to be 
looking at it right when a mass stops). There is no tension in this bottom piece of string 
(if there were, then the massless bend would have an infinite acceleration upward), so 
there is no tension pulling down the part of the chain on the left side of the bend. The 
left part of the chain is therefore in freefall. 

e SECOND SCENARIO (ENERGY CONSERVING): Let the spacing between the point masses in 
our ideal chain be small compared with the horizontal span of the bend in the chain 
at its bottom; see Fig. 5.18. The system is now inherently two dimensional, and the 
masses are essentially continuously distributed along the chain, as far as the bend is 
concerned. This has the effect of allowing each mass to gradually come to rest, so 
there is no abrupt inelastic stopping like there was in the first scenario. Each mass 
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in the bend keeps the same distance from its two neighbors, whereas in the first sce- 
nario the mass that has just stopped soon sees the next mass fly directly past it before 
abruptly coming to rest. The process in this second scenario is elastic; no energy is lost 
to heat. 


The basic difference between the two scenarios is whether or not there is slack 
in any of the strings in the bend. If there is, then the relative speed between a pair 
of masses changes abruptly at some point, which means that the relative kinetic 
energy of the masses goes into damped (perhaps very overdamped) vibrational 
motion in the connecting string, which then decays into the random motion 
of heat. 

Ifno energy is lost to heat in the second scenario, then you might think that the 
last infinitesimal piece of the chain will have an infinite speed. However, there 
isn’t one /ast piece of the chain. When the left part of the chain has disappeared 
and we are left with only the bend and the right part, the small (but nonzero) 
bend is the last “piece,” and it ends up swinging horizontally with a large speed. 
This then drags the whole chain to the side in a very visible motion (which can 
be traced to the horizontal force from the support), at which point we have a very 
noticeably two-dimensional system. The initial potential energy of the chain ends 
up as kinetic energy of the final wavy side-to-side motion. 

Aconsequence of energy conservation in the second scenario is that for a given 
height fallen, the left part of the chain will be moving faster than the left part in 
the first scenario. In other words, the left part in the second scenario accelerates 
downward faster than the freefall g. But although this result follows quickly 
from energy considerations, it isn’t so obvious in terms of a force argument. 
Apparently there exists a tension at the left end of the bend in the second scenario 
that drags down the left part of the chain to give it an acceleration greater than 
g. A qualitative way of seeing why a tension exists there is the following. A tiny 
piece of the chain that enters the bend from the left part slows down as it gradually 
joins the fixed right part of the chain. There must therefore be an upward force on 
this tiny piece. This upward force can’t occur at the bottom of the piece, because 
any tension there pulls down on it. The force must therefore occur at the top of 
the piece. In other words, there is a tension at this point, and so by Newton’s third 
law this tension pulls down on the left part of the chain, thereby causing it to 
accelerate faster than g. One of the tasks of Problem 5.29 is to find the tensions 
at the two ends of the bend. 

There is a simple way to demonstrate the existence of a tension that pulls 
on the free part of the chain. The following setup is basically the falling-chain 
setup without gravity, but it still has all the essential parts. Place a rope on a 


22 If the strings were ideal springs with weak spring constants, then the energy would keep changing 
back and forth between potential energy of the springs and kinetic energy of the masses, causing 
the masses to bounce around and possibly run into each other. But we’re assuming that the strings 
in our ideal chain are essentially very rigid overdamped springs. 
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(fairly smooth) table, in the shape of a very thin “U” so that it doubles back 
along itself. Then quickly yank on one of the ends, in the direction away from 
the bend. You will find that the other end moves backwards, in the direction 
opposite to the motion of your hand, toward the bend (at least until the bend 
reaches it and drags it forward). There must therefore exist a tension in the rope 
to drag the other end backwards. But there’s no need to take my word for it — all 
you need is a piece of rope. This effect is essentially the same as the one (in a 
simplified version, since the rope here has constant density) that leads to the crack 
of a whip. 

Note that a perfectly flexible thin rope, with its continuous mass distribution, 
does indeed behave elastically like the second scenario above. The continuous 
rope may be thought of as a series of point masses with infinitesimal separa- 
tion, and so this separation is much smaller than the small (but finite) length 
of the bend. As long as the thickness of the rope is much smaller than the 
length of the bend, every piece of the rope slows to a stop gradually in the 
original falling-chain setup, or starts up from rest gradually in the preceding 
“U” setup. So there is no heat loss in either setup from abrupt changes in 
motion. 

Returning to the falling-chain system with one of our ideal chains, you 
might think that if the bend is made really small, so that the system looks one- 
dimensional, then it should behave inelastically like the first scenario above. 
However, the only relevant fact is whether the bend is smaller than the spacing 
between the point masses in our ideal chain. The word “small” is meaningless, 
of course, because we are talking about the length of the bend, which is a dimen- 
sionful quantity. It makes sense only to use the word “smaller,” that is, to compare 
one length with another. The other length here is the spacing between the masses. 
If the length of the bend is large compared with this spacing, then no matter what 
the actual length of the bend is, the system behaves elastically like the second 
scenario above. 

So which of the two scenarios better describes a real chain? Details of an 
actual experiment involving a falling chain are given in Calkin and March (1989). 
The results show that a real chain behaves basically like the chain in the second 
scenario above, at least until the final part of the motion. In other words, it is 
energy conserving, and the left part accelerates faster than g. 7 

Having said all this, it turns out that the energy-conserving second scenario 
leads to complicated issues in problems (such as the numerical integration in 
Problem 5.29), so for all the problems and exercises in this chapter (with the 
exception of Problem 5.29), we’ll assume that we’re dealing with the inelastic 
first scenario. 


43 Spur-of-the-moment (but still plenty convincing) experiments were also performed by Wes 
Campbell in the physics laboratory of John Doyle at Harvard. 
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Section 5.1 Conservation of energy in one dimension 


5.1. 


5:2: 


5.3; 


5.4. 


5.5. 


Minimum length « 


The shortest configuration of string joining three given points is the one 
shown in the first setup in Fig. 5.19, where all three angles are 120°. *4 
Explain how you could experimentally prove this fact by cutting three 
holes in a table and making use of three equal masses attached to the ends 
of strings, the other ends of which are connected as shown in the second 
setup in Fig. 5.19. 


Heading to zero « 


A particle moves toward x = 0 under the influence of a potential 
V (x) = —A|x|", where A > 0 and n > 0. The particle has barely enough 
energy to reach x = 0. For what values of n will it reach x = 0 ina 
finite time? 


Leaving the sphere « 


A small mass rests on top of a fixed frictionless sphere. The mass is given 
a tiny kick and slides downward. At what point does it lose contact with 
the sphere? 


Pulling the pucks +x 


(a) A massless string of length 2£ connects two hockey pucks that 
lie on frictionless ice. A constant horizontal force F is applied to 
the midpoint of the string, perpendicular to it (see Fig. 5.20). By 
calculating the work done in the transverse direction, find how 
much kinetic energy is lost when the pucks collide, assuming they 
stick together. 

(b) The answer you obtained above should be very clean and nice. 
Find the slick solution that makes it transparent why the answer is 
so nice. 


Constant ý +x 


A bead, under the influence of gravity, slides down a frictionless wire 
whose height is given by the function y(x). Assume that at position 
(x,y) = (0,0), the wire is vertical and the bead passes this point with 
a given speed vo downward. What should the shape of the wire be (that 
is, what is y as a function of x) so that the vertical speed remains vp at all 
times? Assume that the curve heads toward positive x. 


24 Tf the three points form a triangle that has an angle greater than 120°, then the string simply passes 


through the point where that angle is. We won’t worry about this case. 


Fig. 5.19 


Fig. 5.20 
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5.6. 


Sl 


Dividing the heat xxx 


A block rests on a table where the coefficient of kinetic friction is ug. 
You pull the block at a constant speed across the table by applying a 
force uN. Consider a period of time during which the block moves a 
distance d. How much work is done on the block? On the table? How 
much does each object heat up? Is it possible to answer these questions? 
Hint: You'll have to make some sort of model, however crude, for the 
way that friction works. 


V(x) vs. a hill «xx 


A bead, under the influence of gravity, slides along a frictionless wire 
whose height is given by the function V(x), as shown in Fig. 5.21. Find 
an expression for the bead’s horizontal acceleration, x. (It can depend on 
whatever quantities you need it to depend on.) You should find that the 
result is not the same as the x for a particle moving in one dimension 
in the potential mgV (x), in which case ¥ = —gV’. But if you grab hold of 
the wire, is there any way you can move it so that the bead’s ¥ is equal to 
the ¥ = —gV’ result for the one-dimensional potential mgV (x)? 


Section 5.2: Small oscillations 


5.8. Hanging mass 


The potential for a mass hanging from a spring is V (y) = ky?/2 + mgy, 
where y = 0 corresponds to the position of the spring when nothing 
is hanging from it. Find the frequency of small oscillations around the 
equilibrium point. 


5.9. Small oscillations x 


A particle moves under the influence of the potential V(x) = —Cx"e"®. 
Find the frequency of small oscillations around the equilibrium point. 


Section 5.4.1: Gravity 


5.10. Zero force inside a sphere « 


Show that the gravitational force inside a spherical shell is zero by show- 
ing that the pieces of mass at the ends of the thin cones in Fig. 5.22 give 
canceling forces at point P. 


5.11. Escape velocity x 


(a) Find the escape velocity (that is, the velocity above which a 
particle escapes to r = oo) for a particle on a spherical planet 
of radius R and mass M. What is the numerical value for the 
earth? The moon? The sun? 


5.12. 


5.13, 
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(b) Approximately how small must a spherical planet be in order for 
a human to be able to jump off? Assume a density roughly equal 
to the earth’s. 


Ratio of potentials +» 


Consider a cube of uniform mass density. Find the ratio of the gravita- 
tional potential energy of a mass at a corner to that of the same mass 
at the center. Hint: There’s a slick way that doesn’t involve any messy 
integrals. 


Through the hole +x 


(a) A hole of radius R is cut out from an infinite flat sheet with mass 
density o per unit area. Let L be the line that is perpendicular to 
the sheet and that passes through the center of the hole. What is 
the force on a mass m that is located on L, at a distance x from the 
center of the hole? Hint: Consider the plane to consist of many 
concentric rings. 
If a particle is released from rest on L, very close to the center of 
the hole, show that it undergoes oscillatory motion, and find the 
frequency of these oscillations. 
(c) If a particle is released from rest on L, at a distance x from the 
sheet, what is its speed when it passes through the center of the 
hole? What is your answer in the limit x >> R? 


(b 


wm 


Section 5.5.1: Momentum 


5.14. 


5.16. 


Snowball « 


A snowball is thrown against a wall. Where does its momentum go? 
Where does its energy go? 


. Propelling a car «x 


For some odd reason, you decide to throw baseballs at a car of mass 
M that is free to move frictionlessly on the ground. You throw the 
balls at the back of the car at speed u, and they leave your hand at a 
mass rate of o kg/s (assume the rate is continuous, for simplicity). If 
the car starts at rest, find its speed and position as a function of time, 
assuming that the balls bounce elastically directly backward off the back 
window. 


Propelling a car again +x 


Do the previous problem, except now assume that the back window is 
open, so that the balls collect inside the car. 
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5.17. Leaky bucket «x 


Att = 0, a massless bucket contains a mass M of sand. It is connected to 
a wall by a massless spring with constant tension T (that is, independent 
of length).*> See Fig. 5.23. The ground is frictionless, and the initial 
Ẹ P distance to the wall is L. At later times, let x be the distance from the 
= wall, and let m be the mass of sand in the bucket. The bucket is released, 
and on its way to the wall, it leaks sand at a rate dm/dx = M/L. In other 
Fig. 5.23 words, the rate is constant with respect to distance, not time; and it ends 
up empty right when it reaches the wall. Note that dx is negative, so dm 

is also. 


(a) What is the kinetic energy of the (sand in the) bucket, as a function 
of x? What is its maximum value? 

(b) What is the magnitude of the momentum of the bucket, as a 
function of x? What is its maximum value? 


5.18. Another leaky bucket xxx 


Consider the setup in Problem 5.17, but now let the sand leak at a rate 
proportional to the bucket’s acceleration. That is, dm/dt = bx. Note that 
X is negative, so dm is also. 


(a) Find the mass as a function of time, m(t). 

(b) Find v(t) and x(t) during the time when the bucket contains a 
nonzero amount of sand. Also find v(m) and x(m). What is the 
speed right before all the sand leaves the bucket (assuming it 
hasn’t hit the wall yet)? 

(c) What is the maximum value of the bucket’s kinetic energy, 
assuming it is achieved before it hits the wall? 

(d) What is the maximum value of the magnitude of the 
bucket’s momentum, assuming it is achieved before it hits 
the wall? 

(e) For what value of b does the bucket become empty right when it 
hits the wall? 


Section 5.7: Collisions 


5.19. Right angle in billiards x 


A billiard ball collides elastically with an identical stationary one. Use 
the fact that mv? /2 may be written as m(v - v)/2 to show that the angle 


25 You can construct a constant-tension spring with a regular Hooke’s-law spring in the following 
way. Pick the spring constant to be very small, and stretch the spring a very large distance. Have 
it pass through a hole in the wall, with its other end bolted down a large distance to the left of the 
wall. Any changes in the bucket’s position will yield a negligible change in the spring force. 
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between the resulting trajectories is 90°. Hint: Take the dot product of 
the conservation of momentum equation with itself. 


5.20. Bouncing and recoiling «« 


A ball of mass m and initial speed vg bounces back and forth between 

a fixed wall and a block of mass M, with M >> m; see Fig. 5.24. The 

block is initially at rest. Assume that the ball bounces elastically and m Vo 
instantaneously. The coefficient of kinetic friction between the block 
and the ground is u. There is no friction between the ball and the ground. 
What is the speed of the ball after the mth bounce off the block? How far Fig. 5.24 
does the block eventually move? How much total time does the block 

actually spend in motion? Work in the approximation where M > m, 

and assume that the distance to the wall is large enough so that the block 

comes to rest by the time the next bounce occurs. 


5.21. Drag force on a sheet «x 


A sheet of mass M moves with speed V through a region of space 
that contains particles of mass m and speed v. There are n of these 
particles per unit volume. The sheet moves in the direction of its normal. 
Assume m < M, and assume that the particles do not interact with 
each other. 


(a) Ifv < V, what is the drag force per unit area on the sheet? 

(b) Ifv >> V, what is the drag force per unit area on the sheet? 
Assume, for simplicity, that the component of every particle’s 
velocity in the direction of the sheet’s motion is exactly +u/2.° 


5.22. Drag force on a cylinder x» 


Acylinder of mass M and radius R moves with speed V through a region 
of space that contains particles of mass m that are at rest. There are n 
of these particles per unit volume. The cylinder moves in a direction 
perpendicular to its axis. Assume m < M, and assume that the particles 
do not interact with each other. What is the drag force per unit length on 
the cylinder? 


5.23. Basketball and tennis ball «« 


(a) A tennis ball with a small mass mp sits on top of a basketball with 
a large mass mı (see Fig. 5.25). The bottom of the basketball 
is a height A above the ground, and the bottom of the tennis 
ball is a height A + d above the ground. The balls are dropped. 
To what height does the tennis ball bounce? Note: Work in the 


Fig. 5.25 


6 Tn reality, the velocities are randomly distributed, but this idealization actually gives the correct 
answer because the average speed in any direction is |vy| = v/2, as you can show. 
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approximation where mı is much larger than m2, and assume that 
the balls bounce elastically. Also assume, for the sake of having 
a nice clean problem, that the balls are initially separated by a 
small distance, and that the balls bounce instantaneously. 

Now consider n balls, B1, ..., Bn, having masses m1, m2, ..., Mn 
(with m; >> m > --- >> my), standing in a vertical stack (see 
Fig. 5.26). The bottom of Bı is a height A above the ground, and 
the bottom of B, is a height A + £ above the ground. The balls are 
dropped. In terms of n, to what height does the top ball bounce? 
Note: Make assumptions and approximations similar to the ones 
in part (a). 

Ifh = 1 meter, what is the minimum number of balls needed for 
the top one to bounce to a height of at least 1 kilometer? To reach 
escape velocity? Assume that the balls still bounce elastically 
(which is a bit absurd here), and ignore wind resistance, etc., and 
assume that £ is negligible. 


~ 


(b 


Maximal deflection xxx 


A mass M collides with a stationary mass m. If M < m, then it is possi- 
ble for M to bounce directly backward. However, if M > m, then there 
is a maximal angle of deflection of M. Show that this maximal angle 
equals sin™!(m/M). Hint: It is possible to do this problem by working 
in the lab frame, but you can save yourself a lot of time by consider- 
ing what happens in the CM frame, and then shifting back to the lab 
frame. 


Section 5.8: Inherently inelastic processes 


5.26. 


Note: In the problems involving chains in this section (with the exception 
of Problem 5.29), we’ll assume that the chains are of the type described 
in the first scenario near the end of Section 5.8. 


. Colliding masses x 


A mass M, initially moving at speed V, collides and sticks to a mass m, 
initially at rest. Assume M >> m, and work in this approximation. What 
are the final energies of the two masses, and how much energy is lost to 
heat, in: 

(a) The lab frame? 

(b) The frame in which M is initially at rest? 


Pulling a chain +» 


A chain with length Z and mass density o kg/m lies straight on a friction- 
less horizontal surface. You grab one end and pull it back along itself, 
in a parallel manner (see Fig. 5.27). Assume that you pull it at constant 
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speed v. What force must you apply? What is the total work that you 
do, by the time the chain is straightened out? How much energy is lost 
to heat, if any? 


5.27. Pulling a chain again «+x 


A chain with mass density o kg/m lies in a heap on the floor. You 
grab an end and pull horizontally with constant force F. What is the 
position of the end of the chain, as a function of time, while it is unrav- 
elling? Assume that the chain is greased, so that it has no friction with 
itself. 


5.28. Falling chain +x 


A chain with length Z and mass density o kg/m is held in the position 
shown in Fig. 5.28, with one end attached to a support. Assume that 
only a negligible length of the chain starts out below the support. The 
chain is released. Find the force that the support applies to the chain, as 
a function of time. 


5.29. Falling chain (energy conserving) ++» 


Consider the setup in the previous problem, but now let the chain be of 
the type in the second scenario described in Section 5.8. Show that the 
total time it takes the chain to straighten out is approximately 85% of 
the time it would take if the left part were in freefall (as it was in the 
previous problem); you will need to solve something numerically. Also, 
show that the tension at the left end of the infinitesimal bend equals the 
tension at the right end at all times.”’ 


5.30. Falling from a table xxx 


(a) Achain with length Z lies in a straight line on a frictionless table, 
except for a very small piece at one end which hangs down through 
ahole in the table. This piece is released, and the chain slides down 
through the hole. What is the speed of the chain at the instant it 
loses contact with the table? (See Footnote 3.19.) 

(b) Answer the same question, but now let the chain lie in a heap ona 
table, except for a very small piece at one end which hangs down 
through the hole. Assume that the chain is greased, so that it has 
no friction with itself. Which of these two scenarios has the larger 
final speed? 


27 The “ends” of the bend actually aren’t well defined, because the chain is at least a little bit curved 
everywhere. But since we’re assuming that the horizontal span of the chain is very small, we can 
define the height of the bend to be, say, 100 times this horizontal span, and this height is still 
negligible compared with the total height of the chain. 
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The raindrop «***« 


Assume that a cloud consists of tiny water droplets suspended (uniformly 
distributed, and at rest) in air, and consider a raindrop falling through 
them. What is the acceleration of the raindrop? Assume that the rain- 
drop is initially of negligible size and that when it hits a water droplet, 
the droplet’s water gets added to it. Also, assume that the raindrop is 
spherical at all times. 
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Section 5.1: Conservation of energy in one dimension 


5.32. 


5.34. 


5.36. 


Cart in a valley 


A cart containing sand starts at rest and then rolls, without any energy 
loss to friction, down into a valley and then up a hill on the other side. 
Let the initial height be /1, and let the final height attained on the other 
side be h2. If the cart leaks sand along the way, how does A2 compare 
with A1? 


. Walking on an escalator 


An escalator moves downward at constant speed. You walk up the esca- 
lator at this same speed, so that you remain at rest with respect to the 
ground. In the ground frame, are you doing any work? 


Lots of work 


If you push on a wall with your hand, you don’t do any work, because 
your hand doesn’t move. But in the reference frame of a person moving 
past you (from front to back), you do in fact do work, because your 
hand moves. And since the person’s speed can be made arbitrarily large, 
you can do an arbitrarily large amount of work in the person’s frame. 
It therefore seems like you should quickly use up your dinner from the 
night before and become very hungry. But you don’t. Why not? 


. Spring energy 


Using the explicit form of the position of a mass on the end of a spring, 
x(t) = Acos(wt + @), verify that the total energy is conserved. 


Damping work « 

A damped oscillator (with mx = —kx — bx) has initial position x9 and 
speed vo. After a long time, it will essentially be at rest at the origin. 
Therefore, by the work—energy theorem, the work done by the damping 
force must equal — kx /2— mvg /2. Verify that this is true. Hint: It’s 
rather messy to explicitly find x in terms of the initial conditions and 


5.37. 


5.39. 


5.40. 


5.41. 


5.42. 
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then calculate the desired integral. An easier way is to use the F = ma 
equation to rewrite the x in your integral. 


Heading to infinity « 


A particle moves under the influence of a potential V(x) = —A|x|", 
where A > 0 and n > 0. It starts at a positive value of x with a 
velocity that points in the positive x direction. For what values of n 
will the particle reach infinity in a finite time? You may assume that 
E > 0, although this isn’t necessary. (You can compare this exercise with 
Problem 5.2.) 


. Work in different frames « 


An object, initially at rest, is subject to a force that gives it constant 
acceleration a for time t. Verify explicitly that W = AK in (a) the lab 
frame, and (b) a frame moving to the left at speed V. 


Roller coaster « 


A roller coaster car starts at rest and coasts down a frictionless track. 
It encounters a vertical loop of radius R. How much higher than the top 
of the loop must the car start if it is to remain in contact with the track 
at all times? 


Pendulum and peg « 


A pendulum of length Z is held with its string horizontal, and then 
released. The string runs into a peg a distance d below the pivot, as 
shown in Fig. 5.29. What is the smallest value of d for which the string 
remains taut at all times? 


Circling around a cone x 


A fixed hollow frictionless cone is positioned with its tip pointing down. 
A particle is released from rest on the inside surface. After it has slid part 
way down to the tip, it bounces elastically off a platform. The platform 
is positioned at a 45° angle along the surface of the cone, so the particle 
ends up being deflected horizontally along the surface (in other words, 
into the page in Fig. 5.30). If the resulting motion of the particle is a 
horizontal circle around the cone, what is the ratio of the initial height 
of the particle to the height of the platform? 


Hanging spring « 
A massless spring with spring constant k hangs vertically from a ceiling, 


initially at its relaxed length. A mass m is then attached to the bottom 
and is released. 


(a) Calculate the potential energy V of the system, as a function of 
the height y (which is negative), relative to the initial position. 
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Fig. 5.32 


5.43. 


(b) Find yo, the point at which the potential energy is minimum. Make 
a rough plot of V (y). 

(c) Rewrite the potential energy as a function of z = y — yo. Explain 
why your result shows that a hanging spring can be considered 
to be a spring in a world without gravity, provided that the new 
equilibrium point, yo, is now called the “relaxed” length of the 
spring. 


Removing the friction + 


A block of mass m is supported by a spring on an inclined plane, as shown 
in Fig. 5.31. The spring constant is k, the plane’s angle of inclination is 
8, and the coefficient of static friction between the block and the plane 
is 4. 


(a) You move the block down the plane, compressing the spring. 
What is the maximum compression distance of the spring (relative 
to the relaxed length it has when nothing is attached to it) that 
allows the block to remain at rest when you let go of it? 

(b) Assume that the block is at the maximum compression you found 
in part (a). At a given instant, you somehow cause the plane to 
become frictionless, and the block gets pushed up along the plane. 
What must the relation between @ and the original u be, so that 
the block reaches its maximum height when the spring is at its 
relaxed length? 


Spring and friction +» 


A spring with spring constant k stands vertically, and a mass m is placed 
on top of it. The mass is slowly lowered to its equilibrium position. 
With the spring held at this compression length, the system is rotated to 
a horizontal position. The left end of the spring is attached to a wall, and 
the mass is placed on a table with a coefficient of friction (both kinetic 
and static) of u = 1/8; see Fig. 5.32. The mass is released. 


(a) What is the initial compression of the spring? 

(b) How much does the maximal stretch (or compression) of the 
spring decrease after each half-oscillation? 

(c) How many times does the mass oscillate back and forth before 
coming to rest? 


. Keeping contact «x 


A frictionless circle of radius R is made out of a strip of metal and held 
fixed in a vertical plane. A massless spring with spring constant k has one 
end attached to the bottom point on the inside surface of the circle, and the 
other end attached to a mass m. The spring is compressed to zero length, 
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with the mass touching the inside surface of the circle at the bottom. 
(Whatever negligible length of spring remains is essentially horizontal.) 
The spring is then released, and the mass gets pushed initially to the 
right and then up along the circle; the setup at a random later time is 
shown in Fig. 5.33. Let £ be the equilibrium length of the spring. What 
is the minimum value of £ for which the mass remains in contact with 
the circle at all times? 


5.46. Spring and hoop ++ 


A fixed hoop of radius R stands vertically. A spring with spring constant 
k and relaxed length of zero is attached to the top of the hoop. 


Fig. 5.33 
(a) A block of mass m is attached to the unstretched spring and 
dropped from the top of the hoop. If the resulting motion of the 
mass is a linear vertical oscillation between the top and bottom 
points on the hoop, what is k? 
(b) The block is now removed from the spring, and the spring is re 
art (bD): 


stretched and connected to a bead, also of mass m, at the bottom 
of the hoop, as shown in Fig. 5.34. The bead is constrained to 
move along the hoop. It is given a rightward kick and acquires an 
initial speed vo. Assuming that it moves frictionlessly, how does 
its speed depend on its position along the hoop? 


5.47. Constant x +x 


A bead, under the influence of gravity, slides along a frictionless wire aa 
whose height is given by the function y(x). Assume that at position 

(x,y) = (0, 0), the wire is horizontal and the bead passes this point with Fig, 5.34 

a given speed vo to the right. What should the shape of the wire be (that 

is, what is y as a function of x) so that the horizontal speed remains vo 

at all times? One solution is simply y = 0. Find the other.** 


5.48. Over the pipe x» 


A frictionless cylindrical pipe with radius r is positioned with its axis 
parallel to the ground, at height A. What is the minimum speed at which 
a ball must be thrown (from ground level) in order to make it over the 
pipe? Consider two cases: (a) the ball is allowed to touch the pipe, and 
(b) the ball is not allowed to touch the pipe. 


5.49. Pendulum projectile +x 
A pendulum is held with its string horizontal and is then released. The 


mass swings down, and then on its way back up, the string is cut when 


28 Solve this exercise in the spirit of Problem 5.5, that is, by solving a differential equation. Once 
you get the answer, you’ll see that you could have just written it down without any calculations, 
based on your knowledge of a certain kind of physical motion. 
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it makes an angle 0 with the vertical; see Fig. 5.35. What should 6 be so 
that the mass travels the largest horizontal distance by the time it returns 
to the height it had when the string was cut? 


. Centered projectile motion +x 


A mass is attached to one end of a massless string, the other end of which 
is attached to a fixed support. The mass swings around in a vertical circle 
as shown in Fig. 5.36. Assuming that the mass has the minimum speed 
necessary at the top of the circle to keep the string from going slack, at 
what location should you cut the string so that the resulting projectile 
motion of the mass has its maximum height located directly above the 
center of the circle? 


. Beads ona hoop +» 


Two beads of mass m are initially at rest at the top of a frictionless hoop 
of mass M and radius R, which stands vertically on the ground. The 
beads are given tiny kicks, and they slide down the hoop, one to the 
right and one to the left, as shown in Fig. 5.37. What is the largest value 
of m/M for which the hoop never rises up off the ground? 


. Stationary bowl ++» 


A hemispherical bowl of mass M rests on a table. The inside surface 
of the bowl is frictionless, while the coefficient of friction between the 
bottom of the bowl and the table is u = 1. A particle of mass m is 
released from rest at the top of the bowl and slides down into it, as 
shown in Fig. 5.38. What is the largest value of m/M for which the 
bowl never slides on the table? Hint: The angle you’re concerned with 
is not 45°. 


. Leaving the hemisphere ««x« 


A point particle of mass m sits at rest on top of a frictionless hemisphere 
of mass M, which rests on a frictionless table. The particle is given a 
tiny kick and slides down the (recoiling) hemisphere. At what angle 6 
(measured from the top of the hemisphere) does the particle lose contact 
with the hemisphere? In answering this question for m #4 M, it is suf- 
ficient for you to produce an equation that 6 must satisfy (it’s a cubic). 
However, for the special case of m = M, the equation can be solved 
without too much difficulty; find the angle in this case. 


Tetherball xxxx 


A small ball is attached to a massless string of length L, the other end 
of which is attached to a very thin pole. The ball is thrown so that it 
initially travels in a horizontal circle, with the string making an angle 
4 with the vertical. As time goes on, the string wraps itself around the 
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pole. Assume that (1) the pole is thin enough so that the length of string 
in the air decreases very slowly, so that the ball’s motion may always be 
approximated as a circle, and (2) the pole has enough friction so that the 
string does not slide on the pole, once it touches it. Show that the ratio 
of the ball’s final speed (right before it hits the pole) to initial speed is 
vf/vi = sin 00. 


Section 5.4: Gravity 
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Projectile between planets « 


Two planets of mass M and radius R are at rest (somehow) with respect 
to each other, with their centers a distance 4R apart. You wish to fire 
a projectile from the surface of one planet to the other. What is the 
minimum firing speed for which this is possible? 


Spinning quickly « 


Consider a planet with uniform mass density p. If the planet rotates 
too fast, it will fly apart. Show that the minimum period of rotation is 


given by 
ee 
Gp 


What is the minimum T if p = 5.5 g/cm? (the average density of the 
earth)? 


5.57. A cone xx 


5.58. 


(a) A particle of mass m is located at the tip of a hollow cone (such 
as an ice cream cone without the ice cream) with surface mass 
density o. The slant height of the cone is L, and the half-angle at 
the vertex is 0. What can you say about the gravitational force on 
the mass m due to the cone? 

(b) If the top half of the cone is removed and thrown away (see 
Fig. 5.39), what is the gravitational force on the mass m due to 
the remaining part of the cone? For what angle @ is this force 
maximum? 


Sphere and cones x» 


(a) Consider a thin hollow fixed spherical shell of radius R and sur- 
face mass density o . A particle initially at rest falls in from infinity. 
What is its speed when it reaches the center of the shell? Assume 
that a tiny hole has been cut in the shell to let the particle through; 
see Fig. 5.40(a). 
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(b) Consider two hollow fixed cones (such as ice cream cones with- 
out the ice cream), arranged as shown in Fig. 5.40(b). They have 
base radius R, slant height L, and surface mass density o. A par- 
ticle initially at rest falls in from infinity, along the perpendicular 
bisector line shown. What is its speed when it reaches the tip of 
the cones? 


Ratio of potentials +» 


Consider the following two systems: (1) a mass m is placed at a corner of 
a flat square sheet of mass M, and (2) a mass m is placed at the center of 
a flat square sheet of mass M. What is the ratio of the potential energies 
of m in the two systems? Hint: Find A and B in the suggestive relations 
in Fig. 5.41. You’ll need to use a scaling argument to find B. 


Solar escape velocity x+» 


What is the minimum initial velocity (with respect to the earth) required 
for an object to escape from the solar system?” Take the orbital motion 
of the earth into account (but ignore the rotation of the earth, and ignore 
the other planets). You are free to choose (wisely) the firing direction. 
Make the (good) approximation that the process occurs in two separate 
steps: first the object escapes from the earth, and then it escapes from 
the sun (starting at the radius of the earth’s orbit). Some useful quantities 
are given in the solution to Problem 5.11; also, the orbital speed of the 
earth is about 30 km/s. Hint: a common incorrect answer is 13.5 km/s. 


. Spherical shell xx 


(a) Aspherical shell of mass M has inner radius R and outer radius 
R2. A particle of mass m is located a distance r from the center of 
the shell. Calculate (and make a rough plot of) the force on m, as 
a function of r, for 0 < r < oo. 

(b) Ifthe mass m is dropped from r = oo and falls down through the 
shell (assume that a tiny hole has been drilled in it), what will its 
speed be at the center of the shell? You can let Rọ = 2R; in this 
part of the problem, to keep things from getting too messy. Give 
your answer in terms of R = R}. 


. Orbiting stick ++ 


Consider a planet of mass M and radius R. A very long stick of length 
2R extends from just above the surface of the planet out to a radius 3R. 
If initial conditions have been set up so that the stick moves in a circular 
orbit while always pointing radially (see Fig. 5.42), what is the period 


2° This problem is discussed in Hendel and Longo (1988). 
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of this orbit? How does this period compare with the period of a satellite 
in a circular orbit of radius 2R? 


Speedy travel x» 


A straight tube is drilled between two points on the earth, as shown 
in Fig. 5.43. An object is dropped into the tube. What is the resulting 
motion? How long does it take to reach the other end? Ignore friction, 
and assume (incorrectly) that the density of the earth is constant (o = 
5.5 g/cm’). 


5.64. Mine shaft «« 


(a) If the earth had constant density, the gravitational force would 


decrease linearly with radius as you descend in a mine shaft; see 
Eq. (5.44). However, the density of the earth is not constant, and 
in fact the gravitational force increases as you descend. Show that 
the general condition under which this is true is pe < (2/3) Pavg, 
where pavg is the average density of the earth, and pe is the density 
of the crust at the surface. (The values for the earth are pe ~ 
3 g/cm? and Pavg © 5.5 g/cm>.) See Zaidins (1972). 


(b) A similar problem to the one in part (a), which actually turns 


out to be exactly the same, is the following. Consider a large 
flat horizontal sheet of material with density p and thickness 
x. Show that the gravitational force (from the earth plus the 
sheet) just below the sheet is larger than the force just above it 
if o < (2/3) avg, where pavg is the average density of the earth. 
A sheet of wood (with a density roughly equal to that of water) 
satisfies this inequality, but a sheet of gold doesn’t. A result from 
Problem 5.13 will be useful here. 


(c) Assuming that the density of a planet is a function of radius only, 


what should p(r) look like if you want the gravitational force to 
be independent of the depth in a mine shaft, all the way down to 
the center of the planet? 


5.65. Space elevator +x 


(a) Let the earth’s radius be R, its average density be p, and its angular 


frequency of rotation be w. Show that if a satellite is to remain 
above the same point on the equator at all times, then it must 
travel in a circle of radius nR, where 


3 __ 41Gp 
4 3w? 


(5.82) 


What is the numerical value of 7? 


187 


Fig. 5.43 


188 


Conservation of energy and momentum 


5.66. 


5.67. 


(b) Instead of a satellite, consider a long rope with uniform mass 
density extending radially from just above the surface of the earth 
out to a radius 7’R. °° Show that if the rope is to remain above the 
same point on the equator at all times, then 7’ must be given by 

8G 
n+ rf =e (5.83) 
3w 
What is the numerical value of 7’? Where does the tension in the 
rope achieve its maximum value? Hint: no messy calculations 
required. 


Force from a straight wire x+ 


A particle of mass m is placed a distance £ away from an infinitely long 
straight wire with mass density o kg/m. Show that the force on the 
particle is F = 2Gom/€. Do this in two ways: 


(a) Integrate along the wire the contributions to the force. 

(b) Integrate along the wire the contributions to the potential, and then 
differentiate to obtain the force. You will find that the potential due 
to the infinite wire is infinite,*! but you can escape this difficulty 
by letting the wire have a large but finite length, then finding the 
potential and force, and then letting the length go to infinity. 


Maximal gravity xxx 


Given a point P in space, and given a piece of malleable material of 
constant density, how should you shape and place the material in order 
to create the largest possible gravitational field at P? 
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Maximum P and £ of a rocket « 


A rocket that starts at rest with mass M ejects exhaust at a given speed 
u. What is the mass of the rocket (including unused fuel) when its 
momentum is maximum? What is the mass when its energy is maximum? 


Speedy rockets x 


Assume that it is impossible to build a structurally sound container that 
can hold fuel of more than, say, nine times its mass (the actual limit 
is higher than this, but let’s use this number just to be concrete). It 
would then seem like the limit for the speed of a rocket is u In 10, from 
Eq. (5.54). How can you build a rocket that goes faster than this? 


30 Any proposed space elevator wouldn’t have uniform mass density. But this simplified problem 


still gives a good idea of the general features. For more on the space elevator, see Aravind (2007). 


31 There’s nothing bad about this. All that matters as far as the force is concerned is differences in 


the potential, and these differences are finite. 
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Snow on a sled, quantitative «« 


Consider the setup in the example in Section 5.5.1. Att = 0, let the mass 
of the sled (including you) be M, and let its speed be Vo. If the snow 
hits the sled at a rate of o kg/s, find the speed as a function of time for 
the three cases. 


Leaky bucket «xx 


Consider the setup in Problem 5.17, but now let the sand leak at a rate 
dm/dt = —bM. In other words, the rate is constant with respect to time, 
not distance. We’ve factored out an M here, just to make the calculations 
a little nicer. 


(a) Find v(t) and x(t) during the time when the bucket contains a 
nonzero amount of sand. 

(b) What is the maximum value of the bucket’s kinetic energy, 
assuming it is achieved before it hits the wall? 

(c) What is the maximum value of the magnitude of the bucket’s 
momentum, assuming it is achieved before it hits the wall? 

(d) For what value of b does the bucket become empty right when it 
hits the wall? 


Throwing a brick +x» 


A brick is thrown from ground level, at an angle 6 with respect to the 
(horizontal) ground. Assume that the long face of the brick remains 
parallel to the ground at all times, and that there is no deformation in the 
ground or the brick when the brick hits the ground. If the coefficient of 
friction between the brick and the ground is jz, what should 0 be so that 
the brick travels the maximum total horizontal distance before finally 
coming to rest? Assume that the brick doesn’t bounce. Hint: The brick 
slows down when it hits the ground. Think in terms of impulse. 


Section 5.7: Collisions 


5.73. 


5.74. 


A one-dimensional collision « 


Consider the following one-dimensional collision. A mass 2m moves to 
the right, and a mass m moves to the left, both with speed v. They collide 
elastically. Find their final lab-frame velocities. Solve this by: 


(a) Working in the lab frame. 
(b) Working in the CM frame. 
Perpendicular vectors x 


A moving mass m collides elastically with a stationary mass 2m. Let 
their resulting velocities be vı and v2, respectively. Show that v2 must 
be perpendicular to 2v; + v2. Hint: See Problem 5.19. 
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Three pool balls x 


A pool ball with initial speed v is aimed right between two other pool 
balls, as shown in Fig. 5.44. If the two right balls leave the (elas- 
tic) collision with equal speeds, find the final velocities of all three 
balls. 


. Seven pool balls +x 


Seven pool balls are situated at rest as shown in Fig. 5.45. The middle ball 
suddenly somehow acquires a speed v to the right. Assume that starting 
with ball A, the balls spiral out an infinitesimal amount. So A is closer to 
the center ball than B is, and B is closer than C is, etc. This means that the 
center ball collides with A first, then it gets deflected into B, and then it 
gets deflected into C, and so on. But all the collisions happen in the blink 
of an eye. What will the center ball’s velocity be after it collides (elas- 
tically) with all six balls? (You can use the results from the example in 
Section 5.7.2.) 


. Midair collision x» 


A ball is held and then released. At the instant it is released, an identical 
ball, moving horizontally with speed v, collides elastically with it and is 
deflected at an upward angle. What is the maximum horizontal distance 
the latter ball can travel by the time it returns to the height of the collision? 
(You can use the results from the example in Section 5.7.2.) 


. Maximum number of collisions +» 


N identical balls are constrained to move in one dimension. If you are 
allowed to pick their initial velocities, what is the maximum number 
of collisions you can arrange for the balls to have among themselves? 
Assume that the collisions are elastic. 


Triangular room «x 


A ball is thrown against a wall of a very long triangular room which has 
vertex angle 0. The initial direction of the ball is parallel to the angle 
bisector (see Fig. 5.46). How many (elastic) bounces does the ball make? 
Assume that the walls are frictionless. 


Equal angles «x 


(a) Amass 2m moving at speed vp collides elastically with a stationary 
mass m. If the two masses scatter at equal (nonzero) angles with 
respect to the incident direction, what is this angle? 

(b) What is the largest number that the above “2” can be replaced 
with, if you want it to be possible for the masses to scatter at 
equal angles? 
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. Right angle in billiards +» 


A billiard ball collides elastically with an identical stationary one. By 
looking at the collision in the CM frame, show that the angle between 
the resulting trajectories in the lab frame is 90°. (We proved this result 
by working in the lab frame in the example in Section 5.7.2.) 


Equal v,’s_ x* 


A mass m moving with speed v in the x direction collides elastically with 
a stationary mass nm, where n is some number. After the collision, it is 
observed that both masses have equal x components of their velocities. 
What angle does the velocity of mass nm make with the x axis? (This 
can be solved by working in the lab frame or the CM frame, but the CM 
solution is slick.) 


. Maximum vy x 


A mass M moving in the positive x direction collides elastically with 
a stationary mass m. The collision is not necessarily head-on, so the 
masses may come off at angles, as shown in Fig. 5.47. Let 6 be the 
angle of m’s resulting motion. What should 8 be so that m has the largest 
possible speed in the y direction? Hint: Think about what the collision 
should look like in the CM frame. 


Bouncing between rings +» 


Two fixed circular rings, in contact with each other, stand in a vertical 
plane. A ball bounces elastically back and forth between the rings (see 
Fig. 5.48). Assume that initial conditions have been set up so that the 
ball’s motion forever lies in one parabola. Let this parabola hit the rings 
at an angle 6 from the horizontal. Show that if you want the magnitude 
of the change in the horizontal component of the ball’s momentum at 
each bounce to be maximum, then you should pick cos 8 = (V5 — 1)/2, 
which just happens to be the inverse of the golden ratio. 


. Bouncing between surfaces xw 


Consider the following generalization of the previous exercise. A ball 
bounces back and forth between a surface defined by f (x) and its reflec- 
tion across the y axis (see Fig. 5.49). Assume that initial conditions have 
been set up so that the ball’s motion forever lies in one parabola, with the 
contact points located at x9. For what function f (x) is the magnitude 
of the change in the horizontal component of the ball’s momentum at 
each bounce independent of xo? 


Drag force on a sphere +x 


A sphere of mass M and radius R moves with speed V through a region 
of space that contains particles of mass m that are at rest. There are n of 


191 


3 
es 


Fig. 5.48 


Fig. 5.49 


192 


Conservation of energy and momentum 


these particles per unit volume. Assume m < M, and assume that the 
particles do not interact with each other. What is the drag force on the 
sphere? 


. Balls in a semicircle +x» 


N identical balls lie equally spaced in a semicircle on a frictionless 
horizontal table, as shown. The total mass of these balls is M. Another 
ball of mass m approaches the semicircle from the left, with the proper 
initial conditions so that it bounces (elastically) offall N balls and finally 
leaves the semicircle, heading directly to the left (see Fig. 5.50). 


(a) In the limit N — oo (so the mass of each ball in the semicircle, 
M /N, goes to zero), find the minimum value of M /m that allows 
the incoming ball to come out heading directly to the left. Hint: 
You’ll need to do Problem 5.24 first. 

(b) In the minimum M /m case found in part (a), show that the ratio 


of m’s final speed to initial speed equals e~”. 


. Block and bouncing ball xxxs 


A block with large mass M slides with speed Vp on a frictionless table 
toward a wall. It collides elastically with a ball with small mass m, which 
is initially at rest at a distance L from the wall. The ball slides toward the 
wall, bounces elastically, and then proceeds to bounce back and forth 
between the block and the wall. 


(a) How close does the block come to the wall? 
(b) How many times does the ball bounce off the block, by the time 
the block makes its closest approach to the wall? 


Assume M > m, and give your answers to leading order in m/M. 


Section 5.8: Inherently inelastic processes 


5.89. 


5.90. 


Note: In the exercises involving chains in this section, we'll assume that 
the chains are of the type described in the first scenario near the end of 
Section 5.8. 


Slowing down, speeding up + 

A plate of mass M moves horizontally with initial speed v on a friction- 
less table. A mass m is dropped vertically onto it and soon comes to rest 
with respect to it. How much energy is required to bring the system back 
up to speed v? Explain intuitively your answer in the M > m limit. 
Pulling a chain back x» 


A chain with length L and mass density o kg/m lies outstretched on 
a frictionless horizontal table. You grab one end and pull it back along 


5.91. 


5.92. 


5.93. 


5.94. 


5.95. 


5.10 Exercises 


itself, in a parallel manner, as shown in Fig. 5.51. If your hand starts from 
rest and has constant acceleration a, what is your force at the moment 
right before the chain is straightened out? 


Falling chain «« 


A chain with length Z and mass density ø kg/m is held in a heap, and 
you grab an end that protrudes a tiny bit out of the top. The chain is 
then released. As a function of time, what is the force that your hand 
must apply to keep the top end of the chain motionless? Assume that 
the chain has no friction with itself, so that the remaining part of the 
heap is always in freefall. The setup at a random later time is shown in 
Fig. 5.52. 


Pulling a chain down +» 


A chain with mass density o kg/m lies in a heap at the edge of a table. 
One end of the chain initially sticks out an infinitesimal distance from 
the heap. You grab this end and accelerate it downward with acceleration 
a. Assume that there is no friction of the chain with itself as it unravels. 
As a function of time, what force does your hand apply to the chain? 
Find the value of a that makes your force always equal to zero. (In other 
words, find the a with which the chain naturally falls.) 


Raising a chain +x 


A chain with length Z and mass density o kg/m lies in a heap on the 
floor. You grab one end of the chain and pull upward with a force such 
that the chain moves at constant speed v. What is the total work you do, 
by the time the chain is completely off the floor? How much energy is 
lost to heat, if any? Assume that the chain is greased, so that it has no 
friction with itself. 


Downhill dustpan +» 


A plane inclined at an angle 6 is covered with dust. An essentially mass- 
less dustpan on wheels is released from rest and rolls down the plane, 
gathering up dust. The density of dust in the path of the dustpan is 
o kg/m. What is the acceleration of the dustpan? 


Heap and block +x 


A chain with mass density o kg/m lies in a heap on the floor, with one 
end attached to a block of mass M. The block is given a sudden kick 
and instantly acquires a speed Vo. Let x be the distance traveled by the 
block. In terms of x, what is the tension in the chain, just to the right 
of the heap; that is, at the point P shown in Fig. 5.53? There is no 
friction in this problem; none with the floor, and none in the chain with 
itself. 


(top view) 
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5.96. Touching the floor «x«« 


5.11 


5.1. 


5:2; 


A chain with mass density o kg/m hangs from a spring with spring 
constant k. In the equilibrium position, a length L is in the air, and the 
bottom part of the chain lies in a heap on the floor; see Fig. 5.54. The 
chain is raised by a very small distance, b, and then released. What is 
the amplitude of the oscillations, as a function of time? 

Assume that (1) L > b, (2) the chain is very thin, so that the size of 
the heap on the floor is very small compared with b, (3) the length of 
the chain in the initial heap is larger than b, so that some of the chain 
always remains in contact with the floor, and (4) there is no friction of 
the chain with itself inside the heap. 


Solutions 


Minimum length 


Drop the masses through the three holes, and let the system reach its equilibrium 
position. The equilibrium position is the one with the lowest potential energy of the 
masses, that is, the one with the most string hanging below the table. In other words, it 
is the one with the least string lying on the table. This is the desired minimum-length 
configuration. 

What are the angles at the vertex of the string? The tensions in all three strings are 
equal to mg, because they are holding up the masses. The vertex of the string is in 
equilibrium, so the net force on it must be zero. This implies that each string must 
bisect the angle formed by the other two. Therefore, the angles between the strings 
must all be 120°. 


Heading to zero 


The energy of the particle is E = mv?/2 — A|x|". The given information tells us that 
v = 0 when x = 0. Therefore, E = 0, which then implies that v = —./2Ax"/m (we'll 
assume x > 0; thex < 0 case works the same). We have chosen the minus sign because 
the particle is heading toward the origin. Writing v as dx/dt and separating variables 


gives 
[ as I2 fa r| (5.84) 
my 2 m Jo ~ m’ i 


where xo is the initial position and T is the time to reach the origin. The integral on the 
left is finite if and only if n/2 < 1. Therefore, the condition that T is finite is n < 2. 


Remark: If0Q < n< 1, then V(x) has a cusp at x = 0 (infinite slope on either side), 
so it’s clear that T is finite. Ifn = 1, then the slope is a finite constant, so it’s also clear 
than T is finite. If > 1, then the slope of V(x) is zero at x = 0, so it’s not obvious 
what happens with 7. But the above calculation shows that n = 2 is the value where 
T becomes infinite. 

The particle therefore takes a finite time to reach the top of a triangle or the curve 
—Ax3/?, But it takes an infinite time to reach the top of a parabola, cubic, etc. A circle 
looks like a parabola at the top, so T is infinite in that case also. In fact, any nice 
polynomial function V(x) requires an infinite T to reach a local maximum, because 
the Taylor series starts at order two (at least) around an extremum. & 
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5.3. Leaving the sphere 


FIRST SOLUTION: Let R be the radius of the sphere, and let 6 be the angle of the mass, 
measured from the top of the sphere. The radial F = ma equation is 


mv? 


mg cos@ — N = R’ (5.85) 
where N is the normal force. The mass loses contact with the sphere when the normal 
force becomes zero (that is, when the normal component of gravity is barely large 
enough to account for the centripetal acceleration of the mass). Therefore, the mass 
loses contact when 


2 
wi mg cos @. (5.86) 
R 
But conservation of energy gives mv?/2 = mgR(1 — cos@). Hence, v = 
/2gR(1 — cos 0). Plugging this into Eq. (5.86) gives 
2 
cos 6 = 3 => 6 48.2°. (5.87) 


SECOND SOLUTION: Let’s assume (incorrectly) that the mass always stays in contact 
with the sphere, and then find the point where the horizontal component of v starts 
to decrease, which it of course can’t do, because the normal force doesn’t have a 
“backward” component. From above, the horizontal component of v is 


Uy = vcosé = y2gR(1 — cos@) cosé. (5.88) 


Taking the derivative of this, we find that the maximum occurs when cos@ = 2/3. | 
T x 


So this is where vy would start to decrease if the mass were constrained to remain on 
the sphere. But there is no such constraining force available, so the mass loses contact 
when cos @ = 2/3. 


5.4. Pulling the pucks yyy anM >F 


(a) Let @ be defined as in Fig. 5.55. Then the tension in the string is T = F /(2 cos 8), 
because the force on the massless kink in the string must be zero. Consider the 
“top” puck. The component of the tension in the y direction is —T sind = 
—(F'/2) tan 0. The work done on the puck by this component is therefore 


9 _Ftan@ 9 _Ftan@ . 
W, si dy -|/ d(€sin@) 
g £ 2 m 


2 2 
0 _Fésin@ 
-|/ de 
vs 


2 2 
Fecosé |° Fe 
=—; =F (5.89) system A 
m/2 
By the work—energy theorem (or equivalently, by separating variables and inte- ----> F 


grating Fy = mvy dvy/dy), this work equals the value of mv? /2 right before the 
collision. There are two pucks, so the total kinetic energy lost when they stick 
together is twice this quantity (v, doesn’t change during the collision), which 


is F£. 
(b) Consider two systems, A and B (see Fig. 5.56). A is the original setup, while B systemi Z 
starts with 6 already at zero. Let the pucks in both systems start simultaneously at e E 2 


x = 0. As the force F is applied, all four pucks will have the same x(t), because 

the same force in the x direction, namely F /2, is applied to every puck at all 

times. After the collision, both systems will therefore look exactly the same. Fig. 5.56 
Let the collision of the pucks occur at x = d. At this point, F(d + £) work 

has been done on system A, because the center of the string (where the force 
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is applied) ends up moving a distance £ more than the masses. However, only 
Fd work has been done on system B. Since both systems have the same kinetic 
energy after the collision, the extra F work done on system A must be what is 
lost in the collision. 


REMARK: Thereasoning in this second solution can be used to solve the problem 
in the case where we have a uniform massive rope (so the rope flops down, as in 
Fig. 5.57). The center of mass of the rope moves in exactly the same manner as 
the position of the two pucks in system B (assuming that the mass of each puck 
is chosen to be half the mass of the rope), because the same force F acts on both 
systems. You can show that this implies that the force acts over an extra distance 
of £/2 on the rope, compared with system B, by the time the rope has flopped 
into a straight line. From the reasoning above, F£/2 of work must therefore be 
lost to heat in the rope. & 


Constant y 


By conservation of energy, the bead’s speed at any time is given by (note that y is 
negative here) 


ere lm 5-2 5.90 
a tmgy = muy v = y v — 2gy. (5.90) 


The vertical component of the velocity is ý = v sin 0, where @ is the (negative) angle 
the wire makes with the horizontal. The slope of the wire is tan@ = dy/dx = y’, 
which yields sin@ = y'/y 1 +. The requirement ý = —vo, which is equivalent to 
v sin @ = —vo, may therefore be written as 


/ 
Jo te mii; (5.91) 
0 /1 +y? 


Squaring both sides and solving for y! = dy/dx yields dy/dx = —vo/./—2gy. 
Separating variables and integrating gives 


—20y)3/2 
[ ee = -v f ax = er = wx, (5.92) 


where the constant of integration has been set to zero, because (x, y) = (0, 0) is a point 
on the curve. Therefore, 


_ _ Bgvox)?? 


= > (5.93) 


Dividing the heat 


It turns out that it isn’t possible to answer these questions without being given more 
information. The way that the work is divided up between the objects depends on what 
their surfaces look like. It’s theoretically possible for one of the objects to gain all the 
heat, while the other doesn’t heat up at all. 

To understand this, we’ll need to make a model of how friction works. The general 
way that friction works is that molecules from one surface rub against molecules from 
the other surface. The molecules stretch to the side and then bounce back and vibrate. 
This vibrational motion is the kinetic energy associated with heat. The model we’ll 
use here will have a bunch of springs with masses on the ends, on both surfaces at the 
interface. When the surfaces rub against each other, the masses catch on each other 
for a short time (as shown in Fig. 5.58), and then they release, whereupon they vibrate 
back and forth on the springs. This is the kinetic energy of the heat we see. Though an 
oversimplification, this is basically the way friction works. 

Now, if everything is symmetrical between the two objects (that is, if the springs and 
masses on one object look like those on the other), then both objects will heat up by 
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the same amount. But things need not be symmetrical. You can imagine the springs on 
one surface being much stiffer (that is, having a much larger k value) than the springs 
on the other surface. Or, you can even take the limit where one surface (say, the block) 
is made of completely rigid teeth, as shown (for one tooth) in Fig. 5.59. In this case, 
only the bottom surface (the table) will end up with heat from vibrational motion. 

Is this asymmetrical result consistent with what we obtain from the work—energy 
theorem? Well, the net work done on the block is zero, because your pulling force does 
Fd positive work (where F = uN), while the friction force (the sum of all the forces 
from the masses on the little teeth) does Fd negative work. Therefore, since zero net 
work is done on the block, its total energy is constant. And since the kinetic energy due 
to its motion as a whole is constant, its internal thermal energy must also be constant. 
In other words, it doesn’t heat up. 

The net work done on the table in this scenario comes from the force from the teeth 
on the little masses on the springs. These teeth do Fd positive work on all the little 
spring—mass systems, so Fd is the work done on the table. Therefore, its total energy 
increases by Fd. And since the kinetic energy due to its motion as a whole is constant 
(and zero, since the table is just sitting there), its internal thermal energy must increase 
by Fd. In other words, it heats up. 

Now consider the reverse situation, where the table has the rigid teeth and the block 
has the springs and masses, as shown in Fig. 5.60. The block is now the object that 
heats up, because it has the vibrational motion of the masses. And as above, we can 
show that this is consistent with the work—energy theorem, as follows. In the present 
case, the force from the table’s teeth does no work on the block (because the teeth 
aren’t moving), so the net work done on the block is simply the Fd from your pulling, 
so it heats up. Likewise, the little masses do no work on the teeth (because the teeth 
aren’t moving), so the work done on the table is zero, so it doesn’t heat up. 

For the in between case where the spring constants on the two objects are equal, the 
net work done on each object is Fd/2, where d is the distance the block moves. This 
is true because the two masses in Fig. 5.58 each move half as far as the block moves. 
So the work done on the block is Fd — Fd/2 = Fd/2 (this is the positive work done 
by you, plus the negative work done by the table’s little masses). And the work done 
on the table is Fd /2 (this is the positive work done by the block’s little masses). So the 
objects heat up the same amount. For more discussion of the issues in this problem, 
see Sherwood (1984). 


V(x) vs. a hill 

FIRST SOLUTION: Consider the normal force N acting on the bead at a given point. Let 
0 be the angle that the tangent to V (x) makes with the horizontal, as shown in Fig. 5.61. 
The horizontal F = ma equation is 


—N sin 0 = mx. (5.94) 
The vertical F = ma equation is 
N cos0 — mg = m => Ncos0 = mg + m. (5.95) 
Dividing Eq. (5.94) by Eq. (5.95) gives 


— tan = 


=. (5.96) 
gty 


But tan 0 = V' (x). Therefore, 
žł = -g +V". (5.97) 


We see that this is not equal to —gV’. In fact, there is in general no way to construct 
a curve with height z(x) that gives the same horizontal motion that a 1-D potential 
mgV (x) gives, for all initial conditions. We would need —(g + Z)z’ = —gV’, for all 
x. But at a given x, the quantities V’ and z’ are fixed, whereas Z depends on the initial 
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conditions. For example, if there is a bend in the wire, then Z will be large ifz is large. 
And z depends (in general) on how far the bead has fallen. 

Equation (5.97) holds the key to constructing a situation that does give the x = —gV’ 
result. All we have to do is get rid of the term. So here’s what we do. We grab our 
y = V(x) wire and move it up and/or down in precisely the manner that makes the bead 
stay at the same height with respect to the ground. (Actually, constant vertical speed 
would be good enough.) This will make the ý term vanish, as desired. The vertical 
movement of the curve doesn’t change the slope V’ at a given value of x, so the 8 in 
the above derivation is still the same 0. 

Note that the quantity y here is the vertical position of the bead. It equals V (x) if the 
curve is stationary, but not if the curve is being moved up and down. 


REMARK: There is one case where X is (approximately) equal to —gV’, even when the 
wire remains stationary. In the case of small oscillations of the bead near a minimum 
of V (x), Y is small compared with g. Hence, Eq. (5.97) shows that ¥ is approximately 
equal to —gV’. Therefore, for small oscillations, it is reasonable to model a particle 
in a 1-D potential mgV (x) as a particle sliding in a valley whose height is given 
byvy=V(x). & 


SECOND SOLUTION: The component of gravity along the wire is what causes the change 
in velocity of the bead. That is, 


OY ee (5.98) 
— sin = — A 
g = 


where 0 is given by 


y' 1 
tnð = V'œ@) = sind = ———., cosĝð = ——__.. (5.99) 
V4? JF? 
We are, however, not concerned with the rate of change of v, but rather with the rate 
of change of x. In view of this, let us write v in terms of x. Since x = v cos, we 
have v = x/cos@ = xV1+ V (dots denote d/dt, primes denote d/dx). Therefore, 
Eq. (5.98) becomes 


—gV’ d/. 
pe wa TF”) 


:V(dV' Jdt 
=ixV14+724~ wa (5.100) 


Hence, x is given by 
_ ae’ xV'(dV' /dt) 
1+V? 1+? 


We’ll simplify this in a moment, but first a remark. 


(5.101) 


Remark: A common incorrect solution to this problem is the following. The accel- 
eration along the curve is gsind = —g(V'/ VI+V"). Calculating the horizontal 
component of this acceleration brings in a factor of cos@ = 1/v 1 + V2. Therefore, 
we might think that 


K 


x= Th ye (incorrect). (5.102) 


We have missed the second term in Eq. (5.101). Where is the mistake? The error is that 
we forgot to take into account the possible change in the curve’s slope (Eq. (5.102) is 
true for straight lines). We addressed only the acceleration due to a change in speed. 
We forgot to consider the acceleration due to a change in the direction of motion (the 
term we missed is the one with dV'/dt). Intuitively, if we have a sharp enough bend in 
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the wire, then x can change at an arbitrarily large rate, even if v is roughly constant. 
In view of this fact, Eq. (5.102) is definitely incorrect, because it is bounded (by g/2, 
in fact), & 


To simplify Eq. (5.101), note that V’ = dV/dx = (dV/dt)/(dx/dt) = Ý /x (V is just 
the rate of change in the bead’s height). Therefore, the numerator in the second term 
on the right-hand side of Eq. (5.101) is 


| „d (Ý (XV -vx 
xy =x) = |= - 
dt dt \ x x2 


P 4 sf 
=V'V r's( f ) = VV —V?X. (5.103) 
x 
Substituting this into Eq. (5.101) yields 
ž=-( +0, (5.104) 
in agreement with Eq. (5.97), because y = V(x) if the wire is stationary. 


Equation (5.104) is valid only for a curve V(x) that remains fixed. If we grab the 
wire and start moving it up and down, then the above solution is invalid, because the 
starting point, Eq. (5.98), rests on the assumption that gravity is the only force that 
does work on the bead. But if we move the wire, then the normal force also does work. 

It turns out that for a moving wire, we simply need to replace the V in Eq. (5.104) 
by ï, which then gives Eq. (5.97). This can be seen by looking at things in the vertically 
accelerating frame in which the wire is at rest. We won’t cover accelerating frames 
until Chapter 10, so we’ll just invoke here the result that there is an extra fictitious 
“translational” force in this accelerating frame, and the consequence of this is that the 
bead thinks that it lives in a world where the acceleration from gravity is g + h (if h is 
positive, then the bead thinks gravity is larger), where / is the position of your hand 
that is accelerating the wire. In this new frame, the wire is at rest, so the above solution 
is valid. So with the g in Eq. (5.104) replaced by g +h, we have ¥ = —(g+h+V)V’. 
But Ÿ (which is the vertical acceleration of the bead with respect to the new frame) 
plus h (which is the vertical acceleration of the new frame with respect to the ground) 
equals ÿ (which, by our definition in the first solution, is the vertical acceleration of 
the bead with respect to the ground). We have therefore reproduced Eq. (5.97). 


Hanging mass 
We will calculate the equilibrium point yo, and then use œ = /V"(yo)/m. The 
derivative of V is 


V'(y) = ky + mg. (5.105) 
Therefore, V'(v) = 0 when y = —mg/k = yo. The second derivative of V is 
V") =k. (5.106) 


We therefore have 


P nr, (5.107) 
m m 


REMARK: This is independent of yo, which makes sense because the only effect of 
gravity is to change the equilibrium position. More precisely, if yr is the position 
relative to yo (so that y = yo + yr), then the total force as a function of yp is 


FOr) = k00 +y) — mg = —k (-7E +y:) -mg = krs (6.108) 


so it still looks like a regular spring. (This works only because the spring force is 
linear.) Alternatively, you can think in terms of the potential energy; this is the task of 
Exercise 5.42. & 
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5.9. 


5.10. 


5.11. 


Small oscillations 


We will calculate the equilibrium point xo, and then use œ = /V"(xo)/m. The 
derivative of V is 


V(x) = —Ce~ x"! (n — ax). (5.109) 
Therefore, V’(x) = 0 when x = n/a = xo. The second derivative of V is 
V"(x) = Cex"? ((n — 1 — ax)(n — ax) — ax). (5.110) 


Plugging in x9 = n/a simplifies this a bit, and we find 


V" (xo) Ce—”n"-! 
o= (— = |=. (5.111) 


Zero force inside a sphere 


Let a be the distance from P to piece A, and let b be the distance from P to piece B 
(see Fig. 5.62). Draw the “perpendicular” bases of the cones, and call them A’ and 
B’. The ratio of the areas of A’ and B’ is a? /b?. The key point here is that the angle 
between the planes of A and 4’ is the same as the angle between B and B’. This is true 
because the chord between A and B meets the circle at equal angles at its ends. So the 
ratio of the areas of A and B is also a? /b?. But the gravitational force decreases like 
1/r*, and this effect exactly cancels the a’ /b? ratio of the areas. Therefore, the forces 
at P due to A and B (which can be treated like point masses, because the cones are 
assumed to be thin) are equal in magnitude; and opposite in direction, of course. If we 
draw enough cones to cover the whole shell, then the contributions to the force from 
little pieces over the whole shell cancel in pairs, so we are left with zero force at P. 
This holds for any point P inside the shell. 


REMARK: Interestingly, the force inside an ellipsoidal shell of constant density (per 
volume) is also zero, assuming that the shell is defined to be the region between the 
surfaces described by ax? + by? + cz? = k, for two different values of k. In short, 
this is true because an ellipsoid is simply a stretched sphere. In detail: Let the above 
spherical shell have some thickness dr (which was irrelevant for the sphere, but it will 
be important in the ellipsoid case). From above, we know that the masses at the ends 
of the thin cones in the sphere have canceling forces. If we now stretch the sphere into 
an ellipsoid (uniformly in each direction, but the factors in the three directions can be 
different), then the distances from the end masses to P will still be in the ratio of a to b 
(as you can verify). And the end masses will still be in the ratio of a? to b?, because both 
masses change by the same factor. This is true because every infinitesimal cube in the 
end masses has its volume changed by the same factor (namely fy fy fz, where these f’s 
are the stretching factors in each direction). So the masses are still in the ratio of a? to 
b?, and the same argument about canceling forces holds as in the spherical case. Note 
that this zero-force result is not true for an ellipsoid with constant thickness, because 
such an object isn’t the result of stretching a spherical shell (because a stretching results 
in the ellipsoidal shell being thicker near the ends where it is more “pointy”). & 


Escape velocity 


(a) The cutoff case is where the particle barely makes it to infinity, that is, where 
its speed is zero at infinity. Conservation of energy for this situation gives 


MY, 


a S040, (5.112) 


2 


In other words, the initial kinetic energy, mu¢,, 


potential energy, GMm/R. Therefore, 


2GM 
voss =) (5.113) 


/2, must account for the gain in 
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In terms of the acceleration, g = GM/R?, at the surface of a planet, we can 
write this as vesse = /2gR. Using M = 4m pR?/3, we can also write it as 


Vese = V8 GR2p/3. So for a given density p, Vese grows like R. Using the 
values of g and R given in Appendix J, we have: 


For the earth, vese = /28R ~ \/2(9.8 m/s2)(6.4 - 106 m) ~ 11.2 km/s. 
For the moon, vese = /2gR © /2(1.6 m/s2)(1.7 - 106m) © 2.3 km/s. 
For the sun, Vese = /2gR © /2(270 m/s”)(7.0- 108 m) © 620 km/s. 


REMARK: Another reasonable question to ask is: what is the escape velocity 
from the sun for an object located where the earth is (but imagine that the earth 
isn’t there)? The answer is /2GMs/Rgs, where Rgs is the earth-sun distance. 
Numerically, this is 


206.67 - 10-1! m3 /kg s2)(2 - 1030 kg) /(1.5- 10!! m) © 42km/s. (5.114) 


If you want to bring the earth back in (but let’s assume it’s at rest and not orbiting) 
and find the escape velocity (from both the sun and the earth) from a point on the 
earth’s surface, you can’t just add the 42 km/s and 11.2 km/s results. Instead, 
you have to take the square root of the sum of the squares. This follows from 
Eq. (5.112) and from the fact that potentials simply add. The result is about 
43.5 km/s. The task of Exercise 5.60 is to find the escape velocity if the orbital 
motion of the earth is included. & 


(b) To get a rough answer, we’ll assume that the initial speed of a person’s jump 
on the small planet is the same as it is on the earth. This probably isn’t quite 
true, but it’s close enough for the purposes here. A good jump on the earth is 
about a meter. For this jump, conservation of energy gives mv? /2 = mg(1m). 
Therefore, v= /2g(1m) © /20m/s. So we want /20m/s = /82 GR2p/3. 
Using p ~ 5500 kg/m?, we find R ~ 2.5 km. On sucha planet, you should tread 
lightly. 


5.12. Ratio of potentials 


Let p be the mass density of the cube. Let V°" be the potential energy of a mass m at 
the corner of a cube of side £, and let V/°" be the potential energy of a mass m at the 
center of a cube of side £. By dimensional analysis, 


G(pl3)m 
—— « 


V9 x 


e. (5.115) 


Therefore,*? 
Ver = AV - (5.116) 


But a cube of side £ can be built from eight cubes of side £/2. So by superposition, we 
have 


ee ee (5.117) 


because the center of the larger cube lies at a corner of the eight smaller cubes (and 
because potentials just add). Therefore, 


yoo 4yeor 1 
pen = per => (5.118) 
£ £/2 


32 In other words, imagine expanding a cube of side £/2 to one of side £. If we consider corresponding 
pieces of the two cubes, then the larger piece has 23 = 8 times the mass of the smaller. But 
corresponding distances are twice as big in the large cube as in the small cube. Therefore, the 
larger piece contributes 8/2 = 4 times as much to V7 as the smaller piece contributes to Vin ; 
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5.13. Through the hole 


(a) By symmetry, only the component of the gravitational force perpendicular to 


the plane survives. A piece of mass dm at radius r on the plane provides a force 
equal to Gm(dm)/(r? +x”). To obtain the component perpendicular to the plane, 
we must multiply this by x//7? + x?. Slicing the plane up into rings with mass 
dm = (2nr dr)o, we find that the total force is 


F@) = f Gm(2rro dr)x 


=00 
= 2n0 Gmx(r? +x Í 


(r? + x2)3/2 r=R 
_ 2nmaGmx (5.119) 
O0 JEFE l 
Note that if R = 0 (so that we have a uniform plane without a hole), then 
F = —2r o Gm, which is independent of the distance from the plane. 
(b) Ifx < R, then Eq. (5.119) gives F(x) © —220Gmx/R, so F = ma yields 
2 
+( mee )x=0 (5.120) 


The frequency of small oscillations is therefore 


w= OG. (5.121) 
R 


REMARK: For everyday values of R, this is a very small number because G 
is so small. Let’s determine the rough size. If the sheet has thickness d, and 
if it is made of a material with density p (per unit volume), then o = pd. 
So w = ./2mpdG/R. In the above analysis, we assumed that the sheet was 
infinitely thin. In practice, we need d to be much smaller than the amplitude 
of the motion. But this amplitude must be much smaller than R in order for 
our approximation to hold. So we conclude that d « R. To get a rough upper 
bound on ow, let’s pick d/R = 1/10. And let’s make our sheet out of gold (with 
p © 2-104kg/m?3). We then find œ © 1- 10~4s~!, which corresponds to 
an oscillation about every 100 minutes. For the analogous system consisting of 
electrical charges, the frequency is much larger, because the electrical force is 
so much stronger than the gravitational force. & 


which is independent of m. 


(c) Integrating the force in Eq. (5.119) to obtain the potential energy (relative to the 


center of the hole) gives 


V(x) [ow jd [ 210 Gmx dx 
y= - x)dx = psa aise 
0 0 VR2+x2 


= 2noGmyR +| = 2noGm(/R+x?—R). (5122) 


By conservation of energy, the speed at the center of the hole is given by 
mv? /2 = V (x). Therefore, 


v= 2/xoG( R? +x? =R). (5.123) 
For x > R this reduces to v © 2y mo Gx. 


REMARK: You can also obtain this last result by noting that for large x, the force 
in Eq. (5.119) reduces to F = —2x o Gm. This is constant, so it’s basically just 
like a gravitational force F = mg’, where g’ = 2770 G. But we know that in this 
familiar case, v = V2g'h —> /2(2170G)x,as above. & 


5.14. 


5.15. 


5.16. 


5.11 Solutions 


Snowball 


All of the snowball’s momentum goes into the earth, which then translates (and rotates) 
a tiny bit faster (or slower, depending on which way the snowball was thrown). 

What about the energy? Let m and v be the mass and initial speed of the snowball. 
Let M and V be the mass and final speed of the earth (with respect to the original rest 
frame of the earth). Since m « M, conservation of momentum gives V ~ mv/M. The 
kinetic energy of the earth is therefore 


oa (me) =: (A) «; 2 (5.124) 
a M) oo mM ge 
There is also a rotational kinetic-energy term of the same order of magnitude, but that 
doesn’t matter. We see that essentially none of the snowball’s energy goes into the 
earth. It therefore must all go into the form of heat, which melts some of the snow 
(and/or heats up the wall). This is a general result for a small object hitting a large 
object: The large object picks up essentially all of the momentum but essentially none 
of the energy (except possibly in the form of heat). 


Propelling a car 


Let the speed of the car be v(t). Consider the collision of a ball of mass dm with the car. 
In the instantaneous rest frame of the car, the speed of the ball is u — v. In this frame, 
the ball reverses velocity when it bounces (because the car is so much more massive), 
so its change in momentum is —2(u — v) dm. This is also the change in momentum 
in the lab frame, because the two frames are related by a given speed at any instant. 
Therefore, in the lab frame the car gains a momentum of 2(u — v) dm from each ball 
that hits it. The rate of change in momentum of the car (that is, the force) is thus 
dp ' 

"ie 2o0'(u — v), (5.125) 
where o’ = dm/dt is the rate at which mass hits the car. ø’ is related to the given o by 
o’ = o (u — v)/u, because although you throw the balls at speed u, the relative speed 
of the balls and the car is only (u — v). We therefore have 


dv 2u—v)*o v dv 20 ft 
M = = dt 
dt u o (u—v)2 Mu Jo 
1 1 2ot 
u—v u Mu 
Gra) 
v(t) = 5 (5.126) 
1+ 


Note that v —> u as t — oo, as it should. Writing this speed as u(1 -1/d +20t/M)), 
we can integrate it to obtain the position, 


Bee n a (5.127) 
x(t) =u n t > G 
20 M 


where the constant of integration is zero because x = 0 at t = 0. We see that even 
though the speed approaches u, the car will eventually be an arbitrarily large distance 
behind an object that moves with constant speed u (for example, pretend that your first 
ball misses the car and continues forward at speed u). 


Propelling a car again 
We can carry over some of the results from the previous problem. The only change in 
the calculation of the force on the car is that since the balls don’t bounce backward, 
we don’t pick up the factor of 2 in Eq. (5.125). The force on the car is therefore 

dv _ u- v)? 


=> = —_—_— 5.128 
m P7 7 r ( ) 
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where m(t) is the mass of the car-plus-contents, as a function of time. The main 
difference between this problem and the previous one is that this mass m changes 
because the balls are collecting inside the car. From the previous problem, the rate at 
which mass enters the car is o’ = ø (u — v)/u. Therefore, 


dm _ (u—v)o 

dt u 
We must now solve the two preceding differential equations. Dividing Eq. (5.128) by 
Eq. (5.129), and separating variables, gives’? 


v du m dm u— v m Mu 
= In = In ( ) m= Í 
0 u-v mm u M u— v 


(5.130) 


(5.129) 


Note that m —> oo as v — u, as it should. Substituting this value of m into either 
Eq. (5.128) or Eq. (5.129) gives 


i dvu todt 1 Í ot 
o (u— v)’ o Mu? 2u—v)2 2u? Mi 


u 


=" 
Viti 


Note that v + u as t —> oo, as it should. Integrating this speed to obtain the position 
gives 


v(t) =u (5.131) 


Mu 2ot Mu 
x(t) = ut 14 H ; (5.132) 
o M o 


where the constant of integration has been chosen so that x = 0 at t = 0. Fora 
given ¢, the v(t) in Eq. (5.131) is smaller than the v(t) for the previous problem in 
Eq. (5.126), which is easy to see if the latter is written as u(1 —1/d+ 2ot/M)). This 
makes sense, because in the present problem the balls have less of an effect on v(t), 
because (1) they don’t bounce back, and (2) the mass of the car-plus-contents is larger. 


5.17. Leaky bucket 


(a) First soLution: The initial position is x = L. The given rate of leaking implies 
that the mass of the bucket at position x is m = M(x/L). Therefore, F = ma 
gives —T = (Mx/L)x. Writing the acceleration as vdv/dx, and separating 
variables and integrating, gives 


TL [* dx g TL x v? 
= S od n( j= , (5.133) 
M L Xx 0 M L 2 
The kinetic energy at position x is therefore 
mv? Mx\ v? x 
Bele = Tx In ( ). (5.134) 
2 LJ 2 L 


In terms of the fraction z = x/L, we have E = —TLzInz. Setting dE/dz = 0 
to find the maximum gives 


1 TL 
io = Äre. (5.135) 
È 


33 We can also quickly derive this equation by writing down conservation of momentum for the 


time interval when a mass dm enters the car: dmu + mv = (m + dm)(v + dv). This yields the 
first equality in Eq. (5.130). But we will still need to use one of Eqs. (5.128) and (5.129) in what 
follows. 
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Note that the (fractional) location of Emax is independent of M, T, and L, but its 
value depends on T and L. These facts follow from dimensional analysis. 


Remark: We began this solution by writing down F = ma, where m is the 
mass of the bucket. You may be wondering why we didn’t use F = dp/dt, 
where p is the momentum of the bucket. This would certainly give a different 
result, because dp/dt = d(mv)/dt = ma+ (dm/dt)v. We used F = ma because 
at any instant the mass m is what is being accelerated by the force F. 

If you want, you can imagine the process occurring in discrete steps: The 
force pulls on the mass for a short period of time, then a little piece falls off. 
Then the force pulls again on the new mass, then another little piece falls off. 
And so on. In this scenario, it is clear that F = ma is the appropriate formula, 
because it holds for each step in the process. 

It is indeed true that F = dp/dt, if you let F be total force in the problem, 
and let p be the total momentum. The tension T is the only horizontal force in 
the problem, because we’ve assumed the ground to be frictionless. However, 
the total momentum consists of both the sand in the bucket and the sand that has 
leaked out and is sliding along on the ground. If we use F = dp/dt, where p is 
the total momentum, we obtain (remember that dm/dt is a negative quantity) 


d d di di 
T= IPbucket | IPleaked = (ma4 ma | m v= ma, (5.136) 
dt dt dt dt 


as expected. See Appendix C for further discussion of the uses of F = ma and 
F = dp/dt. & 


SECOND SOLUTION: Consider a small time interval during which the bucket 
moves from x to x + dx (where dx is negative). The bucket’s kinetic energy 
changes by (—T) dx (this is positive) due to the work done by the spring, and 
also changes by a fraction dx/x (this is negative) due to the leaking. Therefore, 
dE = —T dx + Edx/x, or 

dE E 

—e—-f+—. (5.137) 

dx x 
In solving this differential equation, it is convenient to introduce the variable 
y = E/x. Then E’ = xy’ + y, where a prime denotes differentiation with respect 
to x. Equation (5.137) then becomes xy’ = —T, which gives 


Efx x dx x 
f dy = rf E=~-TxIn(=), (5.138) 
0 L * L 


as in the first solution. 
(b) From Eq. (5.133), the speed is v = ./27L/M./—Inz, where z = x/L. 
Therefore, the magnitude of the momentum is 


p = mv = (Mz)v = V2TLM Ņ —z? Inz. (5.139) 


Setting dp/dz = 0 to find the maximum gives 


1 TLM 
Z Je Pmax a" (5.140) 
Note that the (fractional) location of pmax is independent of M, T, and L, but its 
value depends on all three. These facts follow from dimensional analysis. 


REMARK: Emax occurs closer to the wall (that is, at a later time) than pmax. The 
reason for this is that v matters more in E = mv*/2 than it does in p = mv. As 
far as E is concerned, it is beneficial for the bucket to lose a little more mass if 
it means being able to pick up a little more speed (up to a certain point). & 
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5.18. Another leaky bucket 


(a) F = ma gives —T = mx. Combining this with the given dm/dt = bx equation 
yields mdm = —bT dt. Integration then gives m?/2 = C — bTt. But m = M 
when t = 0, so we have C = M? /2. Therefore, 


m(t) = VM? — 2bTt. (5.141) 


This holds for t < M?/2bT, provided that the bucket hasn’t hit the wall yet. 
(b) The given equation dm/dt = bx = bdv/dt integrates to v = m/b + D. But 
v = 0 when m = M, so we have D = —M /b. Therefore, 
m—M VM2—2bTt M 


v(m) = b v(t) = 7 7 


At the instant right before all the sand leaves the bucket, we have m = 0. 
Therefore, v = —M /b at this point. Integrating u(t) to obtain x(t), we find 


(5.142) 


a Sa Mid Me (5.143) 
3°T Bo oer 
where the constant of integration has been chosen so that x = L when t = 
0. Solving for ¢ in terms of m from Eq. (5.141), substituting the result into 


Eq. (5.143), and simplifying, gives 


x(t) = 


(M — m)?(M + 2m) 


L ; 5.144 
x(m) IF (5.144) 
(c) Using Eq. (5.142), the kinetic energy is (it’s easier to work in terms of m here) 
Eo mea (m— M}? (5.145) 
= -mv = m(n ; : 
amu zp” 


Taking the derivative dE/dm to find the maximum, we obtain 


2M? 
m = 3 => Emax 272 (5.146) 
(d) Using Eq. (5.142), the momentum is 
1 
p= m= Pm — M). (5.147) 
Taking the derivative to find the maximum magnitude, we obtain 
M 2 
m= > => |[Plmax = 4b . (5.148) 
(d) We want x = 0 when m = 0, so Eq. (5.144) gives 
M? M? 
0=L b= ; (5.149) 
6b? T 6TL 


This is the only combination of M, T, and L that has the units of b, namely 
kgs/m. But we needed to do the calculation to find the numerical factor of 
1/V6. 


5.19. Right angle in billiards 


Let v be the initial velocity, and let vı and v2 be the final velocities. Since the masses 
are equal, conservation of momentum gives v = vj + v2. Taking the dot product of 
this equation with itself gives 


Vi-v=v-v1+2v, -v2+v2- V2. (5.150) 
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5.11 Solutions 


And conservation of energy gives (dropping the factors of m/2) 
V-V=V-Vi+Vv2-Vo. (5.151) 
Taking the difference of these two equations yields 
vvn =0. (5.152) 


So v1 v2 cos 8 = 0, which means that 6 = 90°. (Or vı = 0, which means the incoming 
mass stops because the collision is head-on. Or v2 = 0, which means the masses miss 
each other.) 

Bouncing and recoiling 


Let v; be the speed of the ball after the ith bounce, and let V; be the speed of the block 
right after the ith bounce. Then conservation of momentum gives 


mvi = MVj41 — mini. (5.153) 


But Theorem 5.3 says that v; = Vi+1 +vi+1. Solving this system of two linear equations 

gives 

(M =- mwi U-«)u 
M+m ite 


Vi4] = (1 — 2e)vi, and Vii © 2euj, (5.154) 


where € = m/M < 1. This expression for v;+4ı implies that the speed of the ball after 
the nth bounce is 


Un = (1 — 2e)"vo, where € =m/M. (5.155) 


The total distance traveled by the block can be obtained by looking at the work done 
by friction. Eventually, the ball has negligible energy, so all of its initial kinetic energy 
goes into heat from friction. Therefore, muy /2 = Fed = (uMg)d, which gives 


2 
mvg 


= 2uMg ` 


(5.156) 


To find the total time, we can add up the times, t„, the block moves after each bounce. 
Since the product of force and time equals the change in momentum, we have Ft, = 
MVn, and so (uMg)t, = M (2evn—-1) = 2Me(1 — 2€)"—! v9. Therefore, 


[e6] 


2evo 2 2evo 1 vo 
t=) h= (= 26)" = : sa (5.157) 
de ug 2 ug 1-(1—26) ug 


n=1 n=0 


We’ve let the sum go to n = oo even though our v, = (1 — 2€)”vp9 approximation 
eventually breaks down for very large n (because we dropped terms of order €? in 
deriving it). But by the time it breaks down, the terms are negligibly small anyway. 
The calculation of d above can also be done by adding up the geometric series of the 
distances moved after each bounce. 


Remarks: This ź = vo/g result is much larger than the result obtained in the case 
where the ball sticks to the block on the first hit, in which case the answer is t = 
mvo/ (Mg). The total time is proportional to the total momentum that the block picks 
up, and the t = vo/ug answer is larger because the wall keeps transferring positive 
momentum to the ball, which then gets transferred to the block. 

In contrast, d is the same as it would be in the case where the ball sticks to the block 
on the first hit. The total distance is proportional to the total energy that the block picks 
up, and in both cases the total energy given to the block is mv /2. The wall (which is 
attached to the very massive earth) transfers essentially no energy to the ball. 

The t = vo/jg result is independent of the masses (as long as M >> m), although 
it’s not at all intuitively obvious that if we keep the same vo, but decrease m by a factor 
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of 100, that we'll end up with the same ft. The distance d, on the other hand, would 
decrease by 100. & 


5.21. Drag force on a sheet 


(a) We will set v = 0 here. When the sheet hits a particle, the particle acquires 
a speed of essentially 2V. This follows from Theorem 5.3, or by working in 
the frame of the heavy sheet. The momentum of the particle is then 2mV. In 
time ¢, the sheet sweeps through a volume AVt, where A is the area of the 
sheet. Therefore, in time t, the sheet hits AVtn particles. The sheet therefore 
loses momentum at a rate of dP/dt = —(AVn)(2mV). But F = dP/dt, so the 
magnitude of the drag force per unit area is 


F 2 2 

g7 =2pV’, (5.158) 
where p is the mass density of the particles. We see that the force depends 
quadratically on V. 

(b) Ifv >> V, the particles now hit the sheet from various directions on both sides, 
but we need only consider the particles’ motions along the line of the sheet’s 
motion. As stated in the problem, we will assume that all velocities in this 
direction are equal to +v/2. Note that we won’t be able to set V exactly equal 
to zero here, because we would obtain a force of zero and miss the lowest-order 
effect. 

Consider a particle in front of the sheet, moving backward toward the sheet. 
The relative speed between the particle and the sheet is v/2 + V. This relative 
speed reverses direction during the collision, so the change in momentum of the 
particle is 2m(v/2 + V). We have used the fact that the speed of the heavy sheet 
is essentially unaffected by the collision. The rate at which particles collide with 
the sheet is A(v/2 + V)(n/2), from the reasoning in part (a). The n/2 factor 
comes from the fact that half of the particles move toward the sheet, and half 
move away from it. 

Now consider a particle behind the sheet, moving forward toward the sheet. 
The relative speed between the particle and the sheet is v/2 — V. This relative 
speed reverses direction during the collision, so the change in momentum of this 
particle is —2m(v/2 — V). And the rate at which particles collide with the sheet 
is A(v/2 — V)(n/2). Therefore, the magnitude of the drag force per unit area is 


F 1 |aP 
A A |d 
= (F241) (2mew/2 +V)) 4 (30/2 v))( 2m(v/2— V)) 
= 2nmvV = 2pvV, (5.159) 


where p is the mass density of the particles. We see that the force depends 
linearly on V. The fact that it agrees with the result in part (a) in the case of 
v = V is coincidence. Neither result is valid when v = V. 


5.22. Drag force on a cylinder 


Consider a particle that makes contact with the cylinder at an angle 6 with respect 
to the line of motion. In the frame of the heavy cylinder (see Fig. 5.63), the particle 
comes in with velocity —V and then bounces off with a horizontal velocity component 
of V cos 26. So in this frame (and hence also in the lab frame), the particle increases 
its horizontal momentum by mV (1 + cos 20). The cylinder must therefore lose this 
momentum. 

The area on the cylinder that lies between 6 and 0 + d0 sweeps out volume at a 
rate (R d0 cos) V£, where £ is the length of the cylinder. The cos @ factor here gives 
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the projection orthogonal to the direction of motion. The force per unit length on the 
cylinder (that is, the rate of change in momentum per unit length) is therefore 


a = L. (nRa cos)V) (mva + cos 26)) 


m/2 
= 2nmev? | cos 6(1 — sin’0) do 


—1/2 
1 m/2 
= 2nmRV? (sine — -= sino) 
3 —1/2 
8 2 8 2 
= z”mRV = gerV A (5.160) 


where p is the mass density of the particles. Note that the average force per cross- 
sectional area, F /(2R£), equals (4/3) pV 2. This is smaller than the result for the sheet 
in the previous problem, as it should be, because the particles bounce off somewhat 
sideways in the cylinder case. 


5.23. Basketball and tennis ball 


(a) Right before the basketball hits the ground, both balls move downward with 
speed (using mv? /2 = mgh) 


v = /2gh. (5.161) 


Right after the basketball bounces off the ground, it moves upward with speed 
v, while the tennis ball still moves downward with speed v. The relative speed 
is therefore 2v. After the balls bounce off each other, the relative speed is still 
2v (this follows from Theorem 5.3, or by working in the frame of the heavy 
basketball). Since the upward speed of the basketball essentially stays equal to 
v, the upward speed of the tennis ball is 2v +v = 3v. By conservation of energy, 
it therefore rises to a height of H = d + (3v)? /(2g). But v? = 2gh, so we have 


H=d+9h. (5.162) 


(b) Right before Bı hits the ground, all of the balls move downward with speed 
v= /2gh . We will inductively determine the speed of each ball after it bounces 
off the one below it. If B; achieves a speed of v; after bouncing off B;_1, then 
what is the speed of B;+, after it bounces off B;? The relative speed of Bj.) and 
B; (right before they bounce) is v + vj. This is also the relative speed after they 
bounce. Therefore, since B; is still moving upward at essentially speed v;, we 
see that the final upward speed of B;+1 equals (v + v;) + vi. Thus, 


Vig) = 2v; +v. (5.163) 


Since vı = v, we obtain v2 = 3v (in agreement with part (a)), and then v3 = 7v, 
and then v4 = 15v, etc. In general, 


Un = 2” — 1)v, (5.164) 


which is easily seen to satisfy Eq. (5.163), with the initial value vı = v. From 
conservation of energy, B, therefore rises to a height of 


((2” — 1)v)? 


H=€4 = l+ (2" — 1)°h. (5.165) 
2g 


If h is 1 meter, and if we want this height to be 1000 meters, then (assuming £ 
is not very large) we need 2” — 1 > y 1000. Five balls won’t quite do the trick, 
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but six will, in which case the height is almost 4 kilometers. Escape velocity 
from the earth (which is vege = ./2gR © 11 200 m/s) is reached when 


R 
Un = vese => (2"— 1)y2gh > /2gR => n=In (E+ 1) 
(5.166) 
With R = 6.4- 106 m and h = 1m, we find n > 12. Of course, the elasticity 


assumption is absurd in this case, as is the notion that you can find 12 balls with 
the property that mı >> m > --- >> my. 


5.24. Maximal deflection 


FIRSTSOLUTION: Let’s figure out what the collision looks like in the CM frame. IfM has 
initial speed V in the lab frame, then the CM moves with speed Vom = MV /(M +m). 
The speeds of M and m in the CM frame therefore equal, respectively, 
MV 
M+m- 


mV 
U=V—-Vou= , and u= |- Vcom] = 


5.167 
M+m ( ) 


In the CM frame, the collision is simple. The particles keep the same speeds, but 
simply change their directions (while still moving in opposite directions), as shown 
in Fig. 5.64. The angle 6 is free to have any value. This scenario clearly satisfies 
conservation of energy and momentum, so it must be what happens. 

The important point to note is that since @ can have any value, the tip of the U 
velocity vector can be located anywhere on a circle of radius U. If we then shift back 
to the lab frame, we see that the final velocity of M with respect to the lab frame, 
Viab, is obtained by adding Vcm to the vector U, which can point anywhere on the 
dotted circle in Fig. 5.65. A few possibilities for Vjgp are shown. The largest angle of 
deflection is obtained when Via» is tangent to the dotted circle, in which case we have 
the situation shown in Fig. 5.66. The maximum angle of deflection, @max, is therefore 
given by 


U mV /(M +m) m 
Vom MV/(M+m) M` 


sin Pmax = (5.168) 


IfM < m, then the dotted circle passes to the left of the left vertex of the triangle. 
This means that ¢ can take on any value. In particular, it is possible for M to bounce 
directly backward. 


SECOND SOLUTION: We’ll work in the lab frame in this solution. Let V’ and v’ be the 
final speeds, and let ¢ and y be the scattering angles of M and m, respectively, in the 
lab frame. Then conservation of px, py, and E give 


MV = MV" cos + mv' cosy, (5.169) 
0 = MV' sing — mv' sin y, (5.170) 
1 2 1 12 l 12 
MV* = -MV4 + -—mv”. (5.171) 
2 2 2 


Putting the ¢ terms on the left-hand sides of Eqs. (5.169) and (5.170), and then squaring 
and adding these equations, gives 


M?(V2 + V°? —2VV' cosp) = mv". (5.172) 


Equating this expression for m?v’* with the one obtained by multiplying Eq. (5.171) 
through by m gives 


M(V? + V°? —2VV' cos¢) = m(V? — V°?) 


(M +m) V? — (2MV coso) V’ + (M — m) V? = 0. (5.173) 


5.11 Solutions 


A solution to this quadratic equation in V’ exists if and only if the discriminant is 
non-negative. Therefore, we must have 


(2MV cos ¢)* — 4(M + m)(M — m) V? > 0 
=> m > M° — cos? o) 
=> 


m 

= —ssing (5.174) 
— 1n @Q. A 
me: 


5.25. Colliding masses 


(a) By conservation of momentum, the final speed of the combined masses is 
MV/(M +m) © (1—m/M)V, plus higher-order corrections. The final energies 
are therefore 


1 2 1 
En zm(1 Z) Vex 5m, 

l a A (5.175) 
Eu m(ı ) V2 x MV? — mV. 

2 M 2 


These energies add up to MV?/2 — mV2/2, which is mV/2 less than the initial 
energy of mass M, namely MV?/2. Therefore, mV/2 is lost to heat. 

(b) In this frame, mass m has initial speed V, so its initial energy is E; = mV?/2. By 
conservation of momentum, the final speed of the combined masses is mV /(M + 
m) ~ (m/M)V, plus higher-order corrections. The final energies are therefore 


E,= m (ZÝ = (Z7 axo 
2” (M M ? 

1 

= 


ne- (awo 


(5.176) 


E 

a M 
This negligible final energy of zero is mV7/2 less than E;. Therefore, mV? /2 is 
lost to heat, in agreement with part (a). 


5.26. Pulling a chain 


Let x be the distance your hand has moved. Then x/2 is the length of the moving part 
of the chain, because the chain gets “doubled up.” The momentum of this moving part 
is therefore p = (o0x/2)x. The force that your hand applies is found from F = dp/dt, 
which gives F = (0/2) (x? + xX). But since v is constant, the ¥ term vanishes. The 
change in momentum here is due simply to additional mass acquiring speed v, and not 
due to any increase in speed of the part already moving. Hence, 


(5.177) 


which is constant. Your hand applies this force over a total distance 2L, so the total 
work you do is 


F(2L) =oLv’. (5.178) 


The mass of the chain is øL, so its final kinetic energy is (ø L)v?/2. This is only half 
of the work you do. Therefore, an energy of ø Lv?/2 is lost to heat. Each atom in the 
chain goes abruptly from rest to speed v, and there is no way to avoid heat loss in such 
a process. This is clear when viewed in the reference frame of your hand. In this frame, 
the chain initially moves at speed v and then eventually comes to rest, piece by piece. 
So all of its initial kinetic energy, (o L)v?/2, goes into heat. 


211 


212 


Conservation of energy and momentum 


5.27. 


5.28. 


5.29. 


Pulling a chain again 

Let x be the position of the end of the chain. The momentum of the chain is then 
p = (ox)x. F = dp/dt gives (using the fact that F is constant) Ft = p, so we have 
Ft = (ox)x. Separating variables and integrating yields 


x t ox2 Fre 
Í oxdr = | Ftdt x=t/F/o. (5.179) 
0 0 2 2 


The position therefore grows linearly with time. In other words, the speed is constant, 


and it equals VF /o. 


Remark: Realistically, when you grab the chain, there is some small initial value of 
x (call it €). The dx integral above now starts at € instead of 0, so x takes the form, 
x = y Ft /o + e?. Ife is very small, the speed very quickly approaches ,/F/o. Even 
if € is not small, the position becomes arbitrarily close to t,/F'/o as t becomes large. 
The “head start” of € therefore doesn’t help you in the long run. & 


Falling chain 

First SOLUTION: The left part of the chain is in freefall (because we’re dealing with 
a chain described by our first scenario), so at time f it is moving at speed gf and has 
fallen a distance gt?/2. The chain gets doubled up below the support, so a length of 
only gt*/4 hangs at rest. Hence, a length of L— gt?/4 is falling at speed gt. With upward 
taken to be positive, the momentum of the entire chain (which just comes from the 
moving part, of course) is therefore 


p=o(L— gt*/4)(—gt) = —oLgt + og°t/4. (5.180) 


If F; is the force from the support, then the net force on the entire chain is F; — o Lg. 
So F = dp/dt for the entire chain gives 


eE) Fe 309717 


d 
F, L Lgt +4 5.181 
s-o = 3 ( oLg 7 s 4 (5.181) 
This result holds until the top of the chain has fallen a distance 2L (at T = ./4L/g). 
Prior to time T, F; equals three times the weight, ø (gt? /4)g, of the part of the chain 
that hangs at rest. After time T, Fs simply equals the total weight of the chain, o Lg. 
So at time T, F; abruptly drops from 30 Lg to ø Lg. 


SECOND SOLUTION: F; is responsible for two things: (1) It holds up the part of the chain 
that hangs at rest below the support, and (2) it changes the momentum of the atoms 
in the chain that are suddenly brought to rest at the kink in the chain. In other words, 
Fs = Fweignt + Fap/a:- From the first solution above, we have Fwweight = o (gt? /A)g. 

Now let’s find Fapjar- At time t, the speed of the chain is gt, so in a small time interval 
dt, the top of the chain falls a distance (gt) dt. But due to the “doubling up” effect, only 
half of this length is brought to rest in the time dt. Therefore, a small piece of mass 
o (1/2)(gt) dt that was moving at speed gt is suddenly brought to rest. The momentum 
was (a/2)(gt)?dt downward, and then it becomes zero. Hence, dp = +(0/2)g*?*dt, 
and so Fapja: = dp/dt = (o/2)g7??. Therefore, 


2 42 242 2 42 
og‘t ogt 30g°t 
Fs = Fweight t Fap/dt = 4 t 2 = 4 i 


(5.182) 


Falling chain (energy conserving) 

Let o be the mass density of the chain, let L be its total length, and let x be the distance 
(defined to be positive) that the top of the chain has fallen. For a given x, a piece 
of chain with mass ox and CM position L — x/2 has effectively been replaced by a 
thin “U” below the support, with height x/2; so its CM position is —x/4. The loss in 
potential energy is therefore (ox)g(L — x/2) — (ox)g(—x/4) = oxg(L — x/4). Since 
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we are assuming that energy is conserved in this setup, this loss in potential energy 
shows up as the gain in kinetic energy of the moving part of the chain. This part has 
length L — x/2 (because x/2 is the length hanging below the support), so conservation 
of energy gives 


soul x/2)v* = oxg(L — x/4) “= pee D 5.183) 


This has the expected property of going to infinity for x —> 2L. However, the finite 
size of the bend becomes relevant when x approaches 2L, so all of the energy never 
ends up being concentrated in an infinitely small piece. All speeds therefore remain 
finite, as they must. 

Writing v as dx/dt in Eq. (5.183), and separating variables and integrating, gives 


1 77 | L—x/2 Lf? 1-2/2 
t= f dx t= f dz, (5.184) 
Jk 0 2x(L — x/4) g Jo 2z(1 — z/4) 


where we have changed variables to z = x/L. Numerically integrating this gives a 
total time of t ~ (1.694)/L/g. Since the freefall time in the previous problem is given 
by gt /2 = 2L => t = 2/L/g, we see that the time in the present energy-conserving 
case is about 0.847 times the time in the freefall case. 

To find the tension T at the left end of the bend, let’s paint a little dot on the chain 
there. We’ ll find the acceleration of the falling part above the dot and then use F = ma. 
The acceleration of the falling part is a = dv/dt, where v is given in Eq. (5.183). Taking 
the derivative, you can show (don’t forget the dx/dt from the chain rule) that a can be 


written in the form, 
_ @/2)(L—x/4) 
a=e(1 + Exo ) (5.185) 


The F = ma equation for the part of the chain above the dot, with downward taken to 
be positive, gives T+ mg = ma T = m(a—g). Therefore, with m = o(L—x/2), 
we have 


o E 2 
(x/2)(L 9) _ ogx(L—x/4) _ ov (5.186) 


T=o0(L vide ( (L — x/2)2 2(L — x/2) 4° 


where we have used Eq. (5.183). This has the expected properties of equaling zero 
when x = 0 and diverging when x — 2L. 

To find the tension at the right end of the bend, note that the total upward tension on 
the tiny bend equals the rate of change in momentum of the bend (the gravitational force 
is negligible). In a small time dt, a length v dt/2 of chain is brought to rest (the factor 
of 2 comes from the doubling-up effect). This length goes from moving downward at 
speed v to being at rest, so the change in momentum is ø (v dt/2)v upward. Therefore, 
dp/dt = ov"/2. This must be the total upward force on the bend, and since we found 
above that the upward force at the left end is ov?/4, there must also be an upward force 
at the right end of ov?/4. The tensions at the two ends are therefore equal. 


5.30. Falling from a table 


(a) First soLUTION: Let o be the mass density of the chain. From conservation of 
energy, we know that the chain’s final kinetic energy, which is (o L)v? /2, equals 
the loss in potential energy. This loss equals (o L)(L/2)g, because the center of 
mass falls a distance L/2. Therefore, 


v= gL. (5.187) 


This equals the speed obtained by an object that falls a distance L/2. Note that 
if the initial piece hanging down through the hole is arbitrarily short, then the 
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34 


35 


chain will take an arbitrarily long time to fall down. But the final speed will still 
be (arbitrarily close to) ./gL. 


SECOND SOLUTION: Let x be the length that hangs down through the hole. The 
gravitational force on this length, which is (ox)g, is responsible for changing 
the momentum of the entire chain, which is (o L)x. Therefore, F = dp/dt gives 
(ox)g = (oL)Xx, which is simply the F = ma equation. Hence, ¥ = (g/L)x, and 
the general solution to this equation is? 


x(t) = AelV8/ + Be“ tV8/E (5.188) 


Let T be the time for which x(7) = L. If e is very small, then T will be very 
large. But for large ¢ (more precisely, for t >> ./L/g), we may neglect the 
negative-exponent term in Eq. (5.188). We then have 


xa Aey == ix (4e g/L )vgiL ~x g/L (for large t). 
(5.189) 


When x = L, we obtain 


X(T) = Lyg/L = ygL, (5.190) 


in agreement with the first solution. 

(b) Let o be the mass density of the chain, and let x be the length that hangs down 
through the hole. The gravitational force on this length, which is (ox)g, is 
responsible for changing the momentum of the chain. This momentum is (o x)x, 
because only the hanging part is moving. Therefore, F = dp/dt gives 


xg = xk +37. (5.191) 


Note that F = ma gives the wrong equation, because it neglects the fact that the 
amount of moving mass, ox, is changing. It therefore misses the second term on 
the right-hand side of Eq. (5.191). In short, the momentum of the chain increases 
because it is speeding up (which gives the xx term) and because additional mass 
is continually being added to the moving part (which gives the x? term, as you 
can show). 

Let’s now solve Eq. (5.191) for x(t). Since g is the only parameter in the 
equation, the solution for x(t) can involve only g’s and t’s.*> By dimensional 
analysis, x(t) must then be of the form x(t) = bet”, where b is a numerical 
constant to be determined. Plugging this expression for x(t) into Eq. (5.191) and 
dividing by g’?? gives b = 2b? + 4b?. Therefore, b = 1/6, and our solution 
may be written as 


x) = ; ($) 2. (5.192) 


This is the equation for something that accelerates downward with acceleration 
g’ = g/3. The time the chain takes to fall a distance L is then given by L = 
g't? /2, which yields t = \/2L/g’. The final speed is thus 


2eL 
v =g't = 2g = $ : (5.193) 


If € is the initial value of x, then A = B = e€/2 satisfies the initial conditions x(0) = € and 
x(0) = 0, in which case we can write x(t) = € cosh (t g/L). But we won’t need this information 
in what follows. 

The other dimensionful quantities in the problem, Z and ø, do not appear in Eq. (5.191), so they 
cannot appear in the solution. Also, the initial position and speed (which in general appear in the 
solution for x(t), because Eq. (5.191) is a second-order differential equation) do not appear in this 
case, because they are equal to zero. 
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This is smaller than the ./gZ result in part (a). We therefore see that although 
the total time for the scenario in part (a) is very large, the final speed in that case 
is in fact larger than that in the present scenario. You can show that the speed in 
part (a)’s scenario is smaller than the speed in part (b)’s scenario for x less than 
2L/3, but larger for x greater than 2L/3. 


Remarks: Using Eq. (5.193), you can show that 1/3 of the available potential 
energy is lost to heat. This inevitable loss occurs during the abrupt motions that 
suddenly bring the atoms from zero to nonzero speed when they join the moving 
part of the chain. The use of conservation of energy is therefore not a valid way 
to solve part (b). If you used conservation of energy, you would (as you can 
verify) obtain an incorrect acceleration of g/2. In view of the above solution 
based on F = dp/dt, this g/2 result cannot be correct, because there is simply 
not enough downward force in the setup to yield this acceleration. The only 
downward force comes from gravity, and we showed above that this leads to an 
acceleration of g/3. 

If we want to try to get rid of the energy loss and somehow produce an 
acceleration of g/2, a plausible idea is to use a continuous piece of rope instead 
of the ideal kind of chain we’ ve been using. As mentioned near the end of Section 
5.8, arope yields an energy-conserving system. However, from the reasoning on 
page 171, there is now a nonzero tension everywhere throughout the rope, even 
in the part inside the heap; this is the price we pay for having none of the points 
in the rope abruptly go from zero to nonzero speed. This tension then causes the 
rope (all of it) in the heap to move. The system is therefore completely different 
from the original one with our ideal chain, where it was understood that all the 
little strings connecting the ideal point masses in the heap are initially limp with 
zero tension; the tension in a given little string becomes nonzero only when it 
joins the moving part of the chain. The solution to this new energy-conserving 
setup therefore depends on exactly how the rope in the heap is initially situated; 
more information is therefore needed to solve the problem. But one thing is 
certain: this new setup is definitely not going to yield an acceleration of g/2, 
because the conservation-of-energy solution that leads to g/2 is based on the 
assumption that the entire loss in potential energy goes exclusively into the gain 
in kinetic energy of the part of the rope that has fallen below the table. The fact 
that the rope inside the heap is now also moving ruins this assumption. & 


5.31. The raindrop 


Let p be the mass density of the raindrop, and let à be the average mass density in 
space of the water droplets. Let r(t), M (t), and v(t) be the radius, mass, and velocity of 
the raindrop, respectively. We need three equations to solve for these three unknowns. 
The equations we will use are two different expressions for dM/dt, and the F = dp/dt 
expression for the raindrop. The first expression for M is obtained by taking the 
derivative of M = (4/3)r3p, which gives 


M =4nrip (5.194) 
r 
=3M-. (5.195) 
r 


The second expression for M is obtained by noting that the change in M is due to the 
acquisition of water droplets. The raindrop sweeps out volume at a rate given by its 
cross-sectional area times its velocity. Therefore, 


M=nrv~. (5.196) 


The F = dp/dt equation is found as follows. The gravitational force is Mg, and the 
momentum is Mv. Therefore, F = dp/dt gives 


Mg = Mv + Mv. (5.197) 
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We now have three equations involving the three unknowns, r, M, and v.36 Our goal 
is to find ù. We will do this by first finding 7. Equating the expressions for M in Eqs. 
(5.194) and (5.196) gives 


4p 
=i 5.198 
v 1 r ( ) 
. 4p. 
= i= 7. (5.199) 


Plugging Eqs. (5.195), (5.198), and (5.199) into Eq. (5.197) gives 
r\ (4p, 4p.. 
Mg =|(3M-)|—r)+M(-—}. (5.200) 
r À À 


čr = 12}? + 4rř, (5.201) 


Therefore, 


where we have defined g = gå /p, for convenience. The only parameter in Eq. (5.201) 
is &. Therefore, r (£) can depend only on & and t.*’ Hence, by dimensional analysis, r 
must take the form 


r(t) = Agr’, (5.202) 


where A is a numerical constant, to be determined. Plugging this expression for r into 
Eq. (5.201) gives 


ERP) = 12(24Bt)* + 4(ABr?) (2.48) 
=> A= 48424847. (5.203) 


Therefore, A = 1/56, and so 7 = 2Ag = g/28 = gd /28p. Eq. (5.199) then gives the 
acceleration of the raindrop as 


i= =, (5.204) 
7 
independent of p and à. For further discussion of the raindrop problem, see Krane 
(1981). 


REMARK: A common invalid solution to this problem is the following, which (incor- 
rectly) uses conservation of energy: The fact that v is proportional to 7 (shown in Eq. 
(5.198)) means that the volume swept out by the raindrop is a cone. The center of mass 
of a cone is 1/4 of the way from the base to the apex (as you can show by integrating 
over horizontal circular slices). Therefore, if M is the mass of the raindrop after it has 
fallen a height A, then an (incorrect) application of conservation of energy gives 

h 2 gh 


zM” Ms7 > wast. (5.205) 


Taking the derivative of this (or just using the general result, v? = 2ah) gives 


p= A (incorrect). (5.206) 


36 Note that we cannot write down the naive conservation-of-energy equation (which would say that 
the decrease in the water’s potential energy equals the increase in its kinetic energy), because 
mechanical energy is not conserved. The collisions between the raindrop and the droplets are 
completely inelastic. The raindrop will, in fact, heat up. See the remark at the end of the solution. 

37 The other dimensionful quantities in the problem, p and A, do not appear in Eq. (5.201), except 
through g, so they cannot appear in the solution. Also, the initial values of r and 7 (which in 
general appear in the solution for r(t), because Eq. (5.201) is a second-order differential equation) 
do not appear in this case, because they are equal to zero. 


5.11 Solutions 


The reason why this solution is invalid is that the collisions between the raindrop and 
the droplets are completely inelastic. Heat is generated, and the overall kinetic energy 
of the raindrop is smaller than you would otherwise expect. 

Let’s calculate how much mechanical energy is lost (and therefore how much the 
raindrop heats up) as a function of the height fallen. The loss in mechanical energy is 


h 1, 
Eos = Mg — 7M0. (5.207) 


Using v= 2(g/7)h, this becomes 
3 
AEF int = Fiost = zgh (5.208) 


where AF;nt is the gain in internal thermal energy. The energy required to heat 1 g of 
water by 1 °C is 1 calorie (= 4.18 joules). Therefore, the energy required to heat 1 kg 
of water by 1°C is ~ 4200 J. In other words, 


AEint = 4200 M AT, (5.209) 


where M is measured in kilograms, and T is measured in degrees Celsius. Equations 
(5.208) and (5.209) give the increase in temperature as a function of h, 


3 
4200 AT = = gh. (5.210) 


How far must the raindrop fall before it starts to boil? If we assume that the water 
droplets’ temperature is near freezing, then the height through which the raindrop 
must fall to have AT = 100°C is found from Eq. (5.210) to be 


h ~ 400 000 m = 400 km, (5.211) 


which is much larger than the height of the atmosphere. We have, of course, idealized 
the problem in a drastic manner. But needless to say, there is no need to worry about 
getting burned by the rain. A typical value for A is a few kilometers, which would raise 
the temperature by only about one degree. This effect is completely washed out by 
many other factors. & 
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Chapter 6 
The Lagrangian method 


In this chapter, we’re going to learn about a whole new way of looking at things. 
Consider the system of a mass on the end of a spring. We can analyze this, of 
course, by using F = ma to write down mx = —kx. The solutions to this equation 
are sinusoidal functions, as we well know. We can, however, figure things out 
by using another method which doesn’t explicitly use F = ma. In many (in 
fact, probably most) physical situations, this new method is far superior to using 
F = ma. You will soon discover this for yourself when you tackle the problems 
and exercises for this chapter. We will present our new method by first stating its 
rules (without any justification) and showing that they somehow end up magically 
giving the correct answer. We will then give the method proper justification. 


6.1 The Euler-Lagrange equations 


Here is the procedure. Consider the following seemingly silly combination of the 
kinetic and potential energies (T and V, respectively), 


L=T-V. (6.1) 


This is called the Lagrangian. Yes, there is a minus sign in the definition (a plus 
sign would simply give the total energy). In the problem of a mass on the end of 
a spring, T = mx*/2 and V = kx? /2, so we have 


1 1 
L= —mi? — —k’. (6.2) 
2 2 
Now write 
d (aL\ ƏL 63 
dt\ ax) ax’ a 


Don’t worry, we’ll show you in Section 6.2 where this comes from. This equation 
is called the Euler-Lagrange (E-L) equation. For the problem at hand, we have 
dL/dx = mx and dL/dx = —kx (see Appendix B for the definition of a partial 
derivative), so Eq. (6.3) gives 


mł = —kx, (6.4) 


6.1 The Euler-Lagrange equations 


which is exactly the result obtained by using = ma. An equation such as 
Eq. (6.4), which is derived from the Euler-Lagrange equation, is called an 
equation of motion.' If the problem involves more than one coordinate, as most 
problems do, we just have to apply Eq. (6.3) to each coordinate. We will obtain 
as many equations as there are coordinates. Each equation may very well involve 
many of the coordinates (see the example below, where both equations involve 
both x and 6). 

At this point, you may be thinking, “That was a nice little trick, but we just 
got lucky in the spring problem. The procedure won’t work in a more general 
situation.” Well, let’s see. How about if we consider the more general problem 
of a particle moving in an arbitrary potential V (x) (we’ll stick to one dimension 
for now). The Lagrangian is then 


1 
L= sme — V(x), (6.5) 
and the Euler-Lagrange equation, Eq. (6.3), gives 
my = — —. (6.6) 


But —dV /dx is the force on the particle. So we see that Eqs. (6.1) and (6.3) 
together say exactly the same thing that F = ma says, when using a Cartesian 
coordinate in one dimension (but this result is in fact quite general, as we'll see 
in Section 6.4). Note that shifting the potential by a given constant has no effect 
on the equation of motion, because Eq. (6.3) involves only the derivative of V. 
This is equivalent to saying that only differences in energy are relevant, and not 
the actual values, as we well know. 

In a three-dimensional setup written in terms of Cartesian coordinates, the 
potential takes the form V (x,y,z), so the Lagrangian is 


ae ; . 
L= sme +37? +27) —V(x,y,z). (6.7) 


It then immediately follows that the three Euler-Lagrange equations (obtained 
by applying Eq. (6.3) to x, y, and z) may be combined into the vector statement, 


mx = —VV. (6.8) 


But —VV = F, so we again arrive at Newton’s second law, F = ma, now in 
three dimensions. 

Let’s do one more example to convince you that there’s really something 
nontrivial going on here. 


' The term “equation of motion” is a little ambiguous. It is understood to refer to the second-order 
differential equation satisfied by x, and not the actual equation for x as a function of ¢, namely 
x(t) = Acos(wt + @) in this problem, which is obtained by integrating the equation of motion 
twice. 
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Example (Spring pendulum): Consider a pendulum made of a spring with a 
mass m on the end (see Fig. 6.1). The spring is arranged to lie in a straight line 
(which we can arrange by, say, wrapping the spring around a rigid massless rod). The 
equilibrium length of the spring is £. Let the spring have length £ + x(t), and let its 
angle with the vertical be @(t). Assuming that the motion takes place in a vertical 
plane, find the equations of motion for x and 0. 


Solution: The kinetic energy may be broken up into the radial and tangential 
parts, so we have 


1 , 
T= zm? J+ ge). (6.9) 


The potential energy comes from both gravity and the spring, so we have 
V(x,0) = —mg(£ + x) cos + she. (6.10) 
The Lagrangian is therefore 
L=T-V= sm? +(e + x)°6?) + mg(l +x) cos6 — she. (6.11) 
There are two variables here, x and 0. As mentioned above, the nice thing about the 


Lagrangian method is that we can just use Eq. (6.3) twice, once with x and once with 
0. So the two Euler-Lagrange equations are 


d(al\ al . 5 
—(—J=— = m=m((l+x)0° + mgcosð — kx, (6.12) 
dt \ ax Ox 
and 
eS es “( (¢+x)°6) (€+x) sing 
—f{ —)= — = = — in 
di\ a6) 0 aver Mee Le 


=> m(l+x)?6+2m(l+x)x6 = —me(l +x) sind 
=> ml+x)b +2mx6 = -mg sind. (6.13) 


Equation (6.12) is simply the radial F = ma equation, complete with the centripetal 
acceleration, — (£ + x)6. And the first line of Eq. (6.13) is the statement that the 
torque equals the rate of change of the angular momentum (this is one of the sub- 
jects of Chapter 8). Alternatively, if you want to work in a rotating reference frame, 
then Eq. (6.12) is the radial F = ma equation, complete with the centrifugal force, 
m(€ + x)62. And the third line of Eq. (6.13) is the tangential F = ma equation, 
complete with the Coriolis force, —2mx0. But never mind about this now. We’ll deal 
with rotating frames in Chapter 10. 


2 Throughout this chapter, I’ll occasionally point out torques, angular momenta, centrifugal forces, 
and other such things when they pop up in equations of motion, even though we haven’t covered 


6.2 The principle of stationary action 


Remark: After writing down the E-L equations, it is always best to double-check them by 
trying to identify them as F = ma and/or t = dL /dt equations (once we learn about that). 
Sometimes, however, this identification isn’t obvious. And for the times when everything 
is clear (that is, when you look at the E-L equations and say, “Oh, of course!”), it is usually 
clear only after you’ve derived the equations. In general, the safest method for solving 
a problem is to use the Lagrangian method and then double-check things with F = ma 
and/or t = dL/dt if you can. & 


At this point it seems to be personal preference, and all academic, whether you 
use the Lagrangian method or the F = ma method. The two methods produce 
the same equations. However, in problems involving more than one variable, it 
usually turns out to be much easier to write down T and V, as opposed to writing 
down all the forces. This is because T and V are nice and simple scalars. The 
forces, on the other hand, are vectors, and it is easy to get confused if they point 
in various directions. The Lagrangian method has the advantage that once you’ve 
written down L = T — V, you don’t have to think anymore. All you have to do 
is blindly take some derivatives.’ 


When jumping from high in a tree, 
Just write down del L by del z. 

Take del L by z dot, 

Then f¢-dot what you’ve got, 

And equate the results (but quickly!) 


But ease of computation aside, is there any fundamental difference between the 
two methods? Is there any deep reasoning behind Eq. (6.3)? Indeed, there is ... 


6.2 The principle of stationary action 


Consider the quantity, 


t2 
s=| L(x, x, tdt. (6.14) 
ti 


S is called the action. It is a quantity with the dimensions of (Energy) x (Time). S 
depends on L, and L in turn depends on the function x(t) via Eq. (6.1). Given any 


them yet. I figure it can’t hurt to bring your attention to them. But rest assured, a familiarity with 
these topics is by no means necessary for an understanding of what we’ll be doing in this chapter, 
so just ignore the references if you want. One of the great things about the Lagrangian method is 


99 6 


that even if you’ve never heard of the terms “torque,” “centrifugal,” “Coriolis,” or even “F = ma” 
itself, you can still get the correct equations by simply writing down the kinetic and potential 


energies, and then taking a few derivatives. 


w 


Well, you eventually have to solve the resulting equations of motion, but you have to do that with 
the F = ma method, too. 

In some situations, the kinetic and potential energies in L = T — V may explicitly depend on time, 
so we have included the “£” in Eq. (6.14). 
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Fig. 6.2 


The Lagrangian method 


function x(t), we can produce the quantity S. We’ Il deal with only one coordinate, 
x, for now. 

Integrals like the one in Eq. (6.14) are called functionals, and S is sometimes 
denoted by S[x(t)]. It depends on the entire function x(t), and not on just one 
input number, as a regular function f (t) does. S can be thought of as a function 
of an infinite number of values, namely all the x(¢) for t ranging from ¢ to t2. If 
you don’t like infinities, you can imagine breaking up the time interval into, say, 
a million pieces, and then replacing the integral by a discrete sum. 

Let’s now pose the following question: Consider a function x(t), for ti < 
t < b, which has its endpoints fixed (that is, x(t;) = xı and x(t) = x2, 
where x; and x2 are given), but is otherwise arbitrary. What function x(t) yields 
a stationary value of S? A stationary value is a local minimum, maximum, or 
saddle point.’ 

For example, consider a ball dropped from rest, and consider the function 
y(t) for 0 < t < 1. Assume that we somehow know that y(0) = 0 and 
y(1) = —g/2.° A number of possibilities for y(t) are shown in Fig. 6.2, and 
each of these can (in theory) be plugged into Eqs. (6.1) and (6.14) to generate S. 
Which one yields a stationary value of S? The following theorem gives us the 
answer. 


Theorem 6.1 /f the function xo(t) yields a stationary value (that is, a local 
minimum, maximum, or saddle point) of S, then 


d ( OL ) aL (6.15) 


dt\ dio) ~ xo 


It is understood that we are considering the class of functions whose endpoints 
are fixed. That is, x(t,) = xı and x(t) = x2. 


Proof: We will use the fact that if a certain function xo(¢) yields a stationary 
value of S, then any other function very close to xo(¢) (with the same endpoint 
values) yields essentially the same S, up to first order in any deviations. This is 
actually the definition of a stationary value. The analogy with regular functions 
is that if f (b) is a stationary value of f, then f (b + €) differs from f (b) only at 
second order in the small quantity €. This is true because f'(b) = 0, so there is 
no first-order term in the Taylor series expansion around b. 

Assume that the function x(t) yields a stationary value of S, and consider the 
function 


Xa(t) = xo(t) + aß (t), (6.16) 


5 A saddle point is a point where there are no first-order changes in S, and where some of the 
second-order changes are positive and some are negative (like the middle of a saddle, of course). 
é This follows from y = — gt? /2, but pretend that we don’t know this formula. 


6.2 The principle of stationary action 


where a is a number, and where f(t) satisfies B(t;) = B(t2) = 0 (to keep the 
endpoints of the function fixed), but is otherwise arbitrary. When producing the 
action S[x,q(t)] in (6.14), the ¢ is integrated out, so S is just a number. It depends 
on a, in addition to ¢; and ¢2. Our requirement is that there be no change in S at 
first order in a. How does S depend on a? Using the chain rule, we have 


f] a 2 2 OL 
—S >a] = — Ldt = — dt 
ða ba] da [ i da 


n / aL aL as 
=j (3 OMe y A vt) at. (6.17) 
ty OXq Oa OXq Oa 


In other words, a influences S through its effect on x, and also through its effect 
on x. From Eq. (6.16), we have 


Oe an. oe (6.18) 
da da 
so Eq. (6.17) becomes’ 
a 2 aL OL . 
——S[xa(] = / (0 + =) dt. (6.19) 
ða 4 OXa IXa 


Now comes the one sneaky part of the proof. We will integrate the second term 
by parts (you will see this trick many times in your physics career). Using 


aL . aL d aL 
[ba ane IG T) Bat (6.20) 


Eq. (6.19) becomes 


ð mat dalL aL 
2 Stxa(t)] = ee ee 
aq aI [ (= di 2) mar 


bh 


(6.21) 


ty 


But £ (t1) = B(t2) = 0, so the last term (the “boundary term”) vanishes. We now 
use the fact that (0/da)S[xq(t)] must be zero for any function 6(t), because we 
are assuming that xo(¢) yields a stationary value. The only way this can be true 
is if the quantity in parentheses above (evaluated at a = 0) is identically equal to 
zero, that is, 


d ( +) _ aL a G 


dt\ dio) Axo 


7 Note that nowhere do we assume that xq and <q are independent variables. The partial derivatives 
in Eq. (6.18) are very much related, in that one is the derivative of the other. The use of the chain 
tule in Eq. (6.17) is still perfectly valid. 
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The E-L equation, Eq. (6.3), therefore doesn’t just come out of the blue. It 
is a consequence of requiring that the action be at a stationary value. We may 
therefore replace F = ma by the following principle. 


e The principle of stationary action: 
The path of a particle is the one that yields a stationary value of the action. 


This principle (also known as Hamilton’s principle) is equivalent to F = ma 
because the above theorem shows that if (and only if, as you can show by working 
backwards) we have a stationary value of S, then the E-L equations hold. And the 
E-L equations are equivalent to F = ma (as we showed for Cartesian coordinates 
in Section 6.1, and as we’ll prove for any coordinate system in Section 6.4). 
Therefore, “stationary action” is equivalent to F = ma. 

If we have a multidimensional setup where the Lagrangian is a function of the 
variables x; (ft), x2(t),..., then the above principle of stationary action is still all 
we need. With more than one variable, we can now vary the path by varying each 
coordinate (or combinations thereof). The variation of each coordinate produces 
an E-L equation which, as we saw in the Cartesian case, is equivalent to an 
F = ma equation. 

Given a classical mechanics problem, we can solve it with F = ma, or we 
can solve it with the E-L equations, which are a consequence of the principle of 
stationary action (often called the principle of “least action” or “minimal action,” 
but see the fourth remark below). Either method will get the job done. But as 
mentioned at the end of Section 6.1, it is often easier to use the latter, because it 
avoids the use of force which can get confusing if you have forces pointing in all 
sorts of complicated directions. 


It just stood there and did nothing, of course, 
A harmless and still wooden horse. 

But the minimal action 

Was just a distraction; 

The plan involved no use of force. 


Let’s now return to the example of a ball dropped from rest, mentioned above. 
The Lagrangian is L = T — V = mý?/2 — mgy, so Eq. (6.22) gives } = —g, 
which is simply the F = ma equation (divided through by m), as expected. The 
solution is y(t) = —gt?/2 + vot + yo, as we well know. But the initial conditions 
tell us that vp = yo = 0, so our solution is y(t) = —gt*/2. You are encouraged 
to verify explicitly that this y(t) yields an action that is stationary with respect 
to variations of the form, say, y(t) = —gt?/2 + et(t — 1), which also satisfies 
the endpoint conditions (this is the task of Exercise 6.30). There is, of course, 
an infinite number of other ways to vary y(t), but this specific result should help 
convince you of the validity of Theorem 6.1. 

Note that the stationarity implied by the Euler-Lagrange equation, Eq. (6.22), 
is a local statement. It gives information only about nearby paths. It says nothing 


6.2 The principle of stationary action 


about the global nature of how the action depends on all possible paths. If we find 
that a solution to Eq. (6.22) happens to produce a local minimum (as opposed to a 
maximum or a saddle), there is no reason to conclude that it is a global minimum, 
although in many cases it turns out to be (see Exercise 6.32, for the case of a 
thrown ball). 


REMARKS: 


1. Theorem 6.1 is based on the assumption that the ending time, t2, of the motion is given. 
But how do we know this final time? Well, we don’t. In the example of a ball thrown 
upward, the total time to rise and fall back to your hand can be anything, depending on the 
ball’s initial speed. This initial speed will show up as an integration constant when solving 
the E-L equations. The motion must end sometime, and the principle of stationary action 
says that for whatever time this happens to be, the physical path has a stationary action. 

2. Theorem 6.1 shows that we can explain the E-L equations by the principle of stationary 
action. This, however, simply shifts the burden of proof. We are now left with the task 
of justifying why we should want the action to have a stationary value. The good news 
is that there is a very solid reason for this. The bad news is that the reason involves 
quantum mechanics, so we won’t be able to discuss it properly here. Suffice it to say that 
a particle actually takes all possible paths in going from one place to another, and each 
path is associated with the complex number e'/” (where A = 1.05 - 10734 J s is Planck's 
constant). These complex numbers have absolute value 1 and are called “phases.” It turns 
out that the phases from all possible paths must be added up to give the “amplitude” of 
going from one point to another. The absolute value of the amplitude must then be squared 
to obtain the probability.® 

The basic point, then, is that at a nonstationary value of S, the phases from different 
paths differ (greatly, because h is so small compared with the typical size of the action for 
a macroscopic particle) from one another, which effectively leads to the addition of many 
random vectors in the complex plane. These end up canceling each other, yielding a sum 
of essentially zero. There is therefore no contribution to the overall amplitude from non- 
stationary values of S. Hence, we do not observe the paths associated with these S’s. At a 
stationary value of S, however, all the phases take on essentially the same value, thereby 
adding constructively instead of destructively. There is therefore a nonzero probability 
for the particle to take a path that yields a stationary value of S. So this is the path we 
observe. 

3. But again, the preceding remark simply shifts the burden of proof one step further. We 
must now justify why these phases e’5/” should exist, and why the Lagrangian that appears 
in S should equal T — V. But here’s where we’re going to stop. 

4. The principle of stationary action is sometimes referred to as the principle of “least” 
action, but this is misleading. True, it is often the case that the stationary value turns out to 
be a minimum value, but it need not be, as we can see in the following example. Consider 
a harmonic oscillator which has a Lagrangian equal to 

| i (6.23) 

2 2" 
Let xo(t) be a function that yields a stationary value of the action. Then we know that 
xo(t) satisfies the E-L equation, m¥p = —kxo. Consider a slight variation on this path, 


8 This is one of those remarks that is completely useless, because it is incomprehensible to those 
who haven’t seen the topic before, and trivial to those who have. My apologies. But this and the 
following remarks are definitely not necessary for an understanding of the material in this chapter. 
If you’re interested in reading more about these quantum mechanical issues, you should take a 
look at Richard Feynman’s book (Feynman, 2006). Feynman was, after all, the one who thought 
of this idea. 
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xolt) + E(t), where E(t) satisfies (t1) = &(t2) = 0. With this new function, the action 
becomes 


Se -f EE F2 +4?) I T °)) z aay 


The two cross-terms add up to zero, because after integrating the xo term by parts, their 
sum is 


mxo& 


t Q 
= f (ničo + kxo)é dt. (6.25) 
H ti 


The first term is zero, due to the boundary conditions on (t). The second term is zero, 
due to the E-L equation. We’ve basically just reproduced the proof of Theorem 6.1 for 
the special case of the harmonic oscillator here. 

The terms in Eq. (6.24) involving only xg give the stationary value of the action 
(call it So). To determine whether So is a minimum, maximum, or saddle point, we 
must look at the difference, 


h 


AS = Sz — So = ; f (m? — ké?) dt. (6.26) 


ti 


It is always possible to find a function € that makes AS positive. Simply choose & to be 
small, but make it wiggle very fast, so that ¿ is large. Therefore, it is never the case that 
So is a maximum. Note that this reasoning works for any potential, not just a harmonic 
oscillator, as long as it is a function of position only (that is, it contains no derivatives, as 
we always assume). 

You might be tempted to use the same line of reasoning to say that it is also always 
possible to find a function £ that makes AS negative, by making £ large and é small. 
If this were true, then we could put everything together and conclude that all stationary 
points are saddle points, for a harmonic oscillator. However, it is not always possible to 
make & large enough and é small enough so that AS is negative, due to the boundary 
conditions &(t;) = &(t) = 0. If £ changes from zero to a large value and then back to 
zero, then & may also have to be large, if the time interval is short enough. Problem 6.6 
deals quantitatively with this issue. For now, let’s just recognize that in some cases So is 
a minimum, in some cases it is a saddle point, and it is never a maximum. “Least action” 
is therefore a misnomer. 


. It is sometimes said that nature has a “purpose,” in that it seeks to take the path that 


produces the minimum action. In view of the second remark above, this is incorrect. 
In fact, nature does exactly the opposite. It takes every path, treating them all on equal 
footing. We end up seeing only the path with a stationary action, due to the way the 
quantum mechanical phases add. It would be a harsh requirement, indeed, to demand 
that nature make a “global” decision (that is, compare paths that are separated by large 
distances) and choose the one with the smallest action. Instead, we see that everything 
takes place on a “local” scale. Nearby phases simply add, and everything works out 
automatically. 

When an archer shoots an arrow through the air, the aim is made possible by all the other 
arrows taking all the other nearby paths, each with essentially the same action. Likewise, 
when you walk down the street with a certain destination in mind, you’re not alone... 


When walking, I know that my aim 

Is caused by the ghosts with my name. 
And although I can’t see 

Where they walk next to me, 

I know they’re all there, just the same. 


6.3 Forces of constraint 


6. Consider a function, f(x), of one variable (for ease of terminology). Let f (b) be a local 
minimum of f. There are two basic properties of this minimum. The first is that f (b) 
is smaller than all nearby values. The second is that the slope of f is zero at b. From 
the above remarks, we see that (as far as the action S is concerned) the first property is 
completely irrelevant, and the second one is the whole point. In other words, saddle points 
(and maxima, although we showed above that these never exist for S) are just as good as 
minima, as far as the constructive addition of the e’S/” phases is concerned. 

7. Given that classical mechanics is an approximate theory, while quantum mechanics is 
the (more) correct one, it is quite silly to justify the principle of stationary action by 
demonstrating its equivalence to F = ma, as we did above. We should be doing it the 
other way around. However, because our intuition is based on F = ma, it’s easier to start 
with F = ma as the given fact, rather than calling upon the latent quantum-mechanical 
intuition hidden deep within all of us. Maybe someday ... 

At any rate, in more advanced theories dealing with fundamental issues concerning the 
tiny building blocks of matter (where actions are of the same order of magnitude as h), 
the approximate F = ma theory is invalid, and you have to use the Lagrangian method. 

8. When dealing with a system in which a nonconservative force such as friction is present, 
the Lagrangian method loses much ofits appeal. The reason for this is that nonconservative 
forces don’t have a potential energy associated with them, so there isn’t a specific V (x) that 
you can write down in the Lagrangian. Although friction forces can in fact be incorporated 
in the Lagrangian method, you have to include them in the E-L equations essentially by 
hand. We won’t deal with nonconservative forces in this chapter. & 


6.3 Forces of constraint 


Anice thing about the Lagrangian method is that we are free to impose any given 
constraints at the beginning of the problem, thereby immediately reducing the 
number of variables. This is always done (perhaps without thinking) whenever 
a particle is constrained to move on a wire or surface, etc. Often we are con- 
cerned not with the exact nature of the forces doing the constraining, but only 
with the resulting motion, given that the constraints hold. By imposing the con- 
straints at the outset, we can find the motion, but we can’t say anything about the 
constraining forces. 

If we want to determine the constraining forces, we must take a different 
approach. The main idea of the strategy, as we will show below, is that we must 
not impose the constraints too soon. This leaves us with a larger number of 
variables to deal with, so the calculations are more cumbersome. But the benefit 
is that we are able to find the constraining forces. 

Consider the setup of a particle sliding off a fixed frictionless hemisphere of 
radius R (see Fig. 6.3). Let’s say that we are concerned only with finding the 
equation of motion for 6, and not the constraining force. Then we can write 
everything in terms of 0, because we know that the radial distance r is cons- 
trained to be R. The kinetic energy is mR767 /2, and the potential energy (relative 
to the bottom of the hemisphere) is mgR cos 6, so the Lagrangian is 


1 , 
L= mR? — mgRcos9, (6.27) 


Fig. 6.3 
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and the equation of motion, via Eq. (6.3), is 
6 = (g/R) sind, (6.28) 


which is equivalent to the tangential F = ma statement. 

Now let’s say that we want to find the constraining normal force that the 
hemisphere applies to the particle. To do this, let’s solve the problem in a different 
way and write things in terms of both r and 0. Also (and here’s the critical step), 
let’s be really picky and say that r isn’t exactly constrained to be R, because in 
the real world the particle actually sinks into the hemisphere a little bit. This may 
seem a bit silly, but it’s really the whole point. The particle pushes and sinks 
inward a tiny distance until the hemisphere gets squashed enough to push back 
with the appropriate force to keep the particle from sinking in any more (just 
consider the hemisphere to be made of lots of little springs with very large spring 
constants). The particle is therefore subject to a (very steep) potential arising from 
the hemisphere’s force. The constraining potential, V (r), looks something like 
the plot in Fig. 6.4. The true Lagrangian for the system is thus 


_1 -2 1 1.242) _ 
L= gr +r°0°) — mgr cos — V(r). (6.29) 


(The 7? term in the kinetic energy will turn out to be insignificant.) The equa- 
tions of motion obtained from varying 0 and r are therefore 


mr?6 + 2mri = mgr sind, 
: (6.30) 
mř = mr? — mg cos — V' (r). 


Having written down the equations of motion, we will now apply the constraint 
condition that r = R. This condition implies 7 = 7 = 0. (Of course, r isn’t really 
equal to R, but any differences are inconsequential from this point onward.) The 
first of Eqs. (6.30) then reproduces Eq. (6.28), while the second yields 


dV ry) 
-— = mg cos 0 — mRO-. (6.31) 
dr r=R 
But Fy = —dV/dr is the constraint force applied in the r direction, which is 


precisely the force we are looking for. The normal force of constraint is therefore 
Fy(0,0) = mg cos @ — mR@. (6.32) 


This is equivalent to the radial F = ma equation, mg cos 0 — Fy = mR6? (which 
is certainly a quicker way to find the normal force in the present problem). Note 
that this result is valid only if Fy (6,6) > 0. If the normal force becomes zero, 
then this means that the particle has left the sphere, in which case r no longer 
equals R. 
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REMARKS: 


1. What if we instead had (unwisely) chosen Cartesian coordinates, x and y, instead of polar 
coordinates, r and 6? Since the distance from the particle to the surface of the hemisphere 
is n = yx? + y? — R, we obtain a true Lagrangian equal to 


L sme? +57) mgy — V (n). (6.33) 


The equations of motion are (using the chain rule) 
mł = ——-—, and m= -mg - ——. (6.34) 


We now apply the constraint condition 7 = 0. Since —dV /dn equals the constraint force 
F, you can show that the equations we end up with (namely, the two E-L equations and 
the constraint equation) are 


mi = Fo, my = mg +Fo, and ,/x2+y2—-R=0. (6.35) 


These three equations are sufficient to determine the three unknowns x, ý, and F as 
functions of the quantities x, x, y, and y. See Exercise 6.37, which should convince 
you that polar coordinates are the way to go. In general, the strategy for solving for F 
is to take two time derivatives of the constraint equation and then eliminate the second 
derivatives of the coordinates by using the E-L equations (this process was trivial in the 
polar-coordinate case). 

2. You can see from Eq. (6.35) that the E-L equations end up taking the form, 


d [ƏL ðL 
OD es aig (6.36) 
dt \ ðġi ðqi ðqi 


for each coordinate g;. The quantity 7 is what appears in the constraint equation, n = 0. 
In our hemisphere problem, we had n = r — R in polar coordinates, and n = y x? + y? — 
R in Cartesian coordinates. The E-L equations, combined with the n = 0 condition, 
give us exactly the number of equations (N + 1 of them, where N is the number of 
coordinates) needed to determine all of the V + 1 unknowns (all the g;, and F) in terms of 
the q; and qj. 

Writing down the equations in Eq. (6.36) is basically the method of Lagrange multipli- 
ers, where the Lagrange multiplier turns out to be the force. But if you’re not familiar with 
this method, no need to worry; you can derive everything from scratch using the above 
technique involving the steep potential. If you do happen to be familiar with it, then there 
might in fact be a need to worry about how you apply it, as the following remark explains. 

3. When trying to determine the forces of constraint, you can just start with Eq. (6.36), 
without bothering to write down V (n). But you must be careful to make sure that n does 
indeed represent the distance the particle is from where it should be. In polar coordinates, 
if someone gives you the constraint condition for the hemisphere as 7(7 — R) = 0, and 
if you use the left-hand side of this as the 7 in Eq. (6.36), then you will get the wrong 
constraint force; it will be too small by a factor of 7. Likewise, in Cartesian coordinates, 
writing the constraint as y—./R? — x? = 0 will give you the wrong force. The best way to 
avoid this problem is, of course, to pick one of your variables as the distance the particle 
is from where it should be (up to an additive constant, as in the case ofr — R = 0). & 
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When L is written in terms of Cartesian coordinates x, y, z, we showed in Section 
6.1 that the Euler-Lagrange equations are equivalent to Newton’s F = ma 
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equations; see Eq. (6.8). But what about the case where we use polar, spherical, 
or some other coordinates? The equivalence of the E-L equations and F = ma 
isn’t so obvious. As far as trusting the E-L equations for such coordinates goes, 
you can achieve peace of mind in two ways. You can accept the principle of 
stationary action as something so beautiful and profound that it simply has to 
work for any choice of coordinates. Or, you can take the more mundane road 
and show through a change of coordinates that if the E-L equations hold for one 
set of coordinates (and we know that they do hold for at least one set, namely 
Cartesian coordinates), then they also hold for any other coordinates (of a certain 
form, described below). In this section, we will demonstrate the validity of the 
E-L equations through the explicit change of coordinates.” 
Consider the set of coordinates, 


Xi: (1, X2,...,Xyn). (6.37) 


For example, if N = 6, then x1, x2, x3 could be the Cartesian x, y, z coordinates 
of one particle, and x4, x5, x6 could be the r, 6, @ polar coordinates of a second 
particle, and so on. Assume that the E-L equations hold for these variables, 
that is, 


d [ƏL aL 

—| —]=— (1<ix<N). 6.38 

“(= OX; (isis ) ( ) 
Consider a new set of variables that are functions of the x; and ¢, 


qi = qi (%1, X2, - - - , Xy; Ê). (6.39) 


We will restrict ourselves to the case where the q; do not depend on the x;. (This is 
quite reasonable. If the coordinates depended on the velocities, then we wouldn’t 
be able to label points in space with definite coordinates. We’d have to worry 
about how the particles were behaving when they were at the points. These would 
be strange coordinates indeed.) We can, in theory, invert Eq. (6.39) and express 
the x; as functions of the q; and t, 


Xi = Xi (qig aayi t): (6.40) 


Claim 6.2 Jf Eq. (6.38) is true for the x; coordinates, and if the x; and qi are 
related by Eq. (6.40), then Eq. (6.38) is also true for the q; coordinates. That is, 


d [{ OL aL 
( - ) = (l<m<N). (6.41) 
dt \ ddm Odm 


° This calculation is straightforward but a bit messy, so you may want to skip this section and just 
settle for the “beautiful and profound” reasoning. 


6.4 Change of coordinates 


Proof: We have 


N 3 
=) 5 ae (6.42) 


i i=l OX; ddm 


(Note that if the x; depended on the ġ;, then we would have to include the 
additional term, )*\(0L/0x;)(0x;/0Gm). But we have excluded such dependence.) 
Let’s rewrite the 0x;/0qm term. From Eq. (6.40), we have 


Š OX; 5 OX; 
— pails ee 6.43 
. 23 O4m MET ðt ( ) 
m=1 
Therefore, 
Ox j 0 
a (6.44) 
dm ddm 


Substituting this into Eq. (6.42) and taking the time derivative of both sides gives 
N 


N 
d ( aL d (0L\ 0x; OL (5) 
—| = À + - . (6.45) 
dt (=) 2 dt (=) ddm 2 Ox; dt \ Iqm 


In the second term here, it is legal to switch the order of the total derivative, d/dt, 
and the partial derivative, 0/0qgm. 


REMARK: In case you have your doubts, let’s prove that this switching is legal. 


(in) - ar (Ja a] 
dk + 
dt adm ðqk qm at Om 


a > OX; i i Ox; 
E ddm I at 


k=1 


ii 
E ddm f 


(6.46) 


In the first term on the right-hand side of Eq. (6.45), we can use the given 
information in Eq. (6.38) and rewrite the (d/dt)(0L/0x;) term. We then obtain 


T ƏL ax; wa AL AK; 


dt dgm E = ax; ddm i=l Ox; ddm 
ðL 
oe (6.47) 
ddm 
as we wanted to show. E 


We have therefore demonstrated that if the Euler-Lagrange equations are true 
for one set of coordinates, x; (and they are true for Cartesian coordinates), then 
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they are also true for any other set of coordinates, g;, satisfying Eq. (6.39). If 
you’re inclined to look at the principle of stationary action with distrust, thinking 
that it might be a coordinate-dependent statement, this proof should put you at 
ease. The Euler-Lagrange equations are valid in any coordinates. 

Note that the above proof did not in any way use the precise form of the 
Lagrangian. If L were equal to T + V, or 8T + xV*/T, or any other arbitrary 
function, our result would still be true: If Eq. (6.38) is true for one set of coordi- 
nates, then it is also true for any other set of coordinates q; satisfying Eq. (6.39). 
The point is that the only L for which the hypothesis is true at all (that is, for 
which Eq. (6.38) holds) is L = T — V (or any constant multiple of this). 


REMARK: On one hand, it is quite amazing how little we assumed in proving the above claim. 
Any new coordinates of the very general form in Eq. (6.39) satisfy the E-L equations, as long 
as the original coordinates do. If the E-L equations had, say, a factor of 5 on the right-hand 
side of Eq. (6.38), then they would not hold in arbitrary coordinates. To see this, just follow 
the proof through with the factor of 5. 

On the other hand, the claim is quite believable, if you make an analogy with a function 
instead of a functional. Consider the function f(z) = z?. This has a minimum at z = 0, 
consistent with the fact that df /dz = 0 at z = 0. But let’s now write f in terms of the variable 
y defined by, say, z = y*. Then f (y) = y’, and f has a minimum at y = 0, consistent with the 
fact that df /dy = 0 at y = 0. So f” = 0 holds in both coordinates at the corresponding points 
y =z = l. This is the (simplified) analog of the E-L equations holding in both coordinates. In 
both cases, the derivative equation describes where the stationary value occurs. 

This change-of-variables result may be stated in a more geometrical (and friendly) way. If 
you plot a function and then stretch the horizontal axis in an arbitrary manner (which is what 
happens when you change coordinates), then a stationary value (that is, one where the slope is 
zero) will still be a stationary value after the stretching.'° A picture (or even just the thought of 
one) is worth a dozen equations, apparently. 

As an example of an equation that does not hold for all coordinates, consider the preceding 
example, but with f’ = 1 instead of f” = 0. In terms of z, f’ = 1 when z = 1/2. And in terms 
of y, f! = 1 when y = (1/8)!/7. But the points z = 1/2 and y = (1/8)!/7 are not the same 
point. In other words, f’ = 1 is not a coordinate-independent statement. Most equations are 
coordinate dependent. The special thing about f’ = 0 is that a stationary point is a stationary 
point no matter how you look at it. & 


6.5 Conservation laws 
6.5.1 Cyclic coordinates 


Consider the case where the Lagrangian does not depend on a certain coordinate 
qk. Then 


ee _, Lo oe 
dt \ 09x ðqk ddk Í l 


10 There is, however, one exception. A stationary point in one coordinate system might be located at 
a kink in another coordinate system, so that f’ is not defined there. For example, if we had instead 
defined y by z = yl, then f y) = yl/2, which has an undefined slope at y = 0. Basically, we’ve 
stretched (or shrunk) the horizontal axis by a factor of infinity at the origin, and this is a process 
that can change a zero slope into an undefined one. But let’s not worry about this. 
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where C is a constant, that is, independent of time. In this case, we say that qg is a 
cyclic coordinate, and that 0L./dq, is a conserved quantity (meaning that it doesn’t 
change with time). If Cartesian coordinates are used, then 0L/dx, is simply the 
momentum, mx;, because x; appears only in the the kinetic energy’s mx; /2 term 
(we exclude cases where the potential depends on x). We therefore call 0L/dqg% 
the generalized momentum corresponding to the coordinate qg. And in cases 
where 0L/0q, does not change with time, we call it a conserved momentum. 
Note that a generalized momentum need not have the units of linear momentum, 
as the angular-momentum examples below show. 


Example (Linear momentum): Consider a ball thrown through the air. In the 
full three dimensions, the Lagrangian is 


1 
L= zmo? +57 +22) — mg. (6.49) 


There is no x or y dependence here, so both dL/dx = mx and dL/dy = my are 
constant, as we well know. The fancy way of saying this is that conservation of 
Px = mx arises from spatial translation invariance in the x direction. The fact that the 
Lagrangian doesn’t depend on x means that it doesn’t matter if you throw the ball in 
one spot, or in another spot a mile down the road. The setup is independent of the x 
value. This independence leads to conservation of py. Likewise for py. 


Example (Angular and linear momentum in cylindrical coordinates): 
Consider a potential that depends only on the distance to the z axis. In cylindrical 
coordinates, the Lagrangian is 


L= sm? +776 +37) — V(r). (6.50) 
There is no z dependence here, so dL/dzZ = mz is constant. Also, there is no 0 
dependence, so 8L/96 = mr?Å is constant. Since rÅ is the velocity in the tangential 
direction around the z axis, we see that our conserved quantity, mr(r6), is the angular 
momentum (discussed in Chapters 7—9) around the z axis. In the same manner as in 
the preceding example, conservation of angular momentum around the z axis arises 
from rotation invariance around the z axis. 


Example (Angular momentum in spherical coordinates): In spherical coor- 
dinates, consider a potential that depends only onr and 0. Our convention for spherical 
coordinates is that 6 is the angle down from the north pole, and ¢ is the angle around 
the equator. The Lagrangian is 


1 ' . 
L= smi +776? +r sin26 $7) — Vir, 6). (6.51) 


233 


234 


The Lagrangian method 


2 sin? é is constant. Since r sin @ is 


There is no ¢ dependence here, so L/0¢ = mr 
the distance from the z axis, and since r sin 0 ¢ is the speed in the tangential direction 
around the z axis, we see that our conserved quantity, m(rsin@)(r sin @ @), is the 


angular momentum around the z axis. 


6.5.2 Energy conservation 


We will now derive another conservation law, namely conservation of energy. 
The conservation of momentum or angular momentum above arose when the 
Lagrangian was independent of x, y, z, 0, or @. Conservation of energy arises 
when the Lagrangian is independent of time. This conservation law is different 
from those in the above momenta examples, because ¢ is not a coordinate that 
the stationary-action principle can be applied to. You can imagine varying the 
coordinates x, 0, etc., which are functions of t. But it makes no sense to vary f. 
Therefore, we’re going to have to prove this conservation law in a different way. 


Consider the quantity 
OL , 
E= L. (6.52) 
(SHa) 


E turns out (usually) to be the energy. We’ll show this below. The motivation 
for this expression for E comes from the theory of Legendre transforms, but we 
won’t get into that here. We’ll just accept the definition in Eq. (6.52) and prove 
a very useful fact about it. 


Claim 6.3 If L has no explicit time dependence (that is, if AL/dt = 0), then 
E is conserved (that is, dE/dt = 0), assuming that the motion obeys the E-L 
equations (which it does). 


Note that there is one partial derivative and one total derivative in this statement. 


Proof: L is a function of the q;, the g;, and possibly t. Making copious use of 
the chain rule, we have 


dE d OL , dL 
dt “(> agi a) dt 


-F dal), | db, eee +g) + 
en dt ðqi 1 agi agi! agi! at} 


= i=l 
(6.53) 


There are five terms here. The second cancels with the fourth. And the first 
(after using the E-L equation, Eq. (6.3), to rewrite it) cancels with the third. We 
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therefore arrive at the simple result, 


i ty (6.54) 


In the event that 0L/dt = 0 (that is, there are no f¢’s sitting on the paper 
when you write down L), which is usually the case in the situations we con- 
sider (because we generally won’t deal with potentials that depend on time), we 
have dE /dt = 0. a 


Not too many things are constant with respect to time, and the quantity £ has 
units of energy, so it’s a good bet that it’s the energy. Let’s show this in Cartesian 
coordinates (however, see the remark below). The Lagrangian is 


af <> ra 32) 
L= ym +y +2°)—-V(x, y, z), (6.55) 
so Eq. (6.52) gives 
= 1 E E. 
E = ams +y°+2°)+ VQ, y, z), (6.56) 


which is the total energy. The effect of the operations in Eq. (6.52) in most cases 
is just to switch the sign in front of the potential. 

Of course, taking the kinetic energy T and subtracting the potential energy V 
to obtain L, and then using Eq. (6.52) to produce E = T + V, seems like a rather 
convoluted way of arriving at 7 + V. But the point of all this is that we used the 
E-L equations to prove that E is conserved. Although we know very well from 
the F = ma methods in Chapter 5 that the sum T + V is conserved, it’s not fair to 
assume that it is conserved in our new Lagrangian formalism. We have to show 
that this follows from the E-L equations. 

As with the translation and rotation invariance we observed in the examples 
in Section 6.5.1, we see that energy conservation arises from time translation 
invariance. If the Lagrangian has no explicit t dependence, then the setup looks 
the same today as it did yesterday. This fact leads to conservation of energy. 


REMARK: The quantity Æ in Eq. (6.52) gives the energy of the system only if the entire system 
is represented by the Lagrangian. That is, the Lagrangian must represent a closed system with 
no external forces. If the system is not closed, then Claim 6.3 (or more generally, Eq. (6.54)) is 
still perfectly valid for the £ defined in Eq. (6.52), but this £ may simply not be the energy of 
the system. Problem 6.8 is a good example of such a situation. 

Another example is the following. Imagine a long rod in the horizontal x-y plane. The rod 
points in the x direction, and a bead is free to slide frictionlessly along it. At t = 0, an external 
machine is arranged to accelerate the rod in the negative y direction (that is, transverse to itself) 
with acceleration —g. So y = —gt. There is no internal potential energy in this system, so the 
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Lagrangian is just the kinetic energy, L = mx” /2 + m(gt)*/2. Equation (6.52) therefore gives 
E = mx? /2 — m(gt)*/2, which isn’t the energy. But Eq. (6.54) is still true, because 


dE ðL ise 24 2t «=> ž=0 (6.57) 
ae mxx — mg“ t = —mg aaa ` 


which is correct. However, this setup is exactly the same as projectile motion in the x-y plane, 
where y is now the vertical axis, provided that we eliminate the rod and consider gravity 
instead of the machine to be causing the acceleration in the y direction. But if we are thinking in 
terms of gravity, then the normal thing to do is to say that the particle moves under the influence 
of the potential V(y) = mgy. The Lagrangian for this closed system (bead plus earth) is 
L = m(x? +37)/2 — mgy, and so Eq. (6.52) gives E = m(x? + 37) /2 + mgy, which is indeed 
the energy of the particle. But having said all this, most of the systems we’ll deal with are 
closed, so you can usually ignore this remark and assume that the £ in Eq. (6.52) gives the 
energy. & 


6.6 Noether’s theorem 


We now present one of the most beautiful and useful theorems in physics. It deals 
with two fundamental concepts, namely symmetries and conserved quantities. 
The theorem (due to Emmy Noether) may be stated as follows. 


Theorem 6.4 (Noether’s theorem) For each symmetry of the Lagrangian, 
there is a conserved quantity. 


By “symmetry,” we mean that if the coordinates are changed by some small 
quantities, then the Lagrangian has no first-order change in these quantities. By 
“conserved quantity,” we mean a quantity that does not change with time. The 
result in Section 6.5.1 for cyclic coordinates is a special case of this theorem. 


Proof: Let the Lagrangian be invariant, to first order in the small number €, 
under the change of coordinates, 


qi — qi + €Ki(q). (6.58) 


Each K;(q) may be a function of all the q;, which we collectively denote by the 
shorthand, q. 


REMARK: As an example of what these K;’s might look like, consider the Lagrangian, L = 
(m/2)(5x* — 2x9 + 297) + C(2x — y). We’ve just pulled this out of a hat, although it happens to 
be the type of L that arises in Atwood’s machine problems; see Problem 6.9 and Exercise 6.40. 
This Ł is invariant under the transformation x > x + € and y > y + 2e, because the derivative 
terms are unaffected, and the difference 2x — y is unchanged. (It’s actually invariant to all 
orders in €, and not just first order. But this isn’t necessary for the theorem to hold.) Therefore, 
K, = 1 and K, = 2, which happen to be independent of the coordinates. In the problems we’ ll 
be doing, the K;’s can generally be determined by simply looking at the potential term. 

Of course, someone else might come along with K, = 3 and K, = 6, which is also a 
symmetry. And indeed, any factor can be taken out of € and put into the K;’s without changing 
the quantity €K;(q) in Eq. (6.58). Any such modification will just bring an overall constant 
factor (and hence not change the property of being conserved) into the conserved quantity in 
Eq. (6.61) below. It is therefore irrelevant. & 
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The fact that the Lagrangian does not change at first order in € means that 
dL OL oq; ƏL OG; 
ae (2 A) 
de f dqi 0€ dg; OE 
aL OL . 
= (Zx + ki) ; (6.59) 
i 


Using the E-L equation, Eq. (6.3), we can rewrite this as 


d [ƏL ƏL. 
0= K; K; 
>» (5(G5) eer ‘ 


i 


d aL 
=F (x ag . (6.60) 


i 


Therefore, the quantity 


; aL 
Pa d=) PRAC (6.61) 
. [A 
t 
does not change with time. It is given the generic name of conserved momentum. 
But it need not have the units of linear momentum. E 


As Noether most keenly observed 

(And for which much acclaim is deserved), 
It’s easy to see 

That for each symmetry, 

A quantity must be conserved. 


Example 1: Consider the Lagrangian in the above remark, L = (m/2) (5x? — 2xy+ 
2%?) + C(2x — y). We saw that Ky = 1 and Ky = 2. The conserved momentum is 
therefore 
_ aL aL oe ; : 
PO Y.x,y) = ane ae ay = m(5x — y)(1) + m(—x + 2y)(2) 
= m(3x + 3%). (6.62) 


The overall factor of 3m isn’t important. 


Example 2: Consider a thrown ball. We have L = (m/2) (x2 +32 +27) —mgz. This 
is invariant under translations in x, that is, x —> x + €; and also under translations in 
y, that is, y > y + €. (Both x and y are cyclic coordinates.) We need invariance only 
to first order in e for Noether’s theorem to hold, but this Z is invariant to all orders. 


237 


238 


The Lagrangian method 


We therefore have two symmetries in our Lagrangian. The first has Ky = 1, Ky = 0, 
and Kz = 0. The second has Ky = 0, Ky = 1, and Kz = 0. Of course, the nonzero 
K;’s here can be chosen to be any constants, but we may as well pick them to be 1. 
The two conserved momenta are 


Pi( ss, 8 fe ee E 3 
1%, Vs Z, X, Y, Z) = aA ay” an 


mx, 


(6.63) 

>.. OL, ƏL ab l 
Pox, y, Z, x, Y, Z) = -Kx + ~ Ky + = Kz = my. 
ox oy az 


These are simply the x and y components of the linear momentum, as we saw in 
the first example in Section 6.5.1. Note that any combination of these momenta, say 
3P + 8P, is also conserved. (In other words, x > x + 3€, y > y+ 8€,z —> zZ is 
a symmetry of the Lagrangian.) But the above P; and P2 are the simplest momenta 
to choose as a “basis” for the infinite number of conserved momenta (which is how 
many you have, if there are two or more independent continuous symmetries). 


Example 3: Consider a mass on a spring, with relaxed length zero, in the x-y plane. 
The Lagrangian, L = (m/ 2)(X2 + yr) — (k/2) (x2 + y»), is invariant under the change 
of coordinates, x —> x + ey and y > y — ex, to first order in € (as you can check). 


So we have Ky = y and Ky = —x. The conserved momentum is therefore 
La ðL OL : ; 
P(X, y, x, ý) = = Kx + Ky = my — yx). (6.64) 
ox ay 


This is the (negative of the) z component of the angular momentum. The angular 
momentum is conserved here because the potential, V (x, y) « x2 + y? = r, depends 
only on the distance from the origin. We’ll discuss such potentials in Chapter 7. 

In contrast with the first two examples above, the x > x+e€y, y > y—ex transfor- 
mation isn’t so obvious here. How did we get this? Well, unfortunately there doesn’t 
seem to be any fail-safe method of determining the K;’s in general, so sometimes you 
just have to guess around. But in many problems, the K;’s are simple constants which 
are easy to see. 


REMARKS: 


1. 


2. 


As we saw above, in some cases the K;’s are functions of the coordinates, and in some 
cases they are not. 

The cyclic-coordinate result in Eq. (6.48) is a special case of Noether’s theorem, for the 
following reason. If L doesn’t depend on a certain coordinate qg, then qk — qk + € is 
certainly a symmetry. Hence Kg = 1 (with all the other K;’s equal to zero), and Eq. (6.60) 
reduces to Eq. (6.48). 


. We use the word “symmetry” to describe the situation where the transformation in 


Eq. (6.58) produces no first-order change in the Lagrangian. This is an appropriate choice 
of word, because the Lagrangian describes the system, and 


For example, if we have a setup that doesn’t depend on @, then we say that the 
setup is symmetric under rotations. Rotate the system however you want, and it looks 
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the same. The two most common applications of Noether’s theorem are the conservation 
of angular momentum, which arises from symmetry under rotations; and conservation of 
linear momentum, which arises from symmetry under translations. 

4. Insimple systems, as in Example 2 above, it is clear why the resulting P is conserved. But 
in more complicated systems, as in Example 1 above, the resulting P might not have an 
obvious interpretation. But at least you know that it is conserved, and this will invariably 
help in understanding a setup. 

5. Although conserved quantities are extremely useful in studying a physical situation, it 
should be stressed that there is no more information contained in them than there is in 
the E-L equations. Conserved quantities are simply the result of integrating the E-L 
equations. For example, if you write down the E-L equations for Example | above, and 
then add the “x” equation (which is 5mx — my = 2C) to twice the “y” equation (which 
is —mx + 2my = —C), then you arrive at 3m(x + ý) = 0. In other words, 3m(x + y) is 
constant, as we found from Noether’s theorem. 

Of course, you might have to do some guesswork to find the proper combination of 
the E-L equations that gives a zero on the right-hand side. But you’d have to do some 
guesswork anyway, to find the symmetry for Noether’s theorem. At any rate, a conserved 
quantity is useful because it is an integrated form of the E-L equations. It puts you one 
step closer to solving the problem, compared with where you would be if you started with 
the second-order E—L equations. 

6. Does every system have a conserved momentum? Certainly not. The one-dimensional 
problem of a falling ball (mZ = —mg) doesn’t have one. And if you write down an 
arbitrary potential in 3-D, odds are that there won’t be one. In a sense, things have to 
contrive nicely for there to be a conserved momentum. In some problems, you can just 
look at the physical system and see what the symmetry is, but in others (for example, in 
the Atwood’s-machine problems for this chapter), the symmetry is not at all obvious. 

7. By “conserved quantity,” we mean a quantity that depends on (at most) the coordinates 
and their first derivatives (that is, not on their second derivatives). If we don’t make 
this restriction, then it is trivial to construct quantities that are independent of time. For 
example, in Example | above, the “x” E-L equation (which is 5mx — my = 2C) tells 
us that 5mx — my has its time derivative equal to zero. Note that an equivalent way of 
excluding these trivial cases is to say that the value of a conserved quantity depends on 
the initial conditions (that is, the velocities and positions). The quantity 5m% — my doesn’t 
satisfy this criterion, because its value is always constrained to be 2C.  & 
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In many physical systems, a particle undergoes small oscillations around an 
equilibrium point. In Section 5.2, we showed that the frequency of these small 


oscillations is 
y" 
w= 4j K" o) ; (6.65) 
m 


where V (x) is the potential energy, and xo is the equilibrium point. However, this 
result holds only for one-dimensional motion (we’ll see below why this is true). 
In more complicated systems, such as the one described below, it is necessary 
to use another procedure to obtain the frequency w. This procedure is a fail-safe 
one, applicable in all situations. It is, however, a bit more involved than simply 
writing down Eq. (6.65). So in one-dimensional problems, Eq. (6.65) is still what 
you want to use. We’ll demonstrate our fail-safe method through the following 
problem. 


239 


240 The Lagrangian method 


Problem: A mass m is free to slide on a frictionless table and is connected, via 
a string that passes through a hole in the table, to a mass M that hangs below (see 


Fig. 6.5). Assume that M moves in a vertical line only, and assume that the string 


always remains taut. 


Fig. 6.5 


(a) Find the equations of motion for the variables r and 0 shown in the figure. 

(b) Under what condition does m undergo circular motion? 

(c) What is the frequency of small oscillations (in the variable r) about this circular 
motion? 


Solution: 


(a) Let the string have length £ (this length won’t matter). Then the Lagrangian 
(we’ll call it “£” here, to save “L” for the angular momentum, which arises 
below) is 


1 1 : 
L= Me + zm? +702) + Mg- r). (6.66) 


For the purposes of the potential energy, we’ve taken the table to be at height 
zero, but any other value could be chosen. The E-L equations of motion 
obtained from varying 0 and r are 


d as 
— (mr^ 0) = 0, 
dt (6.67) 
(M +m)? = mr6? — Mg. 


The first equation says that angular momentum is conserved. The second 
equation says that the Mg gravitational force accounts for the acceleration of 
the two masses along the direction ofthe string, plus the centripetal acceleration 
ofm. 

(b) The first of Eqs. (6.67) says that mr26 = L, where L is some constant (the 
angular momentum) which depends on the initial conditions. Plugging 6 = 
L/mr? into the second of Eqs. (6.67) gives 


2 


L 
(M + m)? = — — Mg. (6.68) 


Circular motion occurs when 7 = 7 = 0. Therefore, the radius of the circular 
orbit is given by 

[2 
Mmg ` 


n= (6.69) 
Since L = mr6, Eq. (6.69) is equivalent to 


mroĝ? = Mg, (6.70) 
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which can also be obtained by letting 7 = 0 in the second of Eqs. (6.67). In 
other words, the gravitational force on M exactly accounts for the centripetal 
acceleration of m if the motion is circular. Given rg, Eq. (6.70) determines 


what 6 must be in order to have circular motion, and vice versa. 


(c 


wm 


To find the frequency of small oscillations about the circular motion, we need 
to look at what happens to r if we perturb it slightly from its equilibrium value 
of rọ. Our fail-safe procedure is the following. 

Let r(t) = ro + ô(t), where 5(f) is very small (more precisely, 5(t) < ro), 
and expand Eq. (6.68) to first order in 5(t). Using 


TO: a cee l S ‘(i D ai 
re (ro +8)3 fe + 3795 E rad + 3ô8/ro) 3 ro)” , 


"0 
we obtain 
. pP 35 
(M+m)5 x — |1- — Mg. (6.72) 
mro TO 


The terms not involving ô on the right-hand side cancel, by the definition of ro 
given in Eq. (6.69). This cancellation always occurs in such a problem at this 
stage, due to the definition of the equilibrium point. We are therefore left with 


312 
5+ | —— ] x0. (6.73) 
(M + m)mro 


This is a good old simple-harmonic-oscillator equation in the variable ô. 
Therefore, the frequency of small oscillations about a circle of radius rọ is 


3L2 3M Tg 
PEN 7= , (6.74) 
(M + m)mro M +m\ ro 


where we have used Eq. (6.69) to eliminate Z in the second expression. 


To sum up, the above frequency is the frequency of small oscillations in 
the variable r. In other words, if you have nearly circular motion, and if you 
plot r as a function of time (and ignore what @ is doing), then you will get a 
nice sinusoidal graph whose frequency is given by Eq. (6.74). Note that this 
frequency need not have anything to do with the other relevant frequency in this 
problem, namely the frequency of the circular motion, which is ./M/m./g/ro, 
from Eq. (6.70). 


Remarks: Let’s look at some limits. Fora given rọ, ifm >> M, thenw ~ ./3Mg/mro ~ 0. 
This makes sense, because everything is moving very slowly. This frequency equals /3 
times the frequency of circular motion, namely ./Mg/mro, which isn’t at all obvious. For 
a given ro, ifm < M, then w © ./3g/70, which isn’t so obvious, either. 

The frequency of small oscillations is equal to the frequency of circular motion if 
M = 2m, which, once again, isn’t obvious. This condition is independent of rọ. & 
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The above procedure for finding the frequency of small oscillations can be 
summed up in three steps: (1) find the equations of motion; (2) find the equilib- 
rium point; and (3) let x(t) = xo + ô (t), where xo is the equilibrium point of the 
relevant variable, and expand one of the equations of motion (or a combination 
of them) to first order in ô, to obtain a simple-harmonic-oscillator equation for ô. 
If the equilibrium point happens to be at x = 0 (which is often the case), then 
everything is greatly simplified. There is no need to introduce ô, and the expan- 
sion in the third step above simply entails ignoring powers of x that are higher 
than first order. 


REMARK: Ifyou just use the potential energy for the above problem (which is Mgr, up to an 
additive constant) in Eq. (6.65), then you will obtain a frequency of zero, which is incorrect. 
You can use Eq. (6.65) to find the frequency, if you instead use the “effective potential” for 
this problem, namely E j Q2mr?) + Mgr, and if you use the total mass, M + m, as the mass 
in Eq. (6.65), as you can check. The reason why this works will become clear in Chapter 7 
when we introduce the effective potential. In many problems, however, it isn’t obvious what 
the appropriate modified potential is that should be used, or what mass goes in Eq. (6.65). So 
it’s generally much safer to take a deep breath and go through an expansion similar to the one 
in part (c) of the example above. & 


The one-dimensional result in Eq. (6.65) is, of course, a special case of our 
above expansion procedure. We can repeat the derivation of Section 5.2 in the 
present language. In one dimension, the E-L equation of motion is mx = — V” (x). 
Let xo be the equilibrium point, so V’(xo) = 0. And let x(t) = xo + (£). 
Expanding mx = —V' (x) to first order in 5, we have mé = — V’ (xo) — V” (xo)8, 
plus higher-order terms. Since V’(xo) = 0, we have mô ~ —V"(xo)d, as 
desired. 


6.8 Other applications 


The formalism developed in Section 6.2 works for any function L(x, x, t). If our 
goal is to find the stationary points of S = f L, then Eq. (6.15) holds, no matter 
what L is. There is no need for L to be equal to T — V, or indeed, to have anything 
to do with physics. And ¢ need not have anything to do with time. All that is 
required is that the quantity x depend on the parameter ¢, and that L depend only 
on x, x, and ¢ (and not, for example, on x; see Exercise 6.34). The formalism is 
very general and powerful, as the following example demonstrates. 


Example (Minimal surface of revolution): A surface of revolution has two 
given parallel rings as its boundary; see Fig. 6.6. What should the shape of the surface 
be so that it has the minimum possible area? We will present three solutions. A fourth 
is left for Problem 6.22. 


First solution: Let the surface be generated by rotating the curve y = y(x) around 
the x axis. The boundary conditions are y(a,) = cı and y(az) = c2; see Fig. 6.7. 
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Slicing the surface up into vertical rings, we see that the area is given by 


a 
A= / Qny 1 +y? dx. (6.75) 
al 


Our goal is to find the function y(x) that minimizes this integral. We therefore have 
exactly the same situation as in Section 6.2, except that x is now the parameter (instead 
oft), andy is now the function (instead of x). Our “Lagrangian” is thus L œ yy 1 + y’. 
To minimize the integral A, we “simply” have to write down the E-L equation, 


d (aL ðL 
—(—)=—, (6.76) 
dx \ ay’ ay 


and calculate the derivatives. This calculation however, gets a bit tedious, so I’ve 
relegated it to Lemma 6.5 at the end of this section. For now we’ll just use the result 
in Eq. (6.86) which gives (with f (y) = y here) 


1+y? = By’. (6.77) 


At this point we can cleverly guess (motivated by the fact that 1 + sinh?z = cosh”z) 
that the solution is 


1 
y(x) = z cosh b(x + d), (6.78) 
where b = VB, and d is a constant of integration. Or we can separate variables to 
obtain (again with b = V/B) 
dy 


Vo= 


and then use the fact that the integral of 1/./z2 — 1 is cosh! z, to obtain the same 
result. The answer to our problem, therefore, is that y(x) takes the form of Eq. (6.78), 


dx = (6.79) 


with b and d determined by the boundary conditions, 
1 1 
q= 7 cosh b(a] +d), and c = 5 cosh b(a? + d). (6.80) 


In the symmetrical case where c} = c2, we know that the minimum occurs in the 
middle, so we may choose d = 0 and a] = —ap. 

Solutions for b and d exist only for certain ranges of the a’s and c’s. Basically, if 
az — ay is too large, then there is no solution. In this case, the minimal “surface” turns 
out to be the two given circles, attached by a line (which isn’t a nice two-dimensional 
surface). If you perform an experiment with soap bubbles (which want to minimize 
their area), and if you pull the rings too far apart, then the surface will break and 
disappear as it tries to form the two circles. Problem 6.23 deals with this issue. 


Second solution: Consider the curve that we rotate around the x axis to be 
described now by the function x(y). That is, let x be a function of y. The area is 
then given by 


c2 
a=] 2ryv 1 +x’? dy, (6.81) 
C. 


1 
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where x’ = dx/dy. Note that the function x) may be double-valued, so it may not 
really be a function. But it looks like a function locally, and all of our formalism deals 
with local variations. 

Our “Lagrangian” is now L œ yy 1 + x, and the E-L equation is 


d (aL ðL d yx’ 
LIAE ees ee, 6.82 
(=) ox al =) (6.32) 


The nice thing about this solution is the “0” on the right-hand side, which arises 
from the fact that L doesn’t depend on x (that is, x is a cyclic coordinate). Therefore, 
yx! //1 + x’? is constant. If we define this constant to be 1/b, then we can solve for 
x’ and then separate variables to obtain 


dy 


Vey =1" 


in agreement with Eq. (6.79). The solution proceeds as above. 


dx = (6.83) 


Third solution: The “Lagrangian” in the first solution above, L œ yJ1 + y?, is 
independent of x. Therefore, in analogy with conservation of energy (which arises 
from a Lagrangian that is independent of t), the quantity 


OL yy -y 
E=y ban yy 1 +y? = —— (6.84) 
ay’ /1 +y? /1 +y? 


is constant (that is, independent of x). This statement is equivalent to Eq. (6.77), and 
the solution proceeds as above. As demonstrated by the brevity of the second and 
third solutions here, it is highly advantageous to make use of conserved quantities 
whenever you can. 


Let us now prove the following lemma, which we invoked in the first solution 
above. This lemma is very useful, because it is common to encounter problems 
where the quantity to be extremized depends on the arclength, y 1 +, and takes 
the form of f f(y). 1 + y? dx. We will give two proofs. The first proof uses the 
Euler-Lagrange equation. The calculation gets a bit messy, so it’s a good idea to 
work through it once and for all and then just invoke the result whenever needed. 
This derivation isn’t something you’d want to repeat too often. The second proof 
makes use of a conserved quantity. And in contrast with the first proof, this 
method is exceedingly clean and simple. It actually is something you’d want to 
repeat quite often. But we’ll still do it once and for all. 


Lemma 6.5 Let f(y) be a given function of y. Then the function y(x) that 


extremizes the integral, 
x2 
JOW 1+ y? dx, (6.85) 
Ka 
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satisfies the differential equation, 


1 +y? = By)’, (6.86) 


where B is a constant of integration. |! 


First proof: Our goal is to find the function y(x) that extremizes the integral 
in Eq. (6.85). We therefore have exactly the same situation as in Section 6.2, 
except with x in place of ¢, and y in place of x. Our “Lagrangian” is thus L = 
fO)V1+2, and the Euler-Lagrange equation is 


d (aL aL d , 1 ; 
bal E Ween ffy — |=, (6387 
TA ( z) ay = oe ( T ra >) f'vity (6.87) 


where f’ =df /dy. We must now perform some straightforward (albeit tedious) 
differentiations. Using the product rule on the three factors on the left-hand side, 
and making copious use of the chain rule, we obtain 


f'y? p" pry E 
aft +y? V1 +y? a + y2)3/2 


fifty? (6.88) 


Multiplying through by (1 + y?)/* and simplifying gives 


p= +y. (6.89) 


We have completed the first step of the proof, namely producing the Euler- 
Lagrange differential equation. We must now integrate it. Equation (6.89) 
happens to be integrable for arbitrary functions f (y). If we multiply through 
by y’ and rearrange, we obtain 


yy" B fy 


1472 f i 


Taking the dx integral of both sides gives (1/2) In(1 + y?) = In(f) + C, where 
C is a constant of integration. Exponentiation then gives (with B = e*©) 


1 +y? = Boy, (6.91) 


as we wanted to show. In an actual problem, we would solve this equation for y’, 
and then separate variables and integrate. But we would need to be given a specific 
function f (y) to be able to do this. 


1L The constant B and also one other constant of integration (arising when Eq. (6.86) is integrated to 
obtain y) are determined by the boundary conditions on y(x); see, for example, Eq. (6.80). This 
situation, where the two constants are determined by the values of the function at two points, is 
slightly different from the situation in the physics problems we’ve done where the two constants 
are determined by the value (that is, the initial position) and the slope (that is, the speed) at just 
one point. But either way, two given facts must be used. 
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Second proof: Our “Lagrangian,” L = f @)y 1+ y”, is independent of x. 
Therefore, in analogy with the conserved energy given in Eq. (6.52), the quantity 


par Te =f 0) 


ay’ — fl+y? 


is independent of x. Call it 1/./B. Then we have easily reproduced Eq. (6.91). 
For practice, you can also prove this lemma by considering x to be a function of 
y, as we did in the second solution in the minimal-surface example above. E 


<_ a 6.9 Problems 


Section 6.1: The Euler-Lagrange equations 


(6.92) 


6.1. Moving plane «« 


Fig. 6.8 
: i A block of mass m is held motionless on a frictionless plane of mass M 
y £ and angle of inclination 8 (see Fig. 6.8). The plane rests on a frictionless 
mN horizontal surface. The block is released. What is the horizontal accel- 
; eration of the plane? (This problem already showed up as Problem 3.8. 
r If you haven’t already done so, try solving it using F = ma. You will 
m then have a greater appreciation for the Lagrangian method.) 
j 


6.2. Two falling sticks +x 


Two massless sticks of length 2r, each with a mass m fixed at its middle, 
are hinged at an end. One stands on top of the other, as shown in Fig. 6.9. 
The bottom end of the lower stick is hinged at the ground. They are held 

E ‘such that the lower stick is vertical, and the upper one is tilted at a 
small angle e with respect to the vertical. They are then released. At this 
instant, what are the angular accelerations of the two sticks? Work in 
the approximation where € is very small. 


Fig. 6.9 


l 6.3. Pendulum with an oscillating support +» 


A pendulum consists of a mass m and a massless stick of length £. 


m 


see Fig. 6.10. What is the general solution for the 
angle of the pendulum as a function of time? 


Fig. 6.10 


6.4. Two masses, one swinging x«« 


Two equal masses m, connected by a massless string, hang over two 
pulleys (of negligible size), as shown in Fig. 6.11. The left one moves 
in a vertical line, but the right one is free to swing back and forth in the 
plane of the masses and pulleys. Find the equations of motion for r and 
6, as shown. 

Assume that the left mass starts at rest, and the right mass undergoes 
Fig. 6.11 small oscillations with angular amplitude e (with € < 1). What is the 
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initial average acceleration (averaged over a few periods) of the left 
mass? In which direction does it move? m 


6.5. kikk 


A pendulum consists of a mass m at the end of a massless stick of length 
£. The other end ofthe stick is made to oscillate vertically with a position 
given by y(t) = Acos(wt), where A < £. See Fig. 6.12. It turns out | 
that if w is large enough, and if the pendulum is initially nearly upside- 
down, then surprisingly it will not fall over as time goes by. Instead, it Fig. 6.12 
will (sort of) oscillate back and forth around the vertical position. If you 
want to do the experiment yourself, see the 28th demonstration of the 
entertaining collection in Ehrlich (1994). 
Find the equation of motion for the angle of the pendulum (measured 
relative to its upside-down position). Explain why the pendulum doesn’t 
fall over, and find the frequency of the back and forth motion. 


Section 6.2: The principle of stationary action 


6.6. Minimum or saddle «« 


(a) In Eq. (6.26), let fi = 0 and H = T, for convenience. And let 
E(t) be an easy-to-deal-with “triangular” function, of the form 


et/T, 0<1<T/2, 


a=), TJ2<t<T. a 


s=] 


Under what condition is the harmonic-oscillator AS in Eq. (6.26) 
negative? 


(b) Answer the same question, but now with £(t) = e sin(xt/T). 


Section 6.3: Forces of constraint 


6.7. Normal force from a plane ++ 


A mass m slides down a frictionless plane that is inclined at an angle 8. 
Show, using the method in Section 6.3, that the normal force from the 
plane is the familiar mg cos 0. 


Section 6.5: Conservation laws 


6.8. Bead on a stick « 


A stick is pivoted at the origin and is arranged to swing around in a 
horizontal plane at constant angular speed w. A bead of mass m slides 
frictionlessly along the stick. Let r be the radial position of the bead. 
Find the conserved quantity E given in Eq. (6.52). Explain why this 
quantity is not the energy of the bead. 


248 The Lagrangian method 
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Fig. 6.15 
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Fig. 6.16 
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Fig. 6.17 
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Section 6.6: Noether 5 theorem 


Atwood’s machine «« 

Consider the Atwood’s machine shown in Fig. 6.13. The masses are 
4m, 3m, and m. Let x and y be the heights of the left and right masses, 
relative to their initial positions. Find the conserved momentum. 


Section 6.7: Small oscillations 


Q 6.10. 


Hoop and pulley +» 


A mass M is attached to a massless hoop of radius R that lies in a vertical 
plane. The hoop is free to rotate about its fixed center. M is tied to a 
string which winds part way around the hoop, then rises vertically up 
and over a massless pulley. A mass m hangs on the other end of the string 
(see Fig. 6.14). Find the equation of motion for the angle of rotation of 
the hoop. What is the frequency of small oscillations? Assume that m 
moves only vertically, and assume M > m. 


. Bead on a rotating hoop +» 


A bead is free to slide along a frictionless hoop of radius R. The hoop 
rotates with constant angular speed w around a vertical diameter (see 
Fig. 6.15). Find the equation of motion for the angle 0 shown. What are 
the equilibrium positions? What is the frequency of small oscillations 
about the stable equilibrium? There is one value of œ that is rather 
special; what is it, and why is it special? 


Another bead on a rotating hoop ++ 


A bead is free to slide along a frictionless hoop of radius r. The plane of 
the hoop is horizontal, and the center of the hoop travels in a horizontal 
circle of radius R, with constant angular speed w, about a given point 
(see Fig. 6.16). Find the equation of motion for the angle 6 shown. Also, 
find the frequency of small oscillations about the equilibrium point. 


Mass on a wheel xx 


A mass m is fixed to a given point on the rim of a wheel of radius R that 
rolls without slipping on the ground. The wheel is massless, except for 
amass M located at its center. Find the equation of motion for the angle 
through which the wheel rolls. For the case where the wheel undergoes 
small oscillations, find the frequency. 


Pendulum with a free support x» 


A mass M is free to slide along a frictionless rail. A pendulum of length £ 
and mass m hangs from M (see Fig. 6.17). Find the equations of motion. 
For small oscillations, find the normal modes and their frequencies. 


6.15. 


6.16. 


6.17. 


6.18. 


6.19. 
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Pendulum support on an inclined plane x+» 


A mass M is free to slide down a frictionless plane inclined at an 
angle 6. A pendulum of length £ and mass m hangs from M; 
see Fig. 6.18 (assume that M extends a short distance beyond the side 
of the plane, so the pendulum can hang down). Find the equations 
of motion. For small oscillations, find the normal modes and their 
frequencies. 


Tilting plane xx» 


A mass M is fixed at the right-angled vertex where a massless rod of 
length £ is attached to a very long massless rod (see Fig. 6.19). A mass m 
is free to move frictionlessly along the long rod (assume that it can pass 
through M). The rod of length £ is hinged at a support, and the whole 
system is free to rotate, in the plane of the rods, about the hinge. Let 0 
be the angle of rotation of the system, and let x be the distance between 
mand M. Find the equations of motion. Find the normal modes when 0 
and x are both very small. 


Rotating curve xxx 


The curve y(x) = b(x/a)* is rotated around the y axis with constant 
frequency w (see Fig. 6.20). A bead moves frictionlessly along the curve. 
Find the frequency of small oscillations about the equilibrium point. 
Under what conditions do oscillations exist? (This problem gets a little 
messy.) 


Motion in a cone xxx 


A particle slides on the inside surface of a frictionless cone. The cone 
is fixed with its tip on the ground and its axis vertical. The half-angle 
at the tip is a (see Fig. 6.21). Let r be the distance from the particle to 
the axis, and let 0 be the angle around the cone. Find the equations of 
motion. 

If the particle moves in a circle of radius ro, what is the frequency, œ, 
of this motion? If the particle is then perturbed slightly from this circular 
motion, what is the frequency, Q, of the oscillations about the radius ro? 
Under what conditions does Q = w? 


Double pendulum x«x« 


Consider a double pendulum made of two masses, mı and m2, and two 
rods of lengths £; and £2 (see Fig. 6.22). Find the equations of motion. 

For small oscillations, find the normal modes and their frequencies 
for the special case 0; = £2 (and consider the cases mı = m2, mı >> mp, 
and mı < m2). Do the same for the special case mı = mz (and consider 
the cases £; = 42, 41 > €2, and 41 < £2). 


Fig. 6.21 
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Fig. 6.22 
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Section 6.8: Other applications 


Shortest distance in a plane « 


In the spirit of Section 6.8, show that the shortest path between two 
points in a plane is a straight line. 


Index of refraction «+ 


Assume that the speed of light in a given slab of material is proportional 
to the height above the base of the slab.'* Show that light moves in 
circular arcs in this material; see Fig. 6.23. You may assume that light 
takes the path of least time between two points (Fermat’s principle of 
least time). 


Minimal surface xx 


Derive the shape of the minimal surface discussed in Section 6.8, by 
demanding that a cross-sectional “ring” (that is, the region between the 
planes x = x; and x = x2) is in equilibrium; see Fig. 6.24. Hint: The 
tension must be constant throughout the surface (assuming that we’re 
ignoring gravity, which we are). 


Existence of a minimal surface «« 


Consider the minimal surface from Section 6.8, and look at the special 
case where the two rings have the same radius r (see Fig. 6.25). Let 
2£ be the distance between the rings. What is the largest value of £/r 
for which a minimal surface exists? You will need to solve something 
numerically here. 


The brachistochrone xxx 


A bead is released from rest at the origin and slides down a frictionless 
wire that connects the origin to a given point, as shown in Fig. 6.26. 
You wish to shape the wire so that the bead reaches the endpoint in the 
shortest possible time. Let the desired curve be described by the function 
y(x), with downward taken to be positive. Show that y(x) satisfies 


(6.94) 


where B is a constant. Then show that x and y may be written as 


x=a(O—sin@), y= a(l — cos8). (6.95) 


12 Tf you want to make the equivalent statement in terms of the material’s “index of refraction,” 
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commonly denoted by n, then you can say: As a function of the height y, the index n is given by 


ay) = 


yo/y, where yo is some length that is larger than the height of the slab. This is equivalent 


to the original statement because the speed of light in a material equals c/n. 


6.10 


6.10 Exercises 


This is the parametrization of a cycloid, which is the path taken by a 
point on the rim of a rolling wheel. 


Exercises 


Section 6.1: The Euler-Lagrange equations 


6.25. 


6.26. 


6.27. 


6.28. 


Spring ona T xx 

A rigid T consists of a long rod glued perpendicular to another rod of 
length £ that is pivoted at the origin. The T rotates around in a horizontal 
plane with constant frequency w. A mass m is free to slide along the 
long rod and is connected to the intersection of the rods by a spring 
with spring constant k and relaxed length zero (see Fig. 6.27). Find r(t), 
where r is the position of the mass along the long rod. There is a special 
value of w; what is it, and why is it special? 


Spring on a T, with gravity +x» 

Consider the setup in the previous exercise, but now let the T swing 
around in a vertical plane with constant frequency w. Find r(t). There is 
a special value of w; what is it, and why is it special? (You may assume 


æ < k/m.) 


Coffee cup and mass x» 


A coffee cup of mass M is connected to a mass m by a string. The coffee 
cup hangs over a frictionless pulley of negligible size, and the mass m 
is initially held with the string horizontal, as shown in Fig. 6.28. The 
mass m is then released. Find the equations of motion for r (the length 
of string between m and the pulley) and 0 (the angle that the string to 
m makes with the horizontal). Assume that m somehow doesn’t run into 
the string holding the cup up. 

The coffee cup will initially fall, but it turns out that it will reach a 
lowest point and then rise back up. Write a program (see Section 1.4) 
that numerically determines the ratio of the r at this lowest point to the r 
at the start, for a given value of m/M. (To check your program, a value 
of m/M = 1/10 yields a ratio of about 0.208.) 


Three falling sticks +x» 


Three massless sticks of length 2r, each with a mass m fixed at its 
middle, are hinged at their ends, as shown in Fig. 6.29. The bottom 
end of the lower stick is hinged at the ground. They are held such that 
the lower two sticks are vertical, and the upper one is tilted at a small 
angle e with respect to the vertical. They are then released. At this 
instant, what are the angular accelerations of the three sticks? Work 


Fig. 6.27 


Fig. 6.28 
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(top view) 


Fig. 6.29 
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6.29. 


in the approximation where € is very small. (You may want to look at 
Problem 6.2 first.) 


Cycloidal pendulum «*«« 


The standard pendulum frequency of ./g/@ holds only for small oscil- 
lations. The frequency becomes smaller as the amplitude grows. It turns 
out that if you want to build a pendulum whose frequency is independent 
of the amplitude, you should hang it from the cusp of a cycloid of a cer- 
tain size, as shown in Fig. 6.30. As the string wraps partially around the 
cycloid, the effect is to decrease the length of string in the air, which in 
turn increases the frequency back up to a constant value. In more detail: 

A cycloid is the path taken by a point on the rim of a rolling wheel. 
The upside-down cycloid in Fig. 6.30 can be parametrized by (x,y) = 
R(@—sin6, —1+cos@), where 0 = 0 corresponds to the cusp. Consider 
a pendulum of length 4R hanging from the cusp, and let œ be the angle 
the string makes with the vertical, as shown. 


(a) In terms of a, find the value of the parameter 0 associated with 
the point where the string leaves the cycloid. 

(b) In terms of a, find the length of string touching the cycloid. 

(c) In terms of a, find the Lagrangian. 

(d) Show that the quantity sin œ undergoes simple harmonic motion 
with frequency ./g/4R, independent of the amplitude. 

(e) In place of parts (c) and (d), solve the problem again by using 
F = ma. This actually gives a much quicker solution. 


Section 6.2: The principle of stationary action 


6.30. 


6.31. 


6.32. 


Dropped ball x 


Consider the action, from ¢ = 0 to t = 1, of a ball dropped from rest. 
From the E-L equation (or from F = ma), we know that y(t) = —gt? /2 
yields a stationary value of the action. Show explicitly that the particular 
function y(t) = —gt*/2+et(t—1) yields an action that has no first-order 
dependence on €. 


Explicit minimization « 

For a ball thrown upward, guess a solution for y of the form y(t) = 
ant? + aıt + ag. Assuming that y(0) = y(T) = 0, this quickly becomes 
y(t) = an(t? — Tt). Calculate the action between t = 0 and t = T, and 
show that it is minimized when az = —g/2. 

Always a minimum «x 


For a ball thrown up in the air, show that the stationary value of the 
action is always a global minimum. 


6.33. 


6.34. 


6.10 Exercises 


Second-order change x 


Let xg(t) = xo(t) + aB(t). Equation (6.19) gives the first derivative of 
the action with respect to a. Show that the second derivative is 


d? 2 (3L aL. PL. 
—~S[xq(t)] = = y —— 6? | dt. (6.96 
L Sa] / (FP +20 BB+ 5 ) (6.96) 


1 


x dependence x 


Assume that there is x dependence (in addition to x, x, t dependence) in 
the Lagrangian in Theorem 6.1. There will then be the additional term 
(ƏL/ðža)Ë in Eq. (6.19). It is tempting to integrate this term by parts 
twice, and then arrive at a modified form of Eq. (6.22): 


ðL d (aL ree ab) _ (697) 
0x0 dt \ 0x0 dt? 3X0 as i 


Is this a valid result? If not, where is the error in the reasoning? 


Section 6.3: Forces of constraint 


6.35. 


6.36. 


6.37. 


6.38. 


Constraint on a circle x 

A bead of mass m slides with speed v around a horizontal hoop of 
radius R. What force does the hoop apply to the bead? (Ignore gravity.) 
Atwood’s machine x 

Consider the standard Atwood’s machine in Fig. 6.31, with masses mı 
and m2. Find the tension in the string. 

Cartesian coordinates x» 


In Eq. (6.35), take two time derivatives of the yx? +y? — R = 0 
equation to obtain 


Raž + yp) + (xp — yi)? = 0, (6.98) 


and then combine this with the other two equations to solve for F in terms 
of x, y, x, y. Convert the result to polar coordinates (with 0 measured 
from the vertical) and show that it agrees with Eq. (6.32). 


Constraint on a curve «*« 


Let the horizontal plane be the x-y plane. A bead of mass m slides with 
speed v along a curve described by the function y = f(x). What force 
does the curve apply to the bead? (Ignore gravity.) 


Fig. 6.31 
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Section 6.5: Conservation laws 


6.39. 


Bead on stick, using F = ma « 
After doing Problem 6.8, show again that the quantity E£ is conserved, 
but now use F = ma. Do this in two ways: 
(a) Use the first of Eqs. (3.51). Hint: multiply through by 7. 
(b) Use the second of Eqs. (3.51) to calculate the work done on the 
bead, and use the work—energy theorem. 


Section 6.6: Noether s theorem 


Atwood’s machine +» 


Consider the Atwood’s machine shown in Fig. 6.32. The masses are 
4m, 5m, and 3m. Let x and y be the heights of the right two masses, 
relative to their initial positions. Use Noether’s theorem to find the 
conserved momentum. (The solution to Problem 6.9 gives some other 
methods, too.) 


Section 6.7: Small oscillations 


6.40. 
4m 
x y 
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Fig. 6.32 
6.41. 
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6.42. 
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6.43. 
6.44. 


Fig. 6.34 


Spring and a wheel « 


The top of a wheel of mass M and radius R is connected to a spring 
(at its equilibrium length) with spring constant k, as shown in Fig. 6.33. 
Assume that all the mass of the wheel is at its center. If the wheel rolls 
without slipping, what is the frequency of (small) oscillations? 


Spring on a spoke x» 


A spring with spring constant k and relaxed length zero lies along a spoke 
of a massless wheel of radius R. One end of the spring is attached to the 
center, and the other end is attached to a mass m that is free to slide along 
the spoke. When the system is in its equilibrium position with the spring 
hanging vertically, how far (in terms of R) should the mass hang down 
(you are free to adjust x) so that for small oscillations, the frequency of 
the spring oscillations equals the frequency of the rocking motion of the 
wheel? Assume that the wheel rolls without slipping. 


Oscillating hoop «* 


Two equal masses are glued to a massless hoop of radius R that is free to 
rotate about its center in a vertical plane. The angle between the masses 
is 20, as shown in Fig. 6.34. Find the frequency of small oscillations. 


Oscillating hoop with a pendulum ++» 


A massless hoop of radius R is free to rotate about its center in a vertical 
plane. A mass m is attached at one point, and a pendulum of length /2R 
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(and also of mass m) is attached at another point 90° away, as shown in 
Fig. 6.35. Let @ be the angle of the hoop relative to the position shown, 
and let «œ be the angle of the pendulum with respect to the vertical. Find 
the normal modes for small oscillations. 


6.45. Mass sliding on a rim «x 


A mass m is free to slide frictionlessly along the rim of a wheel of radius R 
that rolls without slipping on the ground. The wheel is massless, except 
for a mass M located at its center. Find the normal modes for small 
oscillations. 


6.46. Mass sliding on a rim, with a spring +x» 


Consider the setup in the previous exercise, but now let the mass m be fig. 6.35 
attached to a spring with spring constant k and relaxed length zero, 

the other end of which is attached to a point on the rim. Assume 

that the spring is constrained to run along the rim, and assume that 

the mass can pass freely over the point where the spring is attached 

to the rim. To keep things from getting too messy here, you can 

set M =m. 


(a) Find the frequencies of the normal modes for small oscillations. 
Check the g = 0 limit, and (if you’ ve done the previous exercise) 
the k = 0 limit. 

(b) For the special case where g/R = k/m, show that the frequencies 
can be written as Vk/m (/5 + 1)/2. This numerical factor is the 
golden ratio (and its inverse). Describe what the normal modes 
look like. 


6.47. Vertically rotating hoop +*+» 


A bead is free to slide along a frictionless hoop of radius r. The plane 
of the hoop is vertical, and the center of the hoop travels in a vertical 
circle of radius R with constant angular speed œw about a given point 
(see Fig. 6.36). Find the equation of motion for the angle 6 shown. For 
large w (which implies small 0), find the amplitude of the “particular” 
solution with frequency w. What happens if r = R? 


6.11 Solutions — ` a 


6.1. Moving plane 


(side view) 


Fig. 6.36 


Let x; be the horizontal coordinate of the plane (with positive x; to the left), and let x2 
be the horizontal coordinate of the block (with positive x2 to the right); see Fig. 6.37. 
The relative horizontal distance between the plane and the block is x; + x2, so the 
height fallen by the block is (x; + x2) tan 0. The Lagrangian is therefore 


Fig. 6.37 


l 1 
L= 5Mit4 zr b Gi +i)? tan?) + mg(x1 + x2) tan 0. (6.99) 
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6.2. 


6.3. 


The equations of motion obtained from varying x; and x2 are 


Mx, +m Xz) tan?0 = mg tan 6, 
(6.100) 


mx2 + mx ¥2) tan?0 = mg tan 0. 


Note that the difference of these two equations immediately yields conservation of 
momentum, Mx, — m% = 0 => (d/dt)(Mx, — mx2) = 0. Equations (6.100) are two 
linear equations in the two unknowns, ¥ı and X2, so we can solve for X;. After a little 
simplification, we arrive at 

mg sin 0 cos @ 


x) = —. (6.101) 
i M + msin?0 


For some limiting cases, see the remarks in the solution to Problem 3.8. 


Two falling sticks 

Let 81 (t) and 62 (t) be defined as in Fig. 6.38. Then the position of the bottom mass 
in Cartesian coordinates is (r sin 01,r cos 01), and the position of the top mass is 
(2r sin 01 — r sin 62, 2r cos 01 + r cos 02). So the potential energy of the system is 


V (01,02) = mgr (3 cos 01 + cos 62). (6.102) 


The kinetic energy is somewhat more complicated. The kinetic energy of the bottom 
mass is simply mr6? /2. Taking the derivative of the top mass’s position given above, 
we find that the kinetic energy of the top mass is 


1 i : , : 
zm? (e cos 0161 — cos 262)? + (—2 sin 616, — sin 6262)°). (6.103) 


We can simplify this, using the small-angle approximations. The terms involving sin 0 
are fourth order in the small 6’s, so we can neglect them. Also, we can approximate 
cos @ by 1, because this entails dropping only terms of at least fourth order. So the 
top mass’s kinetic energy becomes (1 /2)mr? (26 = 62). In retrospect, it would have 
been easier to obtain the kinetic energies of the masses by first applying the small-angle 
approximations to the positions, and then taking the derivatives to obtain the velocities. 
This strategy shows that both masses move essentially horizontally (initially). You will 
probably want to use this strategy when solving Exercise 6.28. 

Using the small-angle approximation cos@ œ% 1 — 67/2 to rewrite the potential 
energy in Eq. (6.102), we have 


l Poe are Ja 4 
La 5mr(sé? 46162 + 62) mer (4 55% 302). (6.104) 


The equations of motion obtained from varying 6; and 62 are, respectively, 


ok 
56, — 2 = 4, 
i (6.105) 


—26; +65 = Eo, 
r 


At the instant the sticks are released, we have 6, = 0 and 6) = €. Solving Eq. (6.105) 
for 0; and 62 gives 
3 € ge Sge 

d= —, and h=. (6.106) 

r 

Pendulum with an oscillating support 
Let 6 be defined as in Fig. 6.39. With x(t) = A cos(wf), the position of the mass m is 
given by 


(X,Y)m = x + £sin 0, —£ cos 0). (6.107) 


6.4. 


6.11 Solutions 


Taking the derivative to obtain the velocity, we find that the square of the speed is 


V2 = X? 4 97? = 767 +57 + 256 cos, (6.108) 


which also follows from applying the law of cosines to the horizontal x and tangential 
£6 parts of the velocity vector. The Lagrangian is therefore 


1 ; : 
L= zmei +i? + 2&0 cos 0) + mgt cos. (6.109) 
The equation of motion for 6 is 


d ; ; 
7 (ml? + mex cos 6) = —mlX6 sin 0 — mgt sind 


LÖ +%cos@ = —g sing. (6.110) 
Plugging in the explicit form of x(t), we have 
LÖ — Aw cos(wt) cos 6 + g sind = 0. (6.111) 


In retrospect, this makes sense. Someone in the reference frame of the support, which 
has horizontal acceleration ¥ = —Aw* cos(wt), may as well be living in a world 
where the acceleration from gravity has a component g downward and a component 
Aw? cos(wt) to the right. Equation (6.111) is just the F = ma equation in the tangential 
direction in this accelerating world. 

A small-angle approximation in Eq. (6.111) gives 


6 + a0 = aa’ cos(wt), (6.112) 


where wọ = /g/€ and a = A/€. This equation is simply the equation for a driven 
oscillator, which we solved in Chapter 4. The solution is 


A(t) = 


— 5 cos(wt) + C cos(wot + ), (6.113) 
WO —@ 


0 


where C and ¢ are determined by the initial conditions. 

Ifw happens to equal wo, then it appears that the amplitude goes to infinity. However, 
as soon as the amplitude becomes large, our small-angle approximation breaks down, 
and Eqs. (6.112) and (6.113) are no longer valid. 


Two masses, one swinging 
The Lagrangian is 
1 1 ; 
L= gmi? + z"? + 1767) — mgr + mgr cos 0. (6.114) 


The last two terms are the (negatives of the) potentials of each mass, relative to where 
they would be if the right mass were located at the right pulley. The equations of motion 
obtained from varying r and 0 are 


27 = rò? — g(1 —cos@), 
d 3, (6.115) 
—(r°0) = —gr sin 0. 
P7 (r0) gr sin 


The first equation deals with the forces and accelerations along the direction of the 
string. The second equation equates the torque from gravity with the change in angular 
momentum of the right mass. If we do a (coarse) small-angle approximation and keep 
only terms up to first order in 6, we find that at t = 0 (using the initial condition, 
+ = 0), Eqs. (6.115) become 


r=0, 


. (6.116) 
ö+Ë0=0. 
p 
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Fig. 6.40 


The Lagrangian method 


6.5. 


These equations say that the left mass stays still, and the right mass behaves just like 
a pendulum. 

If we want to find the leading term in the initial acceleration of the left mass (that 
is, the leading term in 7’), we need to be a little less coarse in our approximation. So 
let’s keep terms in Eq. (6.115) up to second order in 0. We then have at t = 0 (using 
the initial condition, + = 0) 


z Í 
2% = r6? — 580 
(6.117) 


£ 


ö+20=0. 
5 


The second equation still says that the right mass undergoes harmonic motion. We are 
told that the amplitude is €, so we have 


O(t) = €cos(wt + ¢), (6.118) 
where w = ,/g/r. Plugging this into the first equation gives 
* 2 +2 1 2 
27 = eg | sinf (œt +) — z 00s (at+¢)). (6.119) 
If we average this over a few periods, both sin?« and cos?q average to 1/2, so we find 
2 
Pws = A (6.120) 


This is a small second-order effect. It is positive, so the left mass slowly begins to 
climb. 


Inverted pendulum 


Let 0 be defined as in Fig. 6.40. With y(t) = A cos(œwt), the position of the mass m is 
given by 


(X,Y) = (€sin6, y+ £cos@). (6.121) 
Taking the derivative to obtain the velocity, we find that the square of the speed is 


V? = X? 4 ¥? = 076? +3? eyð sind, (6.122) 


which also follows from applying the law of cosines to the vertical } and tangential £6 
parts of the velocity vector. The Lagrangian is therefore 


1 f . 
L= 5m lO +5? — 2¢)6 sin 6) — mg (y + £cos8). (6.123) 
The equation of motion for 0 is 
ol 0L ia £6 — ï sinð in (6.124) 
AETI 06 ysm = gsing. i 


Plugging in the explicit form of y(t), we have 
tö + sin 6 (40? cos(wt) — g) =0. (6.125) 


In retrospect, this makes sense. Someone in the reference frame of the support, which 
has vertical acceleration ý = —Aw* cos(wt), may as well be living in a world where 
the acceleration from gravity is g — Aw* cos(wf) downward. Equation (6.125) is just 
the F = ma equation in the tangential direction in this accelerating world. 

Assuming 0 is small, we may set sin@ ~ 0, which gives 


a+ (aw? cos(wt) — #) =0, (6.126) 


6.11 Solutions 
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Fig. 6.42 


where wo = /g/€ and a = A/€. Equation (6.126) cannot be solved exactly, but we 
can still get a good idea of how @ depends on time. We can do this both numerically 
and (approximately) analytically. 

Figure 6.41 shows how 6 depends on time for parameters with values £ = 1m, 
A = 0.1 m, and g = 10m/s*. Soa = 0.1, and w = 10s7?. We produced these plots 
numerically using Eq. (6.126), with the initial conditions of 8 (0) = 0.1 and 6(0) = 0. 
In the first plot, œ = 10s~!. And in the second plot, œ = 100s~!. The stick falls over 
in first case, but it undergoes oscillatory motion in the second case. Apparently, if w is 
large enough the stick won’t fall over. 

Let’s now explain this phenomenon analytically. At first glance, it’s rather surprising 
that the stick stays up. It seems like the average (over a few periods of the w oscillations) 
of the tangential acceleration in Eq. (6.126), namely —0(aw* cos(wt) — o$), equals 
the positive quantity 0a, because the cos(wt) term averages to zero (or so it appears). 
So you might think that there is a net force making 0 increase, causing the stick to 
fall over. 

The error in this reasoning is that the average of the —aw* cos(wt) term is not zero, 
because 6 undergoes tiny oscillations with frequency w, as seen in the second plot in 
Fig. 6.42. Both of these plots have a = 0.005, a, = 10s~?, and œ = 1000 s7! (we'll 
work with small a and large w from now on; more on this below). The second plot is 
a zoomed-in version of the first one near ¢ = 0. The important point here is that the 
tiny oscillations in 6 shown in the second plot are correlated with cos(œt). It turns out 
that the 6 value at the t where cos(wt) = 1 is larger than the 6 value at the £ where 
cos(wt) = —1. So there is a net negative contribution to the —aw°@ cos(wt) part of 
the acceleration. And it may indeed be large enough to keep the pendulum up, as we 
will now show. 

To get a handle on the —aw*6 cos(wt) term, let’s work in the approximation where 
wis large and a = A/€ is small. More precisely, we will assume a < 1 and aw? >> o$, 
for reasons we will explain below. Look at one of the little oscillations in the second 
plot in Fig. 6.42. These oscillations have frequency w, because they are due to the 
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support moving up and down. When the support moves up, @ increases; and when the 
support moves down, 0 decreases. Since the average position of the pendulum doesn’t 
change much over one of these small periods, we can look for an approximate solution 
to Eq. (6.126) of the form 


A(t) X C+ beos(at), (6.127) 


where b < C. C will change over time, but on the scale of 1/q@ it is essentially 
constant, if a = A/¢ is small enough. Plugging this guess for 0 into Eq. (6.126), and 
using a < | and aw? > a; we find —bw? cos(wt) + Caw? cos(wt) = 0, to leading 
order.!> So we must have b = aC. Our approximate solution for 0 is therefore 


6% c(1 + acos(wt)). (6.128) 


Let’s now determine how C gradually changes with time. From Eq. (6.126), the average 
acceleration of 0, over a period T = 277/a, is 


5 


—0 (ac? cos(wt) — w) 


Q 


-c(1 +a cos(wt)) (ae? cos(wt) — o) 
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(6.129) 


where 


Q= 


aao? g 
7 


5 (6.130) 


But if we take two derivatives of Eq. (6.127), we see that fj simply equals Ë. Equating 
this value of 6 with the one in Eq. (6.129) gives 


C(t) + ZCA © 0. (6.131) 


This equation describes nice simple-harmonic motion. Therefore, C oscillates sinu- 
soidally with the frequency Q given in Eq. (6.130). This is the overall back and forth 
motion seen in the first plot in Fig. 6.42. Note that we must have aw > V2 if 
this frequency is to be real so that the pendulum stays up. Since we have assumed 
a < 1, we see that a2w* > 20% implies aw? > wo, which is consistent with our 
initial assumption above. 

If aw > wo, then Eq. (6.130) gives Q ~ aw/V/2. This is the case if we change the 
setup and just have the pendulum lie flat on a horizontal table where the acceleration 
from gravity is zero. In this limit where g is irrelevant, dimensional analysis implies 
that the frequency of the C oscillations must be a multiple of w, because w is the only 


13 The reasons for the a < 1 and aœ? >> o qualifications are the following. If aw > oh, then 
the aw? cos(wt) term dominates the o term in Eq. (6.126). The one exception to this is when 
cos(wt) ~ 0, but this occurs for a negligibly small amount of time if aw? > oh. Ifa <« 1, then we 
can legally ignore the C term when Eq. (6.127) is substituted into Eq. (6.126). This is true because 
we will find below in Eq. (6.129) that our assumptions lead to C being roughly proportional to 
Ca*w*. Since the other terms in Eq. (6.126) are proportional to Caw”, we need a < | in order 
for the C term to be negligible. In short, a < 1 is the condition under which C varies slowly on 
the time scale of 1/w. 
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quantity in the problem with units of frequency. It just so happens that the multiple 
is a/J2. 

As a double check that we haven’t messed up somewhere, the Q value resulting 
from the parameters in Fig. 6.42 (namely a = 0.005, o = 10s7?, and w = 1000 s7!) 
is Q = ./25/2 — 10 = 1.58 s7!. This corresponds to a period of 27 / Q ~ 3.97 s. And 
indeed, from the first plot in the figure, the period looks to be about 4s (or maybe a 
hair less). For more on the inverted pendulum, see Butikov (2001). 


6.6. Minimum or saddle 


(a) For the given &(¢), the integrand in Eq. (6.26) is symmetric around the midpoint, 
so we obtain 


yi BA 2 t2 2 k 27 
as= f m( =) (É ) aeae (6.132) 
0 T T 27 24 
This is negative if T > ,/12m/k = 2/3/«. Since the period of the oscillation 
is T = 27 /œ, we see that T must be greater than (/3/m)t in order for AS to 


be negative, assuming that we are using the given triangular function for €. 
(b) With €() = e sin(zt/T), the integrand in Eq. (6.26) becomes 


1 ft ET i . 2 
AS = J) „(© cos(at/T)) -k(e sin(x#/T)) dt. 


men? = ke? T 
4T 4° 
where we have used the fact that the average value of sin’@ and cos”0 over half 


ofa period is 1/2 (or you can just do the integrals). This result for AS is negative 
if T > 2./m/k = n/w = T/2, where t is the period. 


(6.133) 


REMARK: _ It turns out that the (¢) « sin(t/T) function gives the best chance 
of making AS negative. You can show this by invoking a theorem from Fourier 
analysis that says that any function satisfying (0) = €(T) = 0 can be written 
as the sum &(t) = ST Cn sin(nt/T), where the cy, are numerical coefficients. 
When this sum is plugged into Eq. (6.26), you can show that all the cross terms 
(terms involving two different values of n) integrate to zero. Using the fact that 
the average value of sin?0 and cos?0 is 1/2, the rest of the integral yields 


bS 5 [mrn 
AS = 74 T ir). (6.134) 


In order to obtain the smallest value of T that can make this sum negative, we 
want only the n = 1 term to exist. We then have &(t) = cı sin(zt/T), and 
Eq. (6.134) reduces to Eq. (6.133), as it should. 

As mentioned in Remark 4 in Section 6.2, it is always possible to make AS 
positive by picking a £ (t) function that is small but wiggles very fast. Therefore, 
we see that for a harmonic oscillator, if T > 1/2, then the stationary value of 
S is a saddle point (some €’s make AS positive, and some make it negative), 
but if T < 1/2, then the stationary value of S is a minimum (all €’s make AS 
positive). In the latter case, the point is that T is small enough so that there is 
no way for £ to get large, without making é large also. & 


6.7. Normal force from a plane 


FIRST SOLUTION: The most convenient coordinates in this problem are w and z, where 
w is the distance upward along the plane, and z is the distance perpendicularly away 
from it. The Lagrangian is then 


1 
zma? +27) — mg(wsin@ + z cos) — V (2), (6.135) 
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6.8. 


6.9. 


where V (z) is the (very steep) constraining potential. The two equations of motion are 


mù = —mg sing, 
a dv 
mz = —mg cos — —. (6.136) 
dz 
At this point we invoke the constraint z = 0. So Z = 0, and the second equation gives 
F, = —V'(0) = mg cos 9, (6.137) 
as desired. We also obtain the usual result, ù = —g sin 8. 


SECOND SOLUTION: We can also solve this problem by using the horizontal and vertical 
coordinates, x and y. We’ll choose (x,y) = (0,0) to be at the top of the plane; see 
Fig. 6.43. The (very steep) constraining potential is V (z), where z = x sin 0 + y cos 0 
is the distance from the mass to the plane (as you can verify). The Lagrangian is then 


L Ghee V 6.138 
=F Hy“) — mgy (z). (6.138) 


Keeping in mind that z = x sin @ + y cos 0, the two equations of motion are (using the 
chain rule) 


mk = —— — = —V'(z)sin@, 
dz ax 
(6.139) 
x dV ðz A 
mý = —mg — — — = —mg — V (z) cosé. 
dz ðy 
At this point we invoke the constraint condition z = 0 => x = —ycot0. This 


condition, along with the two E-L equations, allows us to solve for the three unknowns, 
x, », and V'(0). Using X = —9 cot @ in Eq. (6.139), we find 


X=gcos@siné, p= —g sin?ð, F. = —V'(0) = mg cos8. (6.140) 


The first two results here are simply the horizontal and vertical components of the 
acceleration along the plane, which is g sin 8. 


Bead on a stick 
There is no potential energy here, so the Lagrangian consists of just the kinetic energy, 
T, which comes from the radial and tangential motions: 
1 1 
L=T= smi + smo. (6.141) 
Equation (6.52) therefore gives 


1 1 
E = =mi? — -mro . (6.142) 
2 2 


Claim 6.3 says that this quantity is conserved, because 0L/dt = 0. But it is not the 
energy of the bead, due to the minus sign in the second term. 

The point here is that in order to keep the stick rotating at a constant angular speed, 
there must be an external force acting on it. This force in turn causes work to be done 
on the bead, thereby increasing its kinetic energy. The kinetic energy T is therefore not 
conserved. From Eqs. (6.141) and (6.142), we see that E = T — mr?” is the quantity 
that is constant in time. See Exercise 6.39 for some F = ma ways to show that the 
quantity £ in Eq. (6.142) is conserved. 


Atwood’s machine 


FIRST SOLUTION: Ifthe left mass goes up by x and the right mass goes up by y, then 
conservation of string says that the middle mass must go down by x + y. Therefore, 
the Lagrangian of the system is 


oe eee ~ 
L= 5(Am)s? + 5 (3m)(—% — 3)? + zm? — (mgr + (Bm)g(—x — y) + mgy) 


7 
= smc H 3my + 2my* — mg(x — 2y). (6.143) 


6.10. 


6.11 Solutions 


This is invariant under the transformation x —> x + 2e and y —> y + €. Hence, we can 
use Noether’s theorem, with Ky = 2 and K, = 1. The conserved momentum is then 


aL L 
P=% Tk, =m H352) + m(3x +45) (1) = m17 + 10). (6.144) 
x iy 


This P is constant. In particular, if the system starts at rest, then x always equals 
—(10/17)y. 


SECOND SOLUTION: The Euler-Lagrange equations are, from Eq. (6.143), 


7mž + 3m) = —mg, 
(6.145) 
3mx + 4my = 2mg. 
Adding the second equation to twice the first gives 
17m + 10m} = 0 “(17mi 10mb) =Ü. (6.146) 
dt 


THIRD SOLUTION: We can also solve this problem using F = ma. Since the tension T 
is the same throughout the rope, we see that the three F = dP/dt equations are 


3m Pin 


dP. dP. 
al 2T — 3mg T’ 2T — mg = 


2T —4mg = 


(6.147) 


The three forces depend on only two quantities (T and mg), so there must be some 
combination of them that adds up to zero. If we set a(2T — 4mg) + b(2T — 3mg) 4 

c(2T — mg) = 0, then we have a + b + c = 0 and 4a + 3b + c = 0, which is satisfied 
by a = 2, b = —3, and c = 1. Therefore, 


d 
0= di (2P4m 3P3m + Pm) 


= © (24m) 3(3m)(—% — j) +>) 
dt 
= © atmi + 10my). (6.148) 


Hoop and pulley 

Let the radius to M make an angle @ with the vertical (see Fig. 6.44). Then the coor- 
dinates of M relative to the center of the hoop are R(sin 0, — cos 0). The height of 
m, relative to its position when M is at the bottom of the hoop, is y = —R@. The 
Lagrangian is therefore (and yes, we’ve chosen a different y = 0 reference point for 
each mass, but such a definition changes the potential only by a constant amount, 
which is irrelevant) 


1 ; 
L=5(M+ m)R?6* + MgR cos 6 + mgRO. (6.149) 
The equation of motion is then 


(M + m)RO = g(m — M sin). (6.150) 


This is just F = ma along the direction of the string (because Mg sin 6 is the tangential 
component of the gravitational force on M). 

Equilibrium occurs when 6 = 6 = 0. From Eq. (6.150), we see that this happens at 
sin 0o = m/M. Letting 0 = 6o + ô, and expanding Eq. (6.150) to first order in ô, gives 


5 Mg cos 69 
ô -2 js =0. 6.151 
j G =) ( ) 


Fig. 6.44 
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The frequency of small oscillations is therefore 
M cos M — m\' 
PEA An e "o JE, (6.152) 
M+mVR M+m R 
where we have used cos 6o = ,/1 — sin o. 


REMARKS: IfM œ m, then 0) ~ 0, and w ~ ./g/R. This makes sense, because m 
can be ignored, so M essentially oscillates around the bottom of the hoop, just like a 
pendulum of length R. 

If M is only slightly greater than m, then 0) ~ 2/2, and w ~ 0. This also makes 
sense, because if 0 ~ 7/2, then the restoring force g(m — M sin 0) doesn’t change 
much as @ changes (the derivative of sin @ is zero at 9 = 7/2), so it’s as if we have a 
pendulum in a very weak gravitational field. 

We can actually derive the frequency in Eq. (6.152) without doing any calcula- 
tions. Look at M at the equilibrium position. The tangential forces on it cancel, and 
the radially inward force from the hoop must be Mg cos 6o to balance the radial out- 
ward component of the gravitational force. Therefore, for all the mass M knows, it 
is sitting at the bottom of a hoop of radius R in a world where gravity has strength 
g’ = gcos. The general formula for the frequency of a pendulum (as you can 
quickly show) is œ = ,/F’/M’R, where F’ is the gravitational force (which is Mg’ 
here), and M’ is the total mass being accelerated (which is M +m here). This gives the 
æ in Eq. (6.152). (This reasoning is a little subtle; food for thought.) & 


Bead on a rotating hoop 


Breaking the velocity up into the component along the hoop plus the component 
perpendicular to the hoop, we find 


1 ; 
LS zor sin?6 + R767) + mgR cos @. (6.153) 


The equation of motion is then 
RÖ = sin 0 (a? R cos — g). (6.154) 


The F = ma interpretation of this is that the component of gravity pulling downward 
along the hoop accounts for the acceleration along the hoop plus the component of the 
centripetal acceleration along the hoop. 

Equilibrium occurs when 6 = Ë = 0. The right-hand side of Eq. (6.154) equals zero 
when either sin 0 = 0 (that is, 0 = 0 or 0 = 7) or cos 0 = g/(@?R). Since cos 0 must 
be less than or equal to 1, this second condition is possible only if œ? > g/R. So we 
have two cases: 


e Ifw* < g/R, then 0 = 0 and 6 = z are the only equilibrium points. 
The 9 = x case is unstable. This is fairly intuitive, but it can also be seen 
mathematically by letting 9 = a + ô, where ô is small. Equation (6.154) then 


becomes 
5 — 8(w? +.g/R) = 0. (6.155) 
The coefficient of ô is negative, so ô undergoes exponential instead of oscillatory 
motion. 
The 6 = 0 case turns out to be stable. For small 6, Eq. (6.154) becomes 


6 + 0(g/R—w*) =0. (6.156) 


The coefficient of 6 is positive, so we have sinusoidal solutions. The frequency 
of small oscillations is y g/R — w2. This goes to zero as w > ./g/R. 

e Ifa > g/R, then 0 = 0, 0 = z, and coso = g/(@?R) are all equilibrium 
points. The 6 = z case is again unstable, by looking at Eq. (6.155). And the 
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0 = 0 case is also unstable, because the coefficient of 6 in Eq. (6.156) is now 
negative (or zero, if œ? = g/R). 

Therefore, cos) = g/(w*R) is the only stable equilibrium. To find the fre- 
quency of small oscillations, let 9 = 6) + 6 in Eq. (6.154), and expand to first 
order in ô. Using cos 0) = g/(wR), we find 


5 + (w° sin?@)6 = 0. (6.157) 


The frequency of small oscillations is therefore w sin 0) = yœ? — g?/w*R?. 

The frequency w = ./g/R is the critical frequency above which there is a stable 
equilibrium at 9 4 0, that is, above which the mass wants to move away from the 
bottom of the hoop. 


Remark: This frequency of small oscillations goes to zero as œ > /g/R. 
And it approximately equals œ as w — oo. This second limit can be viewed 
in the following way. For very large w, gravity isn’t important, and the bead 
feels a centripetal force (the normal force from the hoop) essentially equal to 
mw’ R as it moves near 0 = 1/2. So for all the bead knows, it is a pendulum of 
length R in a world where “gravity” pulls sideways with a force mo? R = mg’ 
(outward, so that it is approximately canceled by the inward-pointing normal : m 
force, just as the downward gravitational force is approximately canceled by the (top view) 
upward tension in a regular pendulum). The frequency of such a pendulum is 


Jg'/R=Jw?R/R=0. & 


6.12. Another bead on a rotating hoop 


With the angles wt and 6 defined as in Fig. 6.45, the Cartesian coordinates for the bead 
are 


(x, y) = (R cos wt +r cos(wt + 0), R sin æt +r sin(wt + 8)). (6.158) 
The velocity is then Fig. 6.45 
(x,y) = (—oRsin wt — r(w + 6) sin(wt + 0), 


wR cos wt + r(w + Ê) cos(wt + 0) . (6.159) 
The square of the speed is therefore 


v = Ro r(@ 6)? 


+ 2Rra(@+ 6)( sin wt sin(wt + 6) + cos wt cos(wt + 8)) 


= Ro +r? (w +0}? + 2Rrw(w + 0) cos 6. (6.160) 


This speed can also be obtained by using the law of cosines to add the velocity 
of the center of the hoop to the velocity of the bead with respect to the center (as 
you can show). 

There is no potential energy, so the Lagrangian is simply L = mv? /2. The equation 
of motion is then, as you can show, 


rË + Rw’ sind = 0. (6.161) 


Equilibrium occurs when 6 = 6 = 0, so Eq. (6.161) tells us that the equilibrium 
is located at 0 = 0, which makes intuitive sense. (Another solution is 9 = x, but 
that’s an unstable equilibrium.) A small-angle approximation in Eq. (6.161) gives 
6 + (R/r)w6 = 0, so the frequency of small oscillations is Q = w./R/r. 


Remarks: If R <r, then Q ~ 0. This makes sense, because the frictionless hoop 
is essentially not moving. If R = r, then Q = øw. If R > r, then Q is very large. 
In this case, we can double-check the Q = w,/R/r result in the following way. In 
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the accelerating frame of the hoop, the bead feels a centrifugal force (discussed in 
Chapter 10) of m(R + r)@”. For all the bead knows, it is in a gravitational field with 
strength g’ = (R+r)w7. So the bead (which acts like a pendulum of length), oscillates 
with a frequency equal to 


r7 2 
J- [REDO oE RSH: (6.162) 
r r 


Note that if we try to use this “effective gravity” argument as a double check for smaller 
values of R, we get the wrong answer. For example, if R = r, we obtain an oscillation 
frequency of w./2R/7, instead of the correct value wy/R/r. This is because in reality 
the centrifugal force fans out near the equilibrium point, while our “effective gravity” 
argument assumes that the field lines are parallel (and so it gives a frequency that is 
too large). & 


Mass on a wheel 


Let the angle 0 be defined as in Fig. 6.46, with the convention that @ is positive if M is 
to the right of m. Then the position of m in Cartesian coordinates, relative to the point 
where m would be in contact with the ground, is 


(x,V)m = R(0 — sind, 1 — cos@). (6.163) 


We have used the nonslipping condition to say that the present contact point is a 
distance RO to the right of where m would be in contact with the ground. Differentiating 
Eq. (6.163), we find that the square of m’s speed is v2, = 2R76?(1 — cos@). 

The position of M is (x,y)m = R(0, 1), so the square of its speed is vy = R262. 
The Lagrangian is therefore 


1 . . 
L= JMR È? + mR767(1 — cos 0) + mgR cos 6, (6.164) 


where we have measured both potential energies relative to the height of M. The 
equation of motion is 


MR6 + 2mR6(1 — cos 0) + mRO? sin ð + mg sin@ = 0. (6.165) 


In the case of small oscillations, we can use cos @ ~ 1—67/2 andsin 0 ~ @. The second 
and third terms in Eq. (6.165) are then third order in @ and can be neglected (basically, 
the middle term in Eq. (6.164), which is the kinetic energy of m, is negligible), so we 
find 


6+ ()e=0. (6.166) 


The frequency of small oscillations is therefore 


Ge. J, (6.167) 


Remarks: IfM >> m, then w — 0. This makes sense. If m >> M, then w — oo. 
This also makes sense, because the huge mg force makes the situation similar to one 
where the wheel is bolted to the ground, in which case the wheel vibrates with a high 
frequency. 

Equation (6.167) can actually be derived in a much quicker way, using torque. For 
small oscillations, the gravitational force on m produces a torque of —mgR@ around 
the contact point on the ground. For small 60, m has essentially no moment of inertia 
around the contact point, so the total moment of inertia is simply MR?. Therefore, 
t = Ia gives —mgRO = MR?Ö, from which the result follows. & 
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6.14. Pendulum with a free support 


Let x be the coordinate of M, and let @ be the angle of the pendulum (see Fig. 6.47). 
Then the position of the mass m in Cartesian coordinates is (x + £ sin 0, —£ cos 8). 
Taking the derivative to find the velocity, and then squaring to find the speed, gives 
w= + 026? + 206 cos 6. The Lagrangian is therefore 


1 
2 2 


m? + LÀ? + 20%6 cos 0) + mgl cos 8. (6.168) 


The equations of motion obtained from varying x and 0 are 


(M + m)x + m6 cos6 — ml62 sin0 = 0, 


. (6.169) Fig. 6.47 
£6+Xcos@+gsind = 0. 


If 6 is small, we can use the small-angle approximations, cos@ ~ 1 — 67/2 and 
sin 6 ~ 0. Keeping only the terms that are first-order in 0, we obtain 
(M +m)xX+m06 = 0, 
p (6.170) 
X+ 6+ 26 = 0. 


The first equation expresses momentum conservation. Integrating it twice gives 


£ 
x ( ý Je 1 At +B. (6.171) 
M+m 
The second equation is F = ma in the tangential direction. Eliminating ¥ from 
Eq. (6.170) gives 

T M+m\g 
6 =6=0. 6.172 
+ ( M ) 7 (6.172) 


Therefore, 0 (t) = C cos(wt + p), where 


| m |g 
= | l+ —/ =. 6.173 
w + MNV? ( ) 


The general solutions for 6 and x are therefore 


O(t) =Ccos(ot+¢), x(t) Ome tpieaem. (617 
M+m 


The constant B is irrelevant, so we’ll ignore it. The two normal modes are: 


e A = Q: Inthis case, x = —0m£/(M +m). Both masses oscillate with the frequency 
w given in Eq. (6.173), always moving in opposite directions. The center of mass 
does not move (as you can verify). 

e C=0: In this case, 0 = 0 and x = At. The pendulum hangs vertically, with both 
masses moving horizontally at the same speed. The frequency of oscillations is 
zero in this mode. 


Remarks: IfM > m, then w = ./g/, as expected, because the support essentially 
stays still. 

If m > M, then œ > /m/M./g/l — ov. This makes sense, because the ten- 
sion in the rod is very large. We can actually be quantitative about this limit. For 
small oscillations and for m >> M, the tension of mg in the rod produces a side- 
ways force of mg@ on M. So the horizontal F = Ma equation for M is mgd = Mx. 
But x ~ —£0 in this limit, so we have mg9 = —M26, which gives the desired 
frequency. & 
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6.15. Pendulum support on an inclined plane 
Let z be the coordinate of M along the plane, and let 6 be the angle of the pendulum 
(see Fig. 6.48). In Cartesian coordinates, the positions of M and m are 


(x,y) = (Z cos B, —z sin £), 


(6.175) 
(x, Y)m = (Z cos B + £ sin 0, —z sin B — £ cos 0). 
Differentiating these positions, we find that the squares of the speeds are 
2 +2 
Um ==, 
(6.176) 


ve = 2 + À? + 228 (cos B cos — sin £ sin 0). 


m 


The Lagrangian is therefore 


1 1 : ; 
zM? } m? H €°6? + 2626 cos(0 + p)) + Mgz sin B + mg(z sin B + £ cos 8). 
(6.177) 
The equations of motion obtained from varying z and 0 are 
(M + m)i + me(6 cos(0 + 8) — 6” sin(6 4 B) = (M + m)g sin B, 
(6.178) 


£6 +Z cos(0 + 6) = —g sin 8. 


Let us now consider small oscillations about the equilibrium point (where 6 = 6 = 0). 
We must first determine where this point is. The first equation above gives Z = g sin £. 
The second equation then gives g sin $ cos(@ + B) = —gsin@. By expanding the 
cosine term, we find tan 0 = — tan £, so 0 = —$. (0 = x — $ is also a solution, but 
this is an unstable equilibrium.) The equilibrium position of the pendulum is therefore 


where the string is perpendicular to the plane.'* 
To find the normal modes and frequencies for small oscillations, let 0 = —f + ô, 
and expand Eq. (6.178) to first order in ô. Letting į} = Z — g sin £ for convenience, we 


obtain 


(M +m) + mes = 0, 
. (6.179) 
i + £5 + (g cos B)5 = 0. 
Using the determinant method (or using the method in Problem 6.14; either way works), 
we find the frequencies of the normal modes to be 


o1=0, and a =,/14 a (6.180) 


These are the same as the frequencies in the previous problem (where M moves hor- 
izontally), but with g cos £ in place of g; compare Eq. (6.179) with Eq. (6.170).!° 
Looking at Eq. (6.174), and recalling the definition of 7, we see that the general 
solutions for 6 and z are 


ot) =—-B+C t+), z(t) sd t+¢)4 ene P+At+B 
= cos(wt+), z(t) = M4m cos(wt + $) 4 7 t + B. 
(6.181) 


This makes sense. The tension in the string is perpendicular to the plane, so for all the pendulum 
bob knows, it may as well be sliding down a plane parallel to the given one, a distance £ away. 
Given the same initial speed, the two masses slide down their two “planes” with equal speeds at 
all times. 

This makes sense, because in a frame that accelerates down the plane at g sin £, the only external 
force on the masses is a gravitational force of g cos 6 perpendicular to the plane. As far as M 
and m are concerned, they live in a world where gravity pulls “downward” (perpendicular to the 
plane) with strength g’ = g cos £. 


6.16. 


6.11 Solutions 


The constant B is irrelevant, so we’ll ignore it. The basic difference between these 
normal modes and the ones in the previous problem is the acceleration down the plane. 
If you go to a frame that accelerates down the plane at g sin £, and if you tilt your head 
at an angle £ and accept the fact that g’ = g cos in your world, then the setup is 
identical to the one in the previous problem. 


Tilting plane 
Relative to the support, the positions of the masses are 


(x, y)m = (€sind, —£ cos 0), 


(6.182) 
(x, Y)m = (L sin 0 + xcos0,—£cos0 + xsin 8). 
Differentiating these positions, we find that the squares of the speeds are 
v= PO, wv = (0645) +2°67. (6.183) 


You can also obtain v2, by noting that (€6 + ž) is the speed along the long rod, and xÅ 


is the speed perpendicular to it. The Lagrangian is 


L= Luei? H sm (ed H x)? H x76?) + Mgt cos6 + mg(€ cos @ — x sing). 
(6.184) 
The equations of motion obtained from varying x and 0 are 
06 +X = x6? — g sin9, 
” (6.185) 


Me6 4 me(l6 +X) 4 mx°6 + 2mxx6 = (M + m)gé sin @ — mgx cos 8. 


Let us now consider the case where both x and 6 are small (or more precisely, 0 « 1 
and x/€ < 1). Expanding Eq. (6.185) to first order in 0 and x/£ gives 


(€6+%)+¢0 =0, 


A 3 (6.186) 
MEUË + gO) + meL + ¥) + mglO + mex = 0. 


We can simplify these a bit. Using the first equation to substitute —g@ for EË +), 
and also —x for (£0 + g0), in the second equation gives 
tö +ï +g =0, 
(6.187) 
—M&ž + mgx = 0. 
The normal modes can be found using the determinant method, or we can find them just 
by inspection. The second equation says that either x(t) = 0, or x(t) = A cosh (æt + £), 
where a = /mg/ME. So we have two cases: 


e Ifx(t) = 0, then the first equation in (6.187) says that the normal mode is 


( )=2( i ) eos(or + 0), (6.188) 


where w = ./g/€. This mode is fairly clear. With the proper initial conditions, 
m will stay right where M is. The normal force from the long rod will be exactly 
what is needed in order for m to undergo the same oscillatory motion as M. The 
two masses may as well be two pendulums of length £ swinging side by side. 

e If x(t) = Acosh(at + £), then the first equation in (6.187) can be solved (by 
guessing a particular solution for 0 of the same form) to give the normal mode, 


( : Jee eM +m) ) eoshiar + A (6.189) 


where a = y mg/M £. This mode is not as clear. And indeed, its range of validity 
is rather limited. The exponential behavior will quickly make x and 0 large, and 
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thus outside the validity of our small-variable approximations. You can show that 
in this mode the center of mass remains directly below the pivot. This can occur, 
for example, by having m move down to the right as the rods rotate and swing M 
up to the left. There is no oscillation in this mode; the positions keep growing. 
The CM falls, to provide for the increasing kinetic energy. 


6.17. Rotating curve 


The speed along the curve is xy 1 + y”, and the speed perpendicular to the curve is 
ox. So the Lagrangian is 


l 
is z" (ox 1 (14 y») mgy, (6.190) 


where y(x) = b(x/a)*. The equation of motion is then 


d (aL aL 
= za ' 2 “27 J z l, 6.191 
(3) ae FA +y) EX yy =w'x— gy (6.191) 


Equilibrium occurs when x = ¥ = 0, so Eq. (6.191) says that the equilibrium value of 
x satisfies 


gy’ xo) 
—. 


Xo = (6.192) 


w 
The F = ma explanation for this (writing y’ (xo) as tan, where @ is the angle of the 
curve, and then multiplying through by w? cos 0) is that the component of gravity along 
the curve accounts for the component of the centripetal acceleration along the curve. 
Using y(x) = b(x/a)*, Eq. (6.192) yields 


1/(a—2) 


ew 
= — N 6.193 
xo a( 1gb ) ( ) 


As à — 00, we see that x9 goes to a. This makes sense, because the curve essentially 
equals zero up to a, and then it rises very steeply. You can check numerous other limits. 
Letting x = x9 + ô in Eq. (6.191), and expanding to first order in ô, gives 


5 (1+y'G0)*) =6 (o? - gy"G0)). (6.194) 
The frequency of small oscillations is therefore given by 


n a 
gas te na (6.195) 
1+y’Qo) 


Using the explicit form of y, along with Eq. (6.193), we find 


2 (A — 2)? 
Y= - JOT (6.196) 
1 + aut ca) 
g? àgb 


We see that à must be greater than 2 in order for there to be oscillatory motion around 
the equilibrium point. For A < 2, the equilibrium point is unstable, that is, to the left 
the force is inward, and to the right the force is outward. 

For the case A = 2, we have y(x) = b(x/a)?, so the equilibrium condition, 
Eq. (6.192), gives x9 = (2gb/a?w)xo. For this to be true for some x9, we must 
have œ? = 2gb/a*. But if this holds, then Eq. (6.192) is true for all x. So in the special 
case of A = 2, the bead happily sits anywhere on the curve if œ? = 2gb/a’. (In the 
rotating frame of the curve, the tangential components of the centrifugal and gravita- 
tional forces exactly cancel at all points.) If à = 2 and œ? # 2gb/a?, then the particle 
feels a force either always inward or always outward. 


REMARKS: For w — 0, Eqs. (6.193) and (6.196) give x9 —> 0 and Q — 0. And for 
w — œ, they give x9 —> oo and Q — 0. In both cases Q — 0, because in both 


6.18. 


6.11 Solutions 


cases the equilibrium position is at a place where the curve is very flat (horizontally 
or vertically, respectively), and the restoring force ends up being small. 

For à —> 00, we have x9 > a and Q — ov. The frequency is large here because the 
equilibrium position at a is where the curve has a sharp corner, so the restoring force 
changes quickly with position. Or, you can think of it as a pendulum with a very small 
length, if you approximate the “corner” by a tiny circle. & 


Motion in a cone 

If the particle’s distance from the axis is r, then its height is r/ tan a, and its distance 
up along the cone is r/ sina. Breaking the velocity into components up along the cone 
and around the cone, we see that the square of the speed is v? = #2/ sin’w +7262. The 
Lagrangian is therefore 


1 72 
fecal == +i) - =. (6.197) 
2 sina tan œ 


The equations of motion obtained from varying 0 and r are 


do; 
—(mr*@) = 0 
dt (6.198) 
7 = rÅ? sin?a — g cosa sina. 
The first of these equations expresses conservation of angular momentum. The second 
equation is more transparent if we divide through by sina. With x = r/ sina being 
the distance up along the cone, we have ¥ = (r?) sina — g cos œ. This is the F = ma 
statement for the diagonal x direction. 
Letting mr?6 = L, we can eliminate 6 from the second equation to obtain 
I sin? 
r= 


z3 ` gcosa sina. (6.199) 
mr 


We will now calculate the two desired frequencies. 


e Frequency of circular oscillations, œ: For circular motion with r = ro, we have 
+ = ř = 0, so the second of Eqs. (6.198) gives 


o=b= 2 o (6.200) 
ro tana 


e Frequency of oscillations about a circle, Q: If the orbit were actually the circle 
r = ro, then Eq. (6.199) would give (with 7 = 0) 


hs 
L? sin?a 


= g cos & sina. (6.201) 
mere 
This is equivalent to Eq. (6.200), which can be seen by writing L as mrgo. 
We will now use our standard procedure of letting r(t) = ro + ô (t), where 5(t) 
is very small, and then plugging this into Eq. (6.199) and expanding to first order 
in 6. Using 


1 = 1 1 al ( 2) (6.202) 
mot 43725  rAd+38/r) rè ro)’ ' 
we have 
. Dsiwa 35 
6= 5 1 gcosa sina. (6.203) 
mre ro 


Recalling Eq. (6.201), the terms not involving ô cancel, and we are left with 


x 3L? sin2a 
6= - — r ô. (6.204) 
mr 
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Using Eq. (6.201) again to eliminate L we have 


© #3 
oa (2 sin @ cos «) j=; (6.205) 
ro 


3g . 
Q = | — sin g cos g. (6.206) 
ro 


Having found the two desired frequencies in Eqs. (6.200) and (6.206), we see that their 
ratio is 


Therefore, 


Q ; 
— = v3 sina. (6.207) 
w 


This ratio Q/w is independent of ro. The two frequencies are equal if sina = 1//3, 
that is, ifa@ ~ 35.3° = a. If@ = a, then after one revolution around the cone, r 
returns to the value it had at the beginning of the revolution. So the particle undergoes 
periodic motion. 


Remarks: In the limit a — 0 (that is, the cone is very thin), Eq. (6.207) says that 
Q/w — 0. In fact, Eqs. (6.200) and (6.206) say that œ —> oo and Q — 0. So the 
particle spirals around many times during one complete r cycle. This seems intuitive. 

In the limit @ — 7/2 (that is, the cone is almost a flat plane), both w and Q go to 
zero, and Eq. (6.207) says that Q/w — 4/3. This result is not at all obvious. 

IfQ/w = V3 sin o isarational number, then the particle undergoes periodic motion. 
For example, ifaw = 60°, then Q/w = 3/2, so it takes two complete circles for r to go 
through three cycles. Or, ifa = arcsin(1/2./3) = 16.8°, then Q/w = 1/2, so it takes 
two complete circles for r to go through one cycle. & 


6.19. Double pendulum 


Relative to the pivot point, the Cartesian coordinates of m1 and mz are, respectively 
(see Fig. 6.49), 


(x,y), = (£ sin 01, —£1 cos 01), 


(6.208) 
(x, y)2 = (£ sin 01 + £2 sin 02, —£1 cos 6; — £2 cos 62). 
Taking the derivative to find the velocities, and then squaring, gives 
2 = 28, 
(6.209) 


vw = bÀ? + 63 + 201 £20182 (cos 01 cos 02 + sin 04 sin 62). 
The Lagrangian is therefore 
L= sm 6? + smn G6? + C563 + 2€; 2618 cos(01 — 62)) 
+ mgt} cos 6; + mzg(£1 cos 01 + £2 cos 62). (6.210) 
The equations of motion obtained from varying 6; and 6 are 
0 = (my + mz) £76, + m1l26> cos(O1 — 42) + mL) 0265 sin(01 — 02) 
+ (mı +m2)gé; sin 0], 
0 = m63 + ml) C6) cos(01 — 02) — mae) L20? sin(01 — 62) (6.211) 


+ mgl sin 62. 


6.11 Solutions 


This is a bit of a mess, but it simplifies greatly if we consider small oscillations. 
Using the small-angle approximations and keeping only the leading-order terms, 
we obtain 
0 = (my + m); + mlz + (my + m2)gr, 
ss 5 (6.212) 
0 = L20 + £101 + gO. 


Consider now the special case, £1 = £2 = £. We can find the frequencies of the 
normal modes by using the determinant method, discussed in Section 4.5. You 


can show that the result is 
mı +m2+,/mym2 + m g 
; (6.213) 
m] £ 


The normal modes turn out to be, after some simplification, 


w4 = 


01 (6) F/m 
(ao ), i eoe ) cos(w+t + 4). (6.214) 


Some special cases are: 


e mı = m: The frequencies are 


PEENE A l (6.215) 
The normal modes are 
6 (t) F1 
(oo ). x (33 cos(wit + px). (6.216) 
em, > m: With m2/m, = e, the frequencies are (to leading nontrivial order 
in €) 
yE g 
w+ = (l vers. (6.217) 


The normal modes are 


(RB), eese em 


In both modes, the upper (heavy) mass essentially stands still, and the lower (light) 
mass oscillates like a pendulum of length £. 
eè m; X m: With m/m = e, the frequencies are (to leading order in €) 


2g g 
= = : 6.21 
eM eg Ty (6:219) 
The normal modes are 


N x ea ) cos(wat + $+). (6.220) 


In the first mode, the lower (heavy) mass essentially stands still (from the x2 in 
Eq. (6.208)), and the upper (light) mass vibrates back and forth at a high frequency 
(because there is a very large tension in the rods). In the second mode, the rods 
form a straight line, and the system is essentially a pendulum of length 2¢. 
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Consider now the special case, mı = m2. Using the determinant method, you can 
show that the frequencies of the normal modes are 


++ J+ os 
w+ = yg Ub (6.221) 


The normal modes turn out to be, after some simplification, 


0 (t) l2 
(a0 ), G -4 F ara) cos(w+t + $+). (6.222) 


Some special cases are: 


e lı = b2: We already considered this case above. You can show that Eqs. (6.221) 
and (6.222) agree with Eqs. (6.215) and (6.216), respectively. 
o L > £2: With 42/4 = e, the frequencies are (to leading order in €) 


| 2g [g 
= —— $ 6.223 
W+ b , (62) t ( ) 
The normal modes are 


e | x E ) cos(w4.t + 64), 


Ge ) x (i ) cos(w_t + ġ-—). 


In the first mode, the masses essentially move equal distances in opposite direc- 
tions, at a high frequency (assuming £2 is small). The factor of 2 in the frequency 
arises because the angle of £2 is twice what it would be if mı were bolted in 
place; so m2 feels double the tangential force. In the second mode, the rods form a 
straight line, and the masses move just like a mass of 2m. The system is essentially 
a pendulum of length £1. 

o Ll; < l2: With £;/l2 = e€, the frequencies are (to leading order in €) 


[2g [g 
= = . 6.225 
w+ a b ( ) 
The normal modes are 


9 (0) ) ( 1 ) 
x cos(w+t + d+), 
e (OAN —€ 
A(t 1 
ae ) x iG ) cos(w_t+ _). 
In the first mode, the bottom mass essentially stands still, and the top mass oscil- 
lates at a high frequency (assuming £1 is small). The factor of 2 in the frequency 
arises because the top mass essentially lives in a world where the acceleration 
from gravity is g’ = 2g (because of the extra mg force downward from the lower 
mass). In the second mode, the system is essentially a pendulum of length £2. 
The factor of 2 in the angles is what is needed to make the tangential force on 


the top mass roughly equal to zero (because otherwise it would oscillate at a high 
frequency, since £; is small). 


(6.224) 


(6.226) 


6.20. 


6.21. 


6.22. 


6.11 Solutions 


Shortest distance in a plane 


Let the two given points be (x1,y1) and (x2,2), and let the path be described 
by the function y(x). (Yes, we’ll assume it can be written as a function. Locally, 
we don’t have to worry about any double-valued issues.) Then the length of the 


path is 
x2 
t= / Vl+y? dx. (6.227) 
x] 


The “Lagrangian” is L = y 1 + y, so the Euler-Lagrange equation is 


d (A) ab daf y i P 
dx\ ay) ay dx\ J1 +y2 Ba ` 


We see that y’/,/1 + y” is constant. Therefore, y’ is also constant, so we have 
a Straight line, y(x) = Ax + B, where A and B are determined by the endpoint 
conditions. 


Index of refraction 


Let the path be described by y(x). The speed at height y is v « y. Therefore, the 
time to go from (x1, 1) to (x2, y2) is 


x2 d x2 ./] /2 
T= / 5a / VITI” k. (6.229) 
xy Y x] ¥ 


The “Lagrangian” is therefore 


Jit 


y 


L« (6.230) 


At this point, we could apply the E-L equation to this L, but let’s just use 
Lemma 6.5, with f (y) = 1/y. Equation (6.86) gives 


: B 
1+3? =B => l+ a5. 
y 


(6.231) 


We must now integrate this. Solving for y’, and then separating variables and 
integrating, gives 


fans 29 => x+4=FyB-y2. (6.232) 
75-7 

Therefore, (x + A)? + y? = B, which is the equation for a circle. Note that the 

circle is centered at a point with y = 0, that is, at a point on the bottom of the 

slab. This is the point where the perpendicular bisector of the line joining the two 

given points intersects the bottom of the slab. 


Minimal surface 


By “tension” in a surface, we mean the force per unit length in the surface. The 
tension throughout the surface must be constant, because it is in equilibrium. If 
the tension at one point were larger than at another, then some patch of the surface 
between these points would move. 

The ratio of the circumferences of the circular boundaries of the ring is 
y2/yı. Therefore, the condition that the horizontal forces on the ring cancel is 
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6.23. 


yı cos} = y2co0s62, where the 6’s are the angles of the surface, as shown 
in Fig. 6.50. In other words, ycos@ is constant throughout the surface. But 


cos@ = 1/y 1 + y’, so we have 
y 


pe 
JVi+y? 


This is equivalent to Eq. (6.77), and the solution proceeds as in Section 6.8. 


(6.233) 


Existence of a minimal surface 

The general solution for y(x) is given in Eq. (6.78) as y(x) = (1/b) cosh b(x + d). 
Ifwe choose the origin to be midway between the rings, thend = 0. Both boundary 
condition are thus 


1 
a: cosh b£. (6.234) 


We will now determine the maximum value of £/r for which the minimal surface 
exists. If £/r is too large, then we will see that there is no solution for b in 
Eq. (6.234). If you perform an experiment with soap bubbles (which want to 
minimize their area), and if you pull the rings too far apart, then the surface will 
break and disappear as it tries to form the two boundary circles. 

Define the dimensionless quantities, 


, and z=br. (6.235) 


Then Eq. (6.234) becomes 
z = cosh nz. (6.236) 


If we make a rough plot of the graphs of w = z and w = cosh nz for a few values 
of 7 (see Fig. 6.51), we see that there is no solution to Eq. (6.236) if 7 is too large. 
The limiting value of ņ for which there exists a solution occurs when the curves 
w = z and w = cosh nz are tangent; that is, when the slopes are equal in addition 
to the functions being equal. Let nọ be the limiting value of 7, and let zg be the 
place where the tangency occurs. Then equality of the values and the slopes gives 


zo = cosh(ngzo), and 1 = nọ sinh(nozo). (6.237) 
Dividing the second of these equations by the first gives 
1 = (n020) tanh (7920). (6.238) 
This must be solved numerically. The solution is 
nozo © 1.200. (6.239) 


Plugging this into the second of Eqs. (6.237) gives 


e 
(£) = no ~ 0.663. (6.240) 
max 


r 


(Note also that zọ = 1.200/ņo = 1.810.) We see that if £/r is larger than 
0.663, then there is no solution for y(x) that is consistent with the boundary 
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conditions. Above this value of £/r, the soap bubble minimizes its area by head- 
ing toward the shape of just two disks, but it will pop well before it reaches that 
configuration. 

To get a sense of the rough shape of the limiting minimal surface, note that the 
ratio of the radius of the “middle” circle to the radius of the boundary rings is 


yO) _ cosh(0) _ 1 _ i 
y(£&) cosh(be)  cosh(nozo) zo 


~ 0.55. (6.241) 


REMARKS: 

1. We glossed over one issue above, namely that there may be more than one solution 
for the constant b in Eq. (6.234). In fact, Fig. 6.51 shows that for any n < 0.663, 
there are two solutions for z in Eq. (6.236), and hence two solutions for b in 
Eq. (6.234). This means that there are two possible surfaces that might solve 
our problem. Which one do we want? It turns out that the surface corresponding 
to the smaller value of b is the one that minimizes the area, while the surface 
corresponding to the larger value of b is the one that (in some sense) maximizes 
the area. 

We say “in some sense” because the surface with the larger b is actually a 
saddle point for the area. It can’t be a maximum, after all, because we can always 
make the area larger by adding little wiggles to it. It’s a saddle point because there 
does exist a class of variations for which it has the maximum area, namely ones 
where the “dip” in the curve is continuously made larger (just imagine lowering 
the midpoint in a smooth manner). Such a set of variations is shown in Fig. 6.52. 
If we start with a cylinder for a surface and then gradually pinch in the center, the 
area decreases at first (the decrease in the cross-sectional area is the dominant 
effect at the start). But then as the dip becomes very deep, the area increases 
because the surface starts to look like the two disks, and these two disks have a 
larger area than the original narrow cylinder. The surface eventually resembles 
two nearly flat cones connected by a line. As these cones finally flatten out to 
the two disks, the area decreases. Therefore, the area must have achieved a local 
maximum (at least with respect to this class of variations) somewhere in between. 
This local maximum (or rather, saddle point) arises because the Euler-Lagrange 
technique simply sets the “derivative” equal to zero and doesn’t differentiate 
between maxima, minima, and saddle points. 

Ifn = £/r > 0.663 (so that the initial cylinder in now wide instead of narrow), 
there exists at least one class of variations for which the area decreases mono- 
tonically from the area of the cylinder down to the area of the two disks. If you 
draw a series of pictures (for a wide cylinder) analogous to those in Fig. 6.52, it 
is quite believable that this is the case. 

2. How does the area of the limiting surface (with no = 0.663) compare with the 
area of the two disks? The area of the two disks is 4g = 2mr?. And the area of 
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the limiting surface is 


£ 
A =f 27y 1 + y? dx. (6.242) 
s G y y 


Using Eq. (6.234), this becomes 


© On 2 On 
A, = — cosh*bx dx = — (1 + cosh 2bx) dx 
_¢ b eb 
2r £ i x sinh 2b£ 


b b? 


(6.243) 


But from the definitions of 7 and z, we have £ = nor and b = zo/r for the limiting 
surface. Therefore, A, can be written as 


2 inh 2 
A, = re( no ; sın. we) ; (6.244) 
zo ZO 


Plugging in the numerical values (no ~ 0.663 and zo ~ 1.810) gives 
Aq © (6.28)r?, and As ~ (7.54)r?. (6.245) 


The ratio of Ag to Aq is approximately 1.2 (it’s actually nozo, as you can show). 
The limiting surface therefore has a larger area. This is expected, because for 
£/r > no the surface tries to run off to one with a smaller area, and there are no 
other stable configurations besides the cosh solution we found. & 


6.24. The brachistochrone 


First SOLUTION: In Fig. 6.53, the boundary conditions are y(0) = 0 and y(xọ) = 
yo, with downward taken to be the positive y direction. From conservation of 
energy, the speed as a function of y is v = ./2gy. The total time is therefore 


(6.246) 


xo ds xo ./] +y2 
r= f — z) dx. 
0 v 0 v28 


Our goal is to find the function y(x) that minimizes this integral, subject to the 
boundary conditions above. We can therefore apply the results of the variational 
technique, with a “Lagrangian” equal to 


V1 +y? 
-z 


L (6.247) 


At this point, we could apply the E-L equation to this L, but let’s just use Lemma 
6.5, with f(v) = 1/7. Equation (6.86) gives 


B 
l+y*=Bf0)" => a ar (6.248) 


6.11 Solutions 


as desired. We must now integrate this. Solving for y’ and separating variables 
gives 


d 
vy ay =+tdx. 
B-y 


(6.249) 


A helpful change of variables to get rid of the square root in the denominator is 
y=B sin?ġ. Then dy = 2B sin ġ cos ọ dọ, and Eq. (6.249) simplifies to 


2B sin? dọ = + dx. (6.250) 


We can now use sin? = (1 — cos 2)/2 to integrate this. After multiplying 
through by 2, the result is B(2@ — sin2¢) = +2x — C, where C is a constant 
of integration. Now note that we can rewrite our definition of ø (which was 
y=B sin?) as 2y = B(1 — cos 2¢). If we then define 0 = 2¢, we have 


x=+a(0—sind)+d, y=a(1—cosé). (6.251) 


where a = B/2, and d = C/2. The particle starts at (x,y) = (0,0). Therefore, 0 
starts at 0 = 0, since this corresponds to y = 0. The starting condition x = 0 then 
implies that d = 0. Also, we are assuming that the wire heads down to the right, 
so we choose the positive sign in the expression for x. Therefore, we finally have 


x = a(0 — sin), y= a(l — cos), (6.252) 


as desired. This is the parametrization of a cycloid, which is the path taken by a 
point on the rim of a rolling wheel. The initial slope of the y(x) curve is infinite, 
as you can check. 


REMARK: The above method derived the parametric form in (6.252) from scratch. 
But since Eq. (6.252) was given in the statement of the problem, another route is to 
simply verify that this parametrization satisfies Eq. (6.248). To this end, assume that 
x = a(6 — sin 0) and y = a(1 — cos 8), which gives 


,_ dy _dy/d0 — sinð 
dx dx/dð® 1 -— cosð ` 


(6.253) 


Therefore, 


22], sin”0 2 2a 
~~" (d= cos)?  1—cos0 y 


which agrees with Eq. (6.248), with B = 2a. & 


, (6.254) 


SECOND SOLUTION: Let’s use a variational argument again, but now with y as the 
independent variable. That is, let the chain be described by the function x(y). The 


arclength is now given by ds = y 1 + x’? dy. Therefore, instead of the Lagrangian 
in Eq. (6.247), we have 


Vil+x?2 


Lo ~——. (6.255) 
J 


The Euler-Lagrange equation is 


d (aL _ aL = df 1 x! =i (6.256) 
dy \ ax’)  ðx dy \ J¥ J1 + x2 Ba ` 
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The zero on the right-hand side makes things nice and easy because it means that 
the quantity in parentheses is a constant. If we define this constant to be 1//B, 
then we can solve for x’ and then separate variables to obtain 


d 
vy ay =+dx. 
B-y 


(6.257) 
in agreement with Eq. (6.249). The solution proceeds as above. 


THIRD SOLUTION: The “Lagrangian” in the first solution above, which is given 
in Eq. (6.247) as 


vity? 
VW 


is independent of x. Therefore, in analogy with conservation of energy (which 
arises from a Lagrangian that is independent of f), the quantity 


_ ðL L= y? Vl +y? -1 
ay’ Jy 1 +y? JV Jy /1 +y? 


is constant (that is, independent of x). This statement is equivalent to Eq. (6.248), 
and the solution proceeds as above. 


L« ' (6.258) 


(6.259) 


Chapter 7 
Central forces 


A central force is by definition a force that points radially and whose magnitude 
depends only on the distance from the source (that is, not on the angle around the 
source).! Equivalently, we may say that a central force is one whose potential 
depends only on the distance from the source. That is, if the source is located at 
the origin, then the potential energy is of the form V (r) = V (r). Such a potential 
does indeed yield a central force, because 


dV. 
F(r) = VV (r) = -7 ô, (7.1) 


which points radially and depends only on r. Gravitational and electrostatic forces 
are central forces, with V (r) x 1/r. The spring force is also central, with V(r) « 
(r — £)?, where £ is the equilibrium length. 

There are two important facts concerning central forces: (1) they are ubiquitous 
in nature, so we had better learn how to deal with them, and (2) dealing with them 
is much easier than you might think, because crucial simplifications occur in the 
equations of motion when V is a function of r only. These simplifications will 
become evident in the following two sections. 


7.1 Conservation of angular momentum 


Angular momentum plays a key role in dealing with central forces because, as 
we will show, it is constant over time. For a point mass, we define the angular 
momentum L by 


L=r xp, (7:2) 


where the “cross product” is defined in Appendix B. L depends onr, so it therefore 
depends on where you pick the origin of your coordinate system. Note that L is 
a vector, and that it is orthogonal to both r and p, by nature of the cross product. 
You might wonder why we care enough about r x p to give it a name. Why not 
look at r3p°r x (r x p), or something else? The answer is that L has some very 
nice properties, one of which is the following. 


' Taken literally, the term “central force” would imply only the radial nature of the force. But a 
physicist’s definition also includes the dependence solely on the distance from the source. 
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Theorem 7.1 Jfa particle is subject to a central force only, then its angular 
momentum is conserved.* That is, 


dL 
If Vir) =V(r), then Fr 0. (7.3) 
Proof: We have 
dL d 
a at x p) 
dr = dp 
=— x rx — 
dt ” P dt 
=v x (mv)+rxF 
=0, (1.4) 
because F « r, and the cross product of two parallel vectors is zero. a 


We’ll prove this theorem again in the next section, using the Lagrangian 
method. Let’s now prove another theorem which is probably obvious, but good 
to show anyway. 


Theorem 7.2 Jfa particle is subject to a central force only, then its motion 
takes place in a plane. 


Proof: Ata given instant fo, consider the plane P containing the position vec- 
tor ro (with the source of the potential taken to be the origin) and the velocity 
vector vo. We claim that r lies in P at all times.* This is true because P is 
defined as the plane orthogonal to the vector no = ro x Vo. But in the proof of 
Theorem 7.1, we showed that the vector r x v = (r x p)/m does not change with 
time. Therefore, r x v = no for all t. Since r is certainly orthogonal to r x v, we 
see that r is orthogonal to no for all t. Hence, r must always lie in P. a 


An intuitive look at this theorem is the following. Since the position, velocity, 
and acceleration (which is proportional to F, which in turn is proportional to the 
position vector r) vectors initially all lie in P, there is a symmetry between the 
two sides of P. Therefore, there is no reason for the particle to head out of P on 
one side rather than the other. The particle therefore remains in P. We can then 
use this same reasoning again a short time later, and so on. 

This theorem shows that we need only two coordinates, instead of the usual 
three, to describe the motion. But since we’re on a roll, why stop there? We’ll 
show below that we really need only one coordinate. Not bad, three coordinates 
reduced down to one. 


2 This is a special case of the fact that torque equals the rate of change of angular momentum. We’ll 
talk about this in great detail in Chapter 8. 

3 The plane P is not well defined if v9 = 0, or if ro = 0, or if vo is parallel to rọ. But in these cases, 
you can quickly show that the motion is always radial, which is even more restrictive than planar. 


7.2 The effective potential 


7.2 The effective potential 


The effective potential provides a sneaky and useful method for simplifying a 
three-dimensional central-force problem down to a one-dimensional problem. 
Here’s how it works. Consider a particle of mass m subject to a central force 
only, described by the potential V(r). Let r and @ be the polar coordinates in 
the plane of the motion. In these polar coordinates, the Lagrangian (which we’ ll 
label as “£”, to save “L” for the angular momentum) is 


L= Lm? +7767) — V(r). (7.5) 


The equations of motion obtained from varying r and 0 are 


mr = mré* — V' (r), 
d ! (7.6) 
qn ®) =0. 


Since —V'(r) equals the force F(r), the first of these equations is the radial 
F = ma equation, complete with the centripetal acceleration, in agreement with 
the first of Eqs. (3.51). The second equation is the statement of conservation of 
angular momentum, because mr?6 = r(mr6) = rpo (where po is the momentum 
in the angular direction) is the magnitude of L = r x p, from Eq. (B.9). We 
therefore see that the magnitude of L is constant. And since the direction of L is 
always perpendicular to the fixed plane of the motion, the vector L is constant in 
time. We have therefore just given a second proof of Theorem 7.1. In the present 
Lagrangian language, the conservation of L follows from the fact that 6 is a 
cyclic coordinate, as we saw in Example 2 in Section 6.5.1. Since mr76 does not 
change with time, let us denote its constant value by 


L=mr’6. (7.7) 


L is determined by the initial conditions. It can be specified, for example, by 
giving the initial values ofr and 6. Using Ë = L/(mr?), we can eliminate 6 from 
the first of Eqs. (7.6). The result is 


. LE , 
mr = — — V (r). (7.8) 


Multiplying by 7 and integrating with respect to time yields 


al aroen (7.9) 
o 2mr2 ast dimes ` 


where Æ is a constant of integration. E is simply the energy, which can be seen 
by noting that this equation could also have been obtained by using Eq. (7.7) to 
eliminate 6 in the energy equation, (m/2)(#? + r78?) + V(r) = E. 
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Equation (7.9) is rather interesting. It involves only the variable r. And it 
looks a lot like the equation for a particle moving in one dimension (labeled by 
the coordinate r) under the influence of the potential, 


L2 
Veter) = — 54+ V(r). (7.10) 


2mr2 


The subscript “eff” here stands for “effective.” Vefe (r) is called the effective 
potential. The “effective force” is easily read off from Eq. (7.8) to be 


I 
Fætt) = — -V'@), (7.11) 
mr 
which agrees with Fer = Vie (1), as it should. This “effective” potential 


concept is a marvelous result and should be duly appreciated. It says that if 
we want to solve a two-dimensional problem (which could have come from a 
three-dimensional problem) involving a central force, then we can recast the 
problem into a simple one-dimensional problem with a slightly modified poten- 
tial. We can forget that we ever had the variable 6, and we can solve this 
one-dimensional problem (as we’ll demonstrate below) to obtain r(t). Hav- 
ing found r(t), we can use A(t) = L/mr? to solve for 6(t) (in theory, at 
least). This whole procedure works only because there is a quantity involving 
r and @ (or rather, Ê) that is independent of time. The variables r and @ are 
therefore not independent, so the problem is really one-dimensional instead of 
two-dimensional. 

To get a general idea of how r behaves with time, all we have to do is draw the 
graph of Veg (r). Consider the example where V(r) = Ar. This is the potential 
for a spring with relaxed length zero. Then 


2 


L 
Vere (r) = ma + Ar’. (7.12) 


To plot Veg (r), we must be given L (determined by the initial conditions), along 
with A and m (determined by the system we’re dealing with). But the general 
shape looks like the curve in Fig. 7.1. The energy E (determined by the initial 
conditions), which must be given too, is also drawn. The coordinate r bounces 
back and forth between the turning points, rı and r2, which satisfy Vefe (71,2) = 
E.* This is true because it is impossible for the particle to be located at an r 
for which E < Vege, since Eq. (7.9) would then imply an imaginary value for +. 
If E equals the minimum of Ver (r), then rı = r2, so r is stuck at this one value, 
which means that the motion is a circle. 


REMARK: The L7/2mr? term in the effective potential is sometimes called the angular momen- 
tum barrier. Ithas the effect of keeping the particle from getting too close to the origin. Basically, 
the point is that L = mr?6 is constant, so as r gets smaller, Ô gets bigger. But Ê increases at a 


4 Tt turns out that for our Ar? spring potential, the motion in space is an ellipse, with semi-axis lengths 
rı and rz (see Problem 7.5). But for a general potential, the motion isn’t so nice. 


7.3 Solving the equations of motion 


greater rate than r decreases, due to the square of the r in L = mr26. So eventually we end up 
with a tangential kinetic energy, mr?ĝ? /2, that is greater than what is allowed by conservation 
of energy.° 


As he walked past the beautiful belle, 
The attraction was easy to tell. 

But despite his persistence, 

He was kept at a distance 

By that darn conservation of L! & 


Note that it is by no means necessary to introduce the concept of the effective 
potential. You can simply solve the equations of motion in Eq. (7.6) as they are. 
But introducing V.¢ makes it much easier to see what’s going on ina central-force 
problem. 


When using potentials, effective, 
Remember the one main objective: 

The goal is to shun 

All dimensions but one, 

And then view things with 1-D perspective. 


7.3 Solving the equations of motion 


If we want to be quantitative, we must solve the equations of motion in Eq. (7.6). 
Equivalently, we must solve their integrated forms, Eqs. (7.7) and (7.9), which 
are the conservation of L and E statements, 


mr?6 =L, 

=m — +VN=E. 

2” 7 2mr2 a a 
The word “solve” is a little ambiguous here, because we should specify what 
quantities we want to solve for in terms of what other quantities. There are 
essentially two things we can do. We can solve for r and 0 in terms of t. Or 
we can solve for r in terms of 0. The former has the advantage of immedi- 
ately yielding velocities and, of course, the information of where the particle 
is at time ¢. The latter has the advantage of explicitly showing what the trajec- 
tory looks like in space, even though we don’t know how quickly it is being 
traversed. We’ll deal mainly with this latter case, particularly when we dis- 
cuss the gravitational force and Kepler’s laws below. But let’s look at both 
cases now. 


> This argument doesn’t hold if V(r) goes to —oo faster than —1/r?. You can see this by draw- 
ing the graph of Vege(r), which heads to —oo instead of +00 as r — 0. V(r) decreases 
fast enough to allow for the increase in kinetic energy. But such potentials don’t come up that 
often. 
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7.3.1 Finding r(t) and 6(f) 
The value of 7 at any point is found from Eq. (7.13) to be 


dr 2 / L? 
=+ E V(r). (7.14) 


2mr2 


To get an actual r(t) out of this, we must be supplied with £ and L (determined 
by the initial values of r, 7, and ô), and also the function V(r). To solve this 
differential equation, we “simply” have to separate variables and then (in theory) 
integrate: 


dr 2 2 
/ Z + f [2a = fa — to). (7.15) 
JE- P m m 


2mr2 V(r) 


We need to evaluate this (rather unpleasant) integral on the left-hand side, to 
obtain ¢ as a function of r. Having found t(r), we can then (in theory) invert the 
result to obtain r(t). Finally, substituting this r(¢) into the relation 6= L/mr? 
from Eq. (7.13) gives 6 as a function of t, which we can (in theory) integrate to 
obtain 0 (t). 

As you might have guessed, this procedure has the potential to yield some 
stress. For most V (r}’s, the integral in Eq. (7.15) is not calculable in closed form. 
There are only a few “nice” potentials V (r) for which we can evaluate it. And 
even in those cases, the remaining tasks are still a pain. But the good news 
is that these “nice” potentials are precisely the ones we are most interested in. 
In particular, the gravitational potential, which goes like 1/r and which we will 
concentrate on during the remainder of this chapter, leads to a calculable integral 
(the spring potential ~ r? does also). However, having said all this, we’re not 
going to apply this procedure to gravity. It’s nice to know that it exists, but we 
won’t be doing anything else with it. Instead, we’ll use the following strategy to 
solve for r as a function of 6. 


7.3.2 Finding r(@) 


We can eliminate the dt from Eqs. (7.13) by getting the 7? term alone on the left 
side of the second equation, and then dividing by the square of the first equation. 
The dt? factors cancel, and we obtain 


(3%) 2mE 1 2mV(r) 
= . (7.16) 


r? do L? r? I 
We can now (in theory) take a square root, separate variables, and integrate to 
obtain 6 as a function ofr. We can then (in theory) invert to obtain r as a function 


6 Of course, if you run out of patience or hit a brick wall, you always have the option of doing things 
numerically. See Section 1.4 for a discussion of this. 


7.4 Gravity, Kepler’s laws 


of 6. To do this, we must be given the function V (r). So let’s now finally give 
ourselves a V(r) and do a problem all the way through. We’ll study the most 
important potential of all, or perhaps the second most important one, gravity.’ 


7.4 Gravity, Kepler’s laws 
7.4.1 Calculation of r(@) 


Our goal in this subsection is to obtain r as a function of 0, for a gravitational 
potential. Let’s assume that we’re dealing with the earth and the sun, with masses 
Mo and m, respectively. The gravitational potential energy of the earth-sun 
system is 


Vir) =—~, wherea = GMom. (7.17) 
r 


In the present treatment, we’ll consider the sun to be bolted down at the origin of 
our coordinate system. Since Mo > m, this is approximately true for the earth- 
sun system. (If we want to do the problem exactly, we must use the reduced mass, 
which is the topic of Section 7.4.5.) Equation (7.16) becomes 


ldr\ mE 1 2 
( =) <9 ma (7.18) 


r? do L? r2 


As stated above, we could take a square root, separate variables, integrate to 
find 0 (r), and then invert to find r(@). This method, although straightforward, is 
rather messy. So let’s solve for r (8) in a slick way. 

With all the 1/r terms floating around, it might be easier to solve for 1/r instead 
ofr. Using d(1/r)/d@ = —(dr/d6)/r*, and letting y = 1/r for convenience, 
Eq. (7.18) becomes 


dy = 2ma 2mE 
(2) Say + et (7.19) 


At this point, we could also use the separation-of-variables technique, but let’s 
continue to be slick. Completing the square on the right-hand side gives 


dy : ma\?  2mE ma \? 
a) VR + T2 + 77): (7.20) 
Defining z = y — ma/L? for convenience yields 
dz \? 2 ma \* 2EL? 2 2 
= 1 =— B 7.21 
(Ga) =~ + (5) (eae oa 


7 The two most important potentials in physics are certainly the gravitational and harmonic-oscillator 
ones. They both lead to doable integrals, and interestingly both lead to elliptical orbits. 
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where 


ma 


_ (ma 2EL? 
B= T2 1+ a (7.22) 


At this point, in the spirit of being slick, we can just look at Eq. (7.21) and observe 
that 


z = B cos(0 — 0) (7.23) 


is the solution, because cos?x + sin?x = 1. But lest we feel guilty about not 
doing separation-of-variables at least once in this problem, let’s solve Eq. (7.21) 
that way, too. The integral is nice and doable, and we have 


Z 


dz “4 
i = fa — cos (5) = @ =, (7.24) 


which gives z = Bcos(@ — 00). It is customary to pick the axes so that 6) = 0, 
so we’ll drop the @ from here on. Recalling our definition z = 1/r — ma/L? 
and also the definition of B from Eq. (7.22), Eq. (7.23) becomes 


Des ache crane (7.25) 
es € j 
T cos 0), 
where 
2EL2 
esir (1.26) 
Ma 


is the eccentricity of the particle’s motion. We will see shortly exactly what 
€ signifies. 

This completes the derivation of r(@) for the gravitational potential, 
V(r)axl/r. It was a little messy, but not unbearably painful. At any rate, 
we just discovered the basic motion of objects under the influence of gravity, 
which takes care of virtually all of the gazillion tons of stuff in the universe. Not 
bad for one page of work. 


Newton said as he gazed off afar, 
“From here to the most distant star, 
These wond’rous ellipses 

And solar eclipses 


All come from a 1 over r.” 


What are the limits on r in Eq. (7.25)? The minimum value of r is obtained 
when the right-hand side reaches its maximum value, which is (ma JLX 1+e). 
Therefore, 


1 


Fmin 


7.4 Gravity, Kepler’s laws 


What is the maximum value of r? The answer depends on whether e€ is greater 
than or less than 1. Ife < 1 (which corresponds to circular or elliptical orbits, as 
we'll see below), then the minimum value of the right-hand side of Eq. (7.25) is 
(ma/L*)(1 — €). Therefore, 
p 
Fmax = ———————— (if € < 1). (7.28) 
ma(1 — e€) 

Ife > 1 (whichcorresponds to parabolic or hyperbolic orbits, as we’ll see below), 
then the right-hand side of Eq. (7.25) can become zero (when cos@ = —1/e). 
Therefore, 


Tmax =00 (if e > 1). (7.29) 


7.4.2 The orbits 


Let’s examine in detail the various cases for e€. 


e Circle (€ = 0) 
Ife = 0, then Eq. (7.26) says that E = —ma? /2L?. The negative E means that the poten- 
tial energy is more negative than the kinetic energy is positive, so the particle is trapped 
in the potential well. Equations (7.27) and (7.28) give rmin = max = L*/ma. There- 
fore, the particle moves in a circular orbit with radius L? /ma. Equivalently, Eq. (7.25) 
says that r is independent of 0. 

Note that it isn’t necessary to do all the work of Section 7.4.1 if we just want to look at 
circular motion. For a given L, the energy —ma? / 2L? is the minimum value that the E 
given by Eq. (7.13) can take. This is true because to achieve the minimum, we certainly 
want 7 = 0. And you can show that minimizing the effective potential, i? / 2mr? — a [r 
yields this value for E. If we plot Veff (r), we have the situation shown in Fig. 7.2. 
The particle is trapped at the bottom of the potential well, so it has no motion in the r 
direction. 

e Ellipse (0 < € < 1) 


IfO < € < 1, then Eq. (7.26) says that —ma?/2L* < E < 0. Equations (7.27) and 
(7.28) give rmin and rmax. It isn’t obvious that the resulting motion is an ellipse. We’ll 
demonstrate this below. 

If we plot Vere (r), we have the situation shown in Fig. 7.3. The particle oscillates 
between 7min and “max. The energy is negative, so the particle is trapped in the potential 
well. 

e Parabola (€ = 1) 


Ife = 1, then Eq. (7.26) says that E = 0. This value of E implies that the particle 
barely makes it out to infinity (its speed approaches zero as r —> oo). Equation (7.27) 
gives Tin = I? /2ma, and Eq. (7.29) gives rmax = 00. Again, it isn’t obvious that 
the resulting motion is a parabola. We’ll demonstrate this below. 


Fig. 7.3 
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If we plot Vege (r), we have the situation shown in Fig. 7.4. The particle does not 
oscillate back and forth in the r direction. It moves inward (or possibly not, if it was 
initially moving outward), turns around at rin = r /2ma, and then heads out to 
infinity forever. 

e Hyperbola (€ > 1) 


Ife > 1, then Eq. (7.26) says that E > 0. This value of E implies that the particle 
makes it out to infinity with energy to spare. The potential goes to zero as r —> 00, so 
the particle’s speed approaches the nonzero value ./2E/m as r —> oo. Equation (7.27) 
gives rin, and Eq. (7.29) gives rmax = 00. Again, it isn’t obvious that the resulting 
motion is a hyperbola. We’ll demonstrate this below. 

If we plot Vege (r), we have the situation shown in Fig. 7.5. As in the parabola case, 
the particle does not oscillate back and forth in the r direction. It moves inward (or 
possibly not, if it was initially moving outward), turns around at rin, and then heads 


out to infinity forever. 


7.4.3 Proof of conic orbits 


Let’s now prove that Eq. (7.25) does indeed describe the conic sections stated 
above. We’ll also show that the origin (the source of the potential) is a focus 
of the conic section. These proofs are straightforward, although the ellipse and 
hyperbola cases get a bit messy. In what follows, we’ll find it easier to work with 
Cartesian coordinates. For convenience, let 


p 
= —. (7.30) 
ma 
Multiplying Eq. (7.25) through by kr, and using cos @ = x/r, gives 
k=r+ex. (7.31) 
Solving for r and squaring yields 
x+y? =k? — 2kex + €7x?. (7.32) 


Let’s look at the various cases for €. We will invoke without proof various facts 
about conic sections (focal lengths, etc.). 


e Circle (€ = 0) 


In this case, Eq. (7.32) becomes x? + y? = k?. So we have a circle with radius k = 
L? /ma, with its center at the origin (see Fig. 7.6). 
e Ellipse (0 < € < 1) 


In this case, Eq. (7.32) can be written (after completing the square for the x terms, and 
expending some effort) as 


2 
ke 
Xe 2 
( iM) y k k 
> 4 [=l], where a= ——~, and b= ———. 7.33 
a2 b2 1— e€? l-g ven 
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This is the equation for an ellipse with its center located at (—ke/(1 — e2), 0). The 
semi-major and semi-minor axes are a and b, respectively. And the focal length is 
c = ya? — b2 = ke/(1 — e°). Therefore, one focus is located at the origin (see 
Fig. 7.7). Note that c/a equals the eccentricity, €. 

e Parabola (€ = 1) 


In this case, Eq. (7.32) becomes y? = k?—2kx, which can be written asy? = —2k(x— kj, 
This is the equation for a parabola with vertex at (k/2,0) and focal length k/2. (The 


focal length of a parabola written in the form y = 4ax is a.) So we have a parabola ioe 
with its focus located at the origin (see Fig. 7.8). 
e Hyperbola (e€ > 1) y 
In this case, Eq. (7.32) can be written (after completing the square for the x terms) 
(x 7 iy j n k/2 F 
a ag l Where a= z y’ DAT a (7.34) 


This is the equation for a hyperbola with its center (defined to be the intersection of the 
asymptotes) located at (ke/(€— 1), 0). The focal lengthisc = va? + b? = ke/(€— 1). 
Therefore, the focus is located at the origin (see Fig. 7.9). Note that c/a equals the 
eccentricity, €. 

The impact parameter (usually denoted by the letter b) of a trajectory is defined to be 
the closest distance to the origin the particle would achieve if it moved in the straight line 
determined by its initial velocity far from the origin (that is, along the dotted line in the 
Fig. 7.9). You might think that choosing the letter b here would cause a problem, because 
we already defined b in Eq. (7.34). However, it turns out that these two definitions are 
identical (see Exercise 7.14), so all is well. 

Equation (7.34) actually describes an entire hyperbola, that is, it also describes a 


branch that opens up to the right with its focus located at (2ke/ (e? — 1), 0). However, 
this right branch was introduced in the squaring operation that produced Eq. (7.32). It 
isn’t a solution to the original equation we wanted to solve, Eq. (7.31). It turns out that Fig. 7-9 
the right-opening branch (or its reflection across the y axis, depending on your sign 
convention for B and e€) is relevant for a repulsive, instead of attractive, 1/r potential; 

see Exercise 7.21. 


7.4.4 Kepler's laws 


We can now, with minimal extra work, write down Kepler’s laws. Kepler (1571- 
1630) lived before Newton (1642-1727), so he didn’t have Newton’s laws at his 
disposal. Kepler arrived at his laws via observational data, which was a rather 
impressive feat. It was known since the time of Copernicus (1473—1543) that 
the planets move around the sun, but it was Kepler who first gave a quantitative 
description of the orbits. Kepler’s laws assume that the sun is massive enough so 
that its position is essentially fixed in space. This is a very good approximation, 
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but the following subsection on the reduced mass shows how to modify the laws 
and solve things exactly. 


e First law: The planets move in elliptical orbits with the sun at one focus. 
We proved this in Eq. ( 7.33).° There are undoubtedly also objects flying past the sun in 
hyperbolic orbits, but we don’t call these things planets, because we never see the same 
one twice. 

e Second law: The radius vector to a planet sweeps out area at a rate that is independent 
of its position in the orbit. 

This law is nothing other than a fancy way of stating conservation of angular 
momentum. The area swept out by the radius vector during a short period of time 
is dA = r(rd@)/2, because r d@ is the base of the thin triangle in Fig. 7.10. Therefore, 
we have (using L = mr26) 

dd Ê L 


u ca = ag (7.35) 
which is constant, because Z is constant for a central force. This quick proof is 
independent of all the work we did in Sections 7.4.1—7.4.3. 

e Third law: The square of the period of an orbit, T, is proportional to the cube of the 
semi-major-axis length, a. More precisely, 

2 41703 


[7 = F 7.36 
ae (7.36) 


where Mg is the mass of the sun. Note that the planet’s mass m doesn’t appear in this 
equation. 


Proof: Integrating Eq. (7.35) over the time of a whole orbit gives 


at 


A=—. 7.37 
2m ( ) 


But the area of an ellipse is A = mab, where a and b are the semi-major and semi-minor 
axes, respectively. Squaring Eq. (7.37) and using Eq. (7.33) to write b = ay 1 — e? 


gives 
L? T2 
mat = (m) — (7.38) 


We have grouped the right-hand side in this way because we can now use the L? = mak 
relation from Eq. (7.30) to transform the term in parentheses into wk/(1 — e?) = aa, 
where a is given in Eq. (7.33). But waa = (GMem)a, so we obtain 


2 
wat = (GMeoma)T 
4m 
which gives Eq. (7.36), as desired. | 


(7.39) 


For an alternate geometrical proof due to Feynman (and also Maxwell), see Goodstein and 
Goodstein (1996). 


7.4 Gravity, Kepler’s laws 


These three laws describe the motion of all the planets (and asteroids, comets, 
and such) in the solar system. But our solar system is only the tip of the iceberg. 
There’s a lot of other stuff out there, and it’s all governed by gravity (although 
Newton’s inverse square law must be supplanted by Einstein’s General Relativity 
theory of gravitation). There’s a whole universe around us, and as time passes on 
we see and understand more and more of it, both experimentally and theoretically. 
In recent years, we’ve even begun to look for friends we might have out there. 
Why? Because we can. There’s nothing wrong with looking under the lamppost 
now and then. It just happens to be a very big one in this case. 


As we grow up, we open an ear, 
Exploring the cosmic frontier. 
In this coming of age, 

We turn in our cage, 


All alone on a tiny blue sphere. 


7.4.5 Reduced mass 


We assumed in Section 7.4.1 that the sun is large enough so that it is only 
negligibly affected by the presence of the planets. That is, it is essentially 
fixed at the origin. But how do we solve a problem in which the masses of 
the two interacting bodies are comparable in size? Equivalently, how do we 
solve the earth-sun problem exactly? It turns out that the only modification 
required is a replacement of the earth’s mass with the reduced mass, defined 
below. The following discussion actually holds for any central force, not just 
gravity. 

The Lagrangian of a general central-force system consisting of the interacting 
masses mı and mz is 


1, l- 
L= zmii + simak — V (iri = r2)). (7.40) 
We have written the potential in this form, dependent only on the distance |r; —ro|, 
because we are assuming a central force. Let us define 


mY, + mr? 
mı + m2 


R 


, and r=r]-—r?. (7.41) 


R andr are the position of the center of mass and the vector between the masses, 
respectively. Invert these equations to obtain 


m2 mı 
ry =R+ M” and r2 =R— M” (7.42) 
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where M = mı + m is the total mass of the system. In terms of R and r, the 
Lagrangian becomes 


1 r m2, 2 1 . mı. p) 
f= sim (R+ Tzi) + 5m (R- ai) -v (lr) 


l os 1 mm? 2 
=-MR? + (PP 
2 HA o) 


1, l 
= 5M¢R + 5 He — V(r), (7.43) 


where the reduced mass, u, is defined by 


1 1 1 

— = — + —. (7.44) 
hm ç m 

We now note that the Lagrangian in Eq. (7.43) depends on R, but not on R. 

Therefore, the Euler-Lagrange equations say that R is constant. That is, the CM 

moves at constant velocity (this is just the statement that there are no external 

forces). The CM motion is therefore trivial, so let’s ignore it. Our Lagrangian 

therefore becomes 


L—> sue — V(r). (7.45) 


But this is simply the Lagrangian for a particle of mass u that moves around a 
fixed origin under the influence of the potential V (r). For gravity, we have 


1 

L= spi? + Z (where v = GMom). (1.46) 
r 

To solve the earth-sun system exactly, we therefore just need to replace (in the 

calculation in Section 7.4.1) the earth’s mass, m, with the reduced mass, ju, 

given by 


1 1 
ra + 7A f (7.47) 
The resulting value of r in Eq. (7.25) is the distance between the earth and sun. 
The earth and sun are therefore distances of (Mo /M)r and (m/M)r, respectively, 
away from the CM, from Eq. (7.42). These distances are just scaled-down versions 
of the distance r, which represents an ellipse, so we see that the earth and sun 
move in elliptical orbits (whose sizes are in the ratio Mo/m) with the CM as a 
focus. Note that the m’s that are buried in Z and € in Eq. (7.25) must be changed 
to w’s. But « is still defined to be GMom, so the m in this definition does not get 
replaced with u. 

For the earth-sun system, the u in Eq. (7.47) is essentially equal to m, because 
Mo isso large. Using m = 5.97 - 1074 kg, and Mo = 1.99 - 10° kg, we find that 


7.4 Gravity, Kepler’s laws 


u is smaller than m by only one part in 3 - 10°. Our fixed-sun approximation is 
therefore a very good one. You can show that the CM is about 5 - 10° m from 
the center of the sun, which is well inside the sun (about a thousandth of the 
radius). 

How are Kepler’s laws modified when we solve for the orbits exactly using 
the reduced mass? 


e First law: The elliptical statement in the first law is still true, but with the CM 
(not the sun) located at a focus. The sun also travels in an ellipse with the CM at 
a focus.” Whatever is true for the earth must also be true for the sun, because they 
come into Eq. (7.43) symmetrically. The only difference is in the size of various 
quantities. 

e Second law: In the second law, we need to consider the position vector from the CM 
(not the sun) to the earth. This vector sweeps out equal areas in equal times, because the 
angular momentum of the earth (and the sun, too) relative to the CM is constant. This 
is true because the gravitational force always points through the CM, so the force is a 
central force with the CM chosen as the origin. 

e Third law: The period of the earth’s orbit (and the sun’s, too) is the same as the period of 
the orbit of our hypothetical particle of mass u orbiting around a fixed origin under the 
influence of the potential —a/r = —GMom/r. This is true because the radius vectors in 
all three of these systems are always in the same ratio. To find the period of the particle’s 
orbit, we can repeat the derivation leading up to Eq. (7.39). But in that equation, the m 
on the bottom is replaced by jz, while the m on the top remains m, because this is the m 


that appears in a. Therefore, we obtain!" 


4r? u Ana? 
=—“ =F, (7.48) 


T? = —— + 
GMom GM 


where we have used y = Mom/(Mo + m) 


Mom/M. This result is symmetric in 
Mo and m, as it must be because if we interchange the labels of Me and m, we still 
have the same system. And it also correctly reduces to Eq. (7.36) when Mo > m. 

If you want to write Eq. (7.48) in terms of the semi-major axis of the earth’s elliptical 
orbit, which is ag = (Mo/M)ap, then just plug in a, = (M/Mo)ag to obtain 


72 — 4 M/Mo ak _ O (7.49) 


GM MŽ GMo : 


Let’s perform a check on this formula by considering the special case of equal masses 


m orbiting around the same circular path of radius r (at diametrically opposite points) 


? Well, this statement is true only if there is just one planet. With many planets, the tiny motion of 
the sun is very complicated. This is perhaps the best reason to work in the approximation where 
it is essentially bolted down. 

10 We have put the subscript jz on the length a to remind us that it is the semi-major axis of our 
hypothetical particle’s orbit, and not the semi-major axis of the earth’s orbit. 
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with their CM at the center of the circle. For this simple system, we can solve for the 
period from scratch using F = ma: 


2 2 2 2 
2nr/T 
mes = Gm m( zr ) = Gm = T2 = š 
r (2r)? r (2r)? Gm 


which agrees with Eq. (7.49), in different notation, when M = 2Mo. 


REMARK: There’s actually a fairly quick way to see where the M? /M3 factor in Eq. (7.49) 
comes from. Imagine a new system where the earth’s orbit has the same dimensions but 
where the (slightly) moving sun is replaced by a stationary mass bolted down at the location 
of the earth-sun CM. This new mass is now a fraction Mo/M as far away from the earth 
as the sun was. Therefore, since the gravitational force is proportional to 1/r?, if we make 
our new mass equal to (Mo /M)*Mo, then it will exert the same force on the earth that the 
sun exerted. So if mother earth has her eyes closed, she’ll never know the difference. The 
periods of these two systems must therefore be the same. But from Eq. (7.36), the period of 
the second system, which has a bolted-down mass, is 


2,3 
An“ ay, 


Taen E 
G- (Mo /M} Mọ’ 


(7.51) 


in agreement with Eq. (7.49). & 


7.5 Problems 


Section 7.2: The effective potential 


7.1. Exponential spiral « 


Given L, find the V (r) that leads to a spiral path of the form r = roe®®. 
Choose E to be zero. Hint: Obtain an expression for 7 that contains no 
6’s, and then use Eq. (7.9). 


7.2. Cross section «x 
A particle moves in a potential, V(r) = —C/(3r?). 

(a) Given L, find the maximum value of the effective potential. 

(b) Let the particle come in from infinity with speed vg and impact 
parameter b. In terms of C, m, and vo, what is the largest value of 
b (call it bmax) for which the particle is captured by the potential? 
In other words, what is the “cross section” for capture, 7r b2 ax» for 
this potential? 


7.3. Maximum L xxx 
A particle moves in a potential, V(r) = — Voe¥ 


(a) Given L, find the radius of the stable circular orbit. An implicit 
equation is fine. 

(b) It turns out that if L is too large, then no circular orbit exists. What 
is the largest value of L for which a circular orbit does in fact exist? 


7.5 Problems 


If ro is the radius of the circle in this cutoff case, what is the value 
of Vege (ro)? 


Section 7.4: Gravity, Kepler s laws 


7.4. r* potential xxx 


A particle of mass m moves in a potential given by V(r) = Br*. Let the 
angular momentum be L. 


(a) Find the radius, ro, of the circular orbit. 

(b) If the particle is given a tiny kick so that the radius oscillates 
around ro, find the frequency, w,, of these small oscillations in r. 

(c) What is the ratio of the frequency w, to the frequency of the 
(nearly) circular motion, w = 6? Give a few values of k for 
which the ratio is rational, that is, for which the path of the nearly 
circular motion closes back on itself. 


7.5. Spring ellipse «x« 
Aparticle moves ina V(r) = Br? potential. Following the general strategy 
in Sections 7.4.1 and 7.4.3, show that the particle’s path is an ellipse. 
7.6. B/r? potential «+« 


A particle is subject to a V(r) = B/r? potential. Following the general 
strategy in Section 7.4.1, find the shape of the particle’s path. You will 
need to consider various cases for £. 


7.7. Rutherford scattering xx» 


A particle of mass m travels in a hyperbolic orbit past a mass M, whose 
position is assumed to be fixed. The speed at infinity is vo, and the impact 
parameter is b (see Exercise 7.14). 


(a) Show that the angle through which the particle is deflected is 


d=n—2tan (ys) => b= ~<ot($) : (7.52) 


where y = va/GM. 

(b) Let do be the cross-sectional area (measured when the particle 
is initially at infinity) that gets deflected into a solid angle of size 
dQ at angle ¢.'' Show that 


do _ 1 
dQ 4y2 sin4($/2) | 


(7.53) 


1L The solid angle of a patch on a sphere is the area of the patch divided by the square of the sphere’s 
radius. So a whole sphere subtends a solid angle of 42 steradians (the name for one unit of solid 
angle). 
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This quantity is called the differential cross section. Note: the 
label of this problem, Rutherford scattering, actually refers to 
the scattering of charged particles. But since the electrostatic and 
gravitational forces are both inverse-square laws, the scattering 
formulas look the same, except for a few constants. 


7.6 Exercises 


Section 7.1: Conservation of angular momentum 


7.8. 


7.9. 


Wrapping around a pole « 


A puck of mass m sliding on frictionless ice is attached by a horizontal 
string of length £ to a thin vertical pole of radius R. The puck initially 
travels in (essentially) a circle around the pole at speed vg. The string 
wraps around the pole, and the puck gets drawn in and eventually hits 
the pole. What quantity is conserved during this motion? What is the 
puck’s speed right before it hits the pole? 


String through a hole « 


Ablock of mass m sliding on a frictionless table is attached to a horizontal 
string that passes through a tiny hole in the table. The block initially trav- 
els in a circle of radius £ around the hole at speed vg. If you slowly pull 
the string down through the hole, what quantity is conserved during this 
motion? What is the block’s speed when it is a distance r from the hole? 


Section 7.2: The effective potential 


7.10. 


Power-law spiral x» 

Given L, find the V (r) that leads to a spiral path of the form r = rod. 
Choose E to be zero. Hint: Obtain an expression for 7 that contains 
no 6’s, and then use Eq. (7.9). 


Section 7.4: Gravity, Kepler 5 laws 


7.11. 


Circular orbit « 


For a circular orbit, derive Kepler’s third law from scratch, using 
F = ma. 


. Falling into the sun x 


Imagine that the earth is suddenly (and tragically) stopped in its orbit, 
and then allowed to fall radially into the sun. How long will this take? 
Use data from Appendix J, and assume that the initial orbit is essentially 
circular. Hint: Consider the radial path to be part of a very thin ellipse. 


7.13. 


7.14. 


7.15. 


7.16. 


7.17. 


7.6 Exercises 


Intersecting orbits x» 


Two masses, m and 2m, orbit around their CM. If the orbits are circular, 
they don’t intersect. But if they are very elliptical, they do. What is the 
smallest value of the eccentricity for which they intersect? 


Impact parameter «« 


Show that the distance b defined in Eq. (7.34) and Fig. 7.9 is equal to 
the impact parameter. Do this: 


(a) Geometrically, by showing that b is the distance from the origin 
to the dotted line in Fig. 7.9. 

(b) Analytically, by letting the particle come in from infinity at speed 
vo and impact parameter b’, and then showing that the b in 
Eq. (7.34) equals b’. 


Closest approach xx 


A particle with speed vo and impact parameter b starts far away from a 
planet of mass M. 


(a) Starting from scratch (that is, without using any of the results from 
Section 7.4), find the distance of closest approach to the planet. 
(b) Use the results of the hyperbola discussion in Section 7.4.3 
to show that the distance of closest approach to the planet is 
k/(€ +1), and then show that this agrees with your answer to 
part (a). 
Skimming a planet +x 


A particle travels in a parabolic orbit in a planet’s gravitational field and 
skims the surface at its closest approach. The planet has mass density p. 
Relative to the center of the planet, what is the angular velocity of the 
particle as it skims the surface? 


Parabola L xx 


A mass m orbits around a planet of mass M in a parabolic orbit of the 
formy = x? /(4£), which has focal length £. Find the angular momentum 
in three different ways: 


(a) Find the speed at closest approach. 

(b) Use Eq. (7.30). 

(c) Consider the point (x, x? /4£), where x is very large. Find approx- 
imate expressions for the speed and impact parameter at this 
point. 


. Circle to parabola «x 


A spaceship travels in a circular orbit around a planet. It applies a sudden 
thrust and increases its speed by a factor f. If the goal is to change the 
orbit from a circle to a parabola, what should f be if the thrust points in 
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the tangential direction? Is your answer any different if the thrust points 
in some other direction? What is the distance of closest approach if the 
thrust points in the radial direction? 


7.19. Zero potential «x 


A particle is subject to a constant potential, which we will take to be zero 
(equivalently, consider the a = GMm = 0 limit). Following the general 
strategy in Section 7.4, show that the particle’s path is a straight line. 


7.20. Ellipse axes +» 


Taking it as given that Eq. (7.25) describes an ellipse for 0 < € < 1, 
calculate the lengths of the semi-major and semi-minor axes, and show 
that the results agree with Eq. (7.33). 


7.21. Repulsive potential «x 


Consider an “anti-gravitational” potential (or more mundanely, the elec- 
trostatic potential between two like charges), V(r) = a/r, wherea > 0. 
What is the basic change in the analysis of Section 7.4? Show that 
circular, elliptical, and parabolic orbits do not exist. Draw the figure 
analogous to Fig. 7.9 for the hyperbolic orbit. 


7.7 Solutions 


7.1. Exponential spiral 
The given information r = rae” yields (using 6 = L/mr?) 


5 aby À L aL 
i = a(roe™ 0 = ar| — |= —. (7.54) 
mr mr 
Plugging this into Eq. (7.9) gives 
m (aL\? ; I LV) =E=0 (7.55) 
2\mr) ma TOE f 
Therefore, 
(1 +a?)L? 
V(r) =-—> (7.56) 


7.2. Cross section 
(a) The effective potential is 


L2 
me (7.57) 


Pe) = ial — 378 


Setting the derivative equal to zero gives r = mC/L?. Plugging this into 
Vere (r) gives 
L6 


Vee = aaa: (7.58) 


(b) If the energy E of the particle is less than V2", then the particle will reach a 
minimum value of r, and then head back out to infinity (see Fig. 7.11). If E is 
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greater than V2", then the particle will head all the way in to r = 0, never to 
return. The condition for capture is therefore Vi“ < E. Using L = mvob and 
E =E = mvg /2, this condition becomes 


(muob)®§ ma? 3c2 \* 
0 0 
< b < = Dax. (7.59) 
6m? C? 2 (5) 
The cross section for capture is therefore 
1/3 
3C? 
o = mbia = “( > ) ; (7.60) 
mvp 


It makes sense that this should increase with C and decrease with m and vp. 
7.3. Maximum L 


(a) The effective potential is 


r 72,2 
Vere (r) = ——z — oe . (7.61) 
2mr 
A circular orbit exists at the value(s) of r for which V/,,(r) = 0. Setting the Veer (r) 


derivative equal to zero and solving for L? gives 
L = QmVor2)r4e7?"”. (7.62) 


This implicitly determines r. As long as L isn’t too large, Veg (r) looks something 
like the graph in Fig. 7.12, although it doesn’t necessarily dip down to negative 
values; see the remark below. You can arrive at this picture by noting that for any 
L, Vefe (r) behaves like 1/ r? for both r —> 0 and r — oo; and for sufficiently 
small L, Vef (r) reaches negative values somewhere in between, due to the 
— Vo term. The curve must therefore look like the one shown, which has two Fig. 7.12 
locations where V aft (r) = 0. The smaller solution is the one with the stable orbit. 
However, if L is too large, then there are no solutions to Vle (r) = 0, because 
Vefe (r) decreases monotonically to zero (because L? /2mr? does so). We’ll be 
quantitative about this in part (b). 

(b) The function r4e" on the right-hand side of Eq. (7.62) has a maximum 
value, because it goes to zero for both r + 0 and r — oo. Therefore, there 
is a maximum value of L for which a solution for r exists. The maximum of 


2,2 
rte >T" occurs where 


drte" ) 
= dr 


a 2 
=e VP (47° + r*(—21°r)) => parte rê. 


0 = 
(7.63) 


Plugging ro into Eq. (7.62) gives 
2 8m Vo 
Lmax = Ree” (7.64) Vere (r) 


2 
max 


Plugging rp and Lz, into Eq. (7.61) gives 


Vo 
Vest (ro) = Z (for L = Lmax). (7.65) 


Note that this is greater than zero. For the L = Lmax case, the graph of Voge is 
shown in Fig. 7.13. This is the cutoff case between having a dip in the graph, 
and decreasing monotonically to zero. 


REMARK: A common error in this problem is to say that the condition fora Fig. 7.13 
circular orbit to exist is that Vege (r) < 0 at the point where Voge (r) is minimum. 
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The logic here is that since the goal is to have a well in which the particle can be 
trapped, it seems like we just need Vp to achieve a value less than the value at 
r = œ, namely 0. However, this gives the wrong answer (L2,,, = 2mVo/A7e, 


as you can show), because Vege (r) can look like the graph in Fig. 7.14. This has 
a local minimum with Vep (r) > 0. & 


7.4. r* potential 


(a) A circular orbit exists at the value of r for which the derivative of the effective 
potential (which is the negative of the effective force) is zero. This is simply the 
statement that the right-hand side of Eq. (7.8) equals zero, so that 7 = 0. Since 
V'(r) = Bkr*—!, Eq. (7.8) gives 


I2 lek! j . L 1/(k+2) 
o À (1.66) 


mr? mßk 


If k is negative, then 6 must also be negative if there is to be a real solution 
for ro. 

(b) The long method of finding the frequency is to set r(t) = ro + e(t), where 
€ represents the small deviation from the circular orbit, and to then plug this 
expression for r into Eq. (7.8). The result (after making some approximations) is 
a harmonic-oscillator equation of the form ë = —w?e. This general procedure, 
which was described in detail in Section 6.7, will work fine here (as you are 
encouraged to show), but let’s use an easier method. 

By introducing the effective potential, we have reduced the problem to a one- 
dimensional problem in the variable r. Therefore, we can make use of the result 
in Section 5.2, where we found in Eq. (5.20) that to find the frequency of small 
oscillations, we just need to calculate the second derivative of the potential. For 
the problem at hand, we must use the effective potential, because that is what 
determines the motion of the variable r. We therefore have 


yr 
w, = Vere o) : (7.67) 
\ m 


If you work through the r = ro + € method described above, you will find 
that you are basically calculating the second derivative of Veg, but in a rather 
cumbersome way. 

Using the form of the effective potential, we have 


3L? ; 1 3L 
Vero) = — + Bk(k — Ir? = zl + Bk(k bf?) (7.68) 
Mro ro m 


Using the ro from Eq. (7.66), this simplifies to 


V(k+2 V" (r LVk+2 
y". (ro) = (k +2) = ort ( 0) _ + f (7.69) 
off mra m mre 
0 0 


We could get rid of the ro here by using Eq. (7.66), but this form of œ, will be 
more useful in part (c). 
Note that we must have k > —2 for œ, tobe real. Ifk < —2, then Vle (ro) < 0, 

which means that we have a local maximum of Vett, instead of a local minimum. 
In other words, the circular orbit is unstable. Small perturbations grow, instead 
of oscillating around zero. 

(c) Since L = mrao for the circular orbit, we have wọ = 6 = L/(mr). Combining 
this with Eq. (7.69), we find 


UE NETR. (1.70) 


wo 
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k = -1/4 


A few values of k that yield rational values for this ratio are (the plots of the 
orbits are shown in Fig. 7.15): 


e k =—1 => w, /œ = 1: This is the gravitational potential. The variable r 
makes one oscillation for each complete revolution of the (nearly) circular 
orbit. 

e k = 2 => a,/w9 = 2: This is the spring potential. The variable r makes 
two oscillations for each complete revolution. 

e k = 7 => w,/w = 3: The variable r makes three oscillations for each 
complete revolution. 

ek 7/4 @;/@p = 1/2: The variable r makes half of an oscillation 
for each complete revolution. So we need to have two revolutions to get 
back to the same value ofr. 


There is an infinite number of k values that yield closed orbits. But note that 
this statement applies only to orbits that are nearly circular. Also, the “closed” 
nature of the orbits is only approximate, because it is based on Eq. (7.67) which 
is an approximate result based on small oscillations. The only k values that 
lead to exactly closed orbits for any initial conditions are k = —1 (gravity) and 
k = 2 (spring), and in both cases the orbits are ellipses. This result is known as 
Bertrand’s theorem; see Brown (1978). 


7.5. Spring ellipse 
With V (r) = Br?, Eq. (7.16) becomes 


l dr\? 2mE 1 2mBr? 

r? do L r? L 
As stated in Section 7.4.1, we could take a square root, separate variables, integrate to 
find 0 (r), and then invert to find r(@). But let’s solve for r(@) in a slick way, as we 
did for the gravitational case, where we made the change of variables, y = 1/r. Since 
there are lots of r? terms floating around in Eq. (7.71), it’s reasonable to try the change 
of variables, y = 7? or y = 1/r?. The latter turns out to be the better choice. So, using 
y = 1/r? and dy/d@ = —2(dr/dé)/r>, and multiplying Eq. (7.71) through by 1/77, 
we obtain 


(771) 


2 d0 I? i 


mE \* 2mB _(mé 2 
= y Le T2 T T . (7.72) 


Defining z = y — mE /L? for convenience, we have 


dz\? mE \* 26L? 
—) =—42 +4 1 
dé L? mE? 


—42? + 4B?. (1.73) 


C 2mEy , 2mp 
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7.6. 


As in Section 7.4.1, we can just look at this equation and observe that 
z = Bcos2(6 — 6) (7.74) 


is the solution. We can rotate the axes so that 6) = 0, so we’ll drop the ĝo from 
here on. Recalling our definition z = 1/ r? —mE/L? and also the definition of B from 
Eq. (7.73), Eq. (7.74) becomes 


1 mE 
z = zí + € cos 20), (7.75) 
where 
2BL? 
= — ; 7.76 
E mE? C6) 


It turns out, as we’ll see below, that € is not the eccentricity of the ellipse, as it was in 
the gravitational case. 

We will now use the procedure in Section 7.4.3 to show that Eq. (7.76) represents 
an ellipse. For convenience, let 


k=. (7.77) 


Multiplying Eq. (7.75) through by kr?, and using 
2 y 
B pe? 


cos 20 = cos’6 — sin?8 = (7.78) 


y) 


and also r* = x? + y, we obtain k = (x? + E(x? y 2), This can be written as 


2 2 
x y k | 
z + 7 = 1, where a= ja, and b= cs (7.79) 


This is the equation for an ellipse with its center located at the origin (as opposed to a 
focus located at the origin, as in the gravitational case). In Fig. 7.16, the semi-major 
and semi-minor axes are b and a, respectively, and the focal length isc = Vb? — a? = 


 2ke/(1 — €2). The eccentricity is c/b = /2€/(11 + ©). 


REMARK: Ife = 0, then a = b, which means that the ellipse is actually a circle. Let’s 
see if this makes sense. Looking at Eq. (7.76), we see that we want to show that circular 
motion implies 28L? = mE?. For circular motion, the radial F = ma equation is 
mv? /r = 2Br => v? = 26r? /m. The energy is therefore E = mv? /2 + Br? = 267". 
Also, the square of the angular momentum is L? = m?v?r? = 2mfr*. Therefore, 
2BL* = 2B (2mBr*) = m(2Br°)? = mE’, as we wanted to show. & 


b/r? potential 
With V (r) = B/r?, Eq. (7.16) becomes 


(44) 2mE 1 2mp 


r? do i ro rPI 
2mE 1 , 2mß 
T z 14 m) (7.80) 


Letting y = 1/r, and using dy/d0 = —(1/r?)(dr/d@), this becomes 


dy \? 2mE 2 
(2) Lay = k , where a a OE (1.81) 


7.7 Solutions 


We must now consider various possibilities for a”. These possibilities depend on how 
B compares to L?, which depends on the initial conditions of the motion. In what 
follows, note that the effective potential equals 


r B er 
Vert) = Imr? r2 mr 


(1.82) 


Case 1: a? > 0, or equivalently 6 > —L*/2m. In this case, the effective potential 
looks like the graph in Fig. 7.17. The solution for y in Eq. (7.81) is a trig function, 
which we will take to be a “sin” by appropriately rotating the axes. Using y = 1/r, we 


obtain 
1 1 /2mE . 
= VD sin 40. (7.83) 
r a 2 


0 = 0 and 0 = z/a make the right-hand side equal to zero, so they correspond to 
r = oo. And 0 = 7/2a makes the right-hand side maximum, so it corresponds to 
the minimum value of r, which is 7min = ay L?/2mE. This minimum r can also be 
obtained in a much quicker manner by finding where Vee (7) = E. 

Ifthe particle comes in from infinity at 9 = 0, it eventually heads back out to infinity 
at 0 = x /a. The angle that the outgoing path makes with the incoming path is therefore 
z/a. So if a is large (that is, if £ is large and positive, or if L is small), then the particle 
bounces nearly straight backward. If a is small (that is, if £ is negative and if £? is 
only slightly larger than —2m £), then the particle spirals around many times before it 
pops back out to infinity. 

A few special cases are: (1) 8 = 0 => a = 1, which means that the total angle 
is x, that is, there is no net deflection. In fact, the particle’s path is a straight line, 
because the potential is zero; see Exercise 7.19. (2) L? = —8m6/3 =—> a = 1/2, 
which means that the total angle is 27, that is, the final line of the particle’s motion 
is (anti)parallel to the initial line. These lines are shifted sideways from one another, 
with the separation depending on the initial conditions. 


Case 2: a = 0, or equivalently 8 = —L?/2m. In this case, the effective potential is 
identically zero, as shown in Fig. 7.18. Equation (7.81) becomes 
dy 2 _ 2mE (7.84) 
dd) P` 


The solution to this is y = 0,/2mE/L? + C, which gives 


SEa (1.85) 
r= =| — i 
OV 2mE’ 


where we have set the integration constant C equal to zero by choosing 0 = 0 to be 
the angle that corresponds to r = oo. Note that we can use 8 = —L?/2m to write r as 
r = (1/0)/—B/E. 

Since the effective potential is flat, the rate of change of r is constant. Therefore, if 
the particle has 7 < 0, it will reach the origin in finite time, even though Eq. (7.85) 
says that it will spiral around the origin an infinite number of times (because 6 —> oo 
asr — 0). 


Case 3: a? < 0, or equivalently 8 < —L*/2m. In this case, we have the situation 
shown in either Fig. 7.19 or Fig. 7.20, depending on the sign of E. For convenience, 
let b be the positive real number such that b? = —a?. Then Eq. (7.81) becomes 


dy\ 22  2mE 
- by = ; 7.86 
( Z) y QD (7.86) 
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Fig. 7.17 


Fig. 7.18 


Vere (r) ~ -1/72 


Vage (r) ~ -1/r2 


Fig. 7.20 
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The solution to this equation is a hyperbolic trig function. But we must consider two 
cases: 
e E > 0: Using the identity cosh?z — sinh?z = 1, and recalling y = 1/r, we see 
that the solution to Eq. (7.86) is!” 
1 1 /2mE 


~=3V 2 sinh b0. (7.87) 
F 2 


Unlike the a? > 0 case above, the sinh function has no maximum value. Therefore, 
the right-hand side can head to infinity, which means that r can head to zero, if 
the initial 7 is negative. It does so in a finite time, because + only becomes more 
negative as time goes by, from Fig. 7.19. For large z, we have sinhz ~ e7/2, so 
r heads to zero like e7”. The particle will therefore spiral around the origin an 
infinite number of times (because 0 —> œ as r — 0). 

e E <0: In this case, Eq. (7.86) can be rewritten as 


dy\>— 2m|E| 
By -| =) ==. 7.88 
y ( =) T2 (7.88) 
The solution to this equation is!* 
1 1 /2m|£| 
7 = 5 T2 cosh b8. (7.89) 


As in the sinh case, the cosh function has no maximum value. Therefore, the 
right-hand side can head to infinity, which means that r can head to zero. But in 
the present cosh case, the right-hand side does achieve a nonzero minimum value, 
when 6 = 0. So r achieves a maximum value (if the initial 7 is positive) equal to 
Tmax = bL?/2m|E|. This is clear from Fig. 7.20. This maximum r can also be 
obtained by simply finding where Ver (r) = E. After reaching max, the particle 
heads back down to the origin with behavior similar (for large 0) to the sinh case. 


7.7. Rutherford scattering 


(a) From Exercise 7.14, we know that the impact parameter b equals the distance b 
shown in Fig. 7.9. Therefore, Fig. 7.21 tells us that the angle of deflection (the 
angle between the initial and final velocity vectors) is 


o=n—21a0'(2). (7.90) 


a 


But from Eqs. (7.34) and (7.26), we have 


2 2 2 2 
boga 2 a _ bb ast 
a 


ma? m(GMm)? GM 


Substituting this into Eq. (7.90), with y = vw /(GM), gives the first expression 
in Eq. (7.52). Dividing by 2 and taking the cotangent of both sides then gives 


the second expression, 
1 
b=- cot( 2) ‘ (7.92) 
y 2 


It actually isn’t necessary to go through all the work of Section 7.4.3 to obtain 
this result via a and b. We can just use Eq. (7.25), which says that r — oo when 


12 More generally, we should write sinh(@ — 4) here. But we can eliminate the need for 49 by picking 
6 = 0 to be the angle that corresponds to r = oo. 

13 Again, we should write cosh(@ — 8o) here. But we can eliminate the need for 09 by picking 0 = 0 
to be the angle that corresponds to the maximum value of r. 
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cos@ —> —1/e. This then implies that the dotted lines in Fig. 7.21 have slope 
tan@ = Vsec2@ — 1 = Ve? — 1, which reproduces Eq. (7.91). 

(b) Imagine a wide beam of particles moving in the positive x direction toward 
the mass M. Consider a thin cross-sectional ring in this beam, with radius 
b and thickness db. Now consider a very large sphere centered at M. Any 
particle that passes through the cross-sectional ring of radius b will hit this 
sphere in a ring located at an angle ¢ relative to the x axis, with an angular 
spread of dø. The relation between db and dọ is found from Eq. (7.92). Using 
d(cot B)/dB = —1/sin*B, we have 


db 1 
| (7.93) 


do| 2y sin2($/2) ` 


The area of the incident cross-sectional ring is do = 27rb |db|. What is the solid 
angle subtended by a ring at angle @ with thickness dg? Taking the radius of 
the large sphere to be R (which will cancel out), the radius of the ring is R sin ¢, 
and the width is R |d@|. The area of the ring is therefore 27 (R sin ¢)(R |d|), 
and so the solid angle subtended by the ring is dQ = 27 sin ¢ |d@| steradians. 
Therefore, the differential cross section is 


do _—  2xb\db| b db 
sing] |do 


dQ 2x sing |dd| 
B ( (1/y) cot(g/2) ) ( l ) 
2sin(%/2) cos(/2) / \ 2y sin? (¢/2) 


1 
~ 4y2sint($/2) ` 


(1.94) 


REMARKS: What does this “differential cross section” result tell us? It tells us 
that if we want to find out how much cross-sectional area gets mapped into 
the solid angle dQ at the angle ¢ġ, then we can use Eq. (7.94) to say (recalling 
y = v3 /GM), 


G? M? G?M?m? 
o= am a2 => do = Pee ESET dQ, (7.95) 
4v sin" (¢/2) 16E* sin" (@/2) 

where we have used E = mua /2 to obtain the second expression. Let’s look at 
some special cases. If ø ~ 180° (that is, backward scattering), then the amount 
of area that gets scattered into a nearly backward solid angle of dQ equals 
do = (G?M?/4v}) dQ. If vo is small, then we see that do is large, that is, a 
large area gets deflected nearly straight backward. This makes sense, because 
with vo © 0, the orbit is essentially parabolic, which means that the initial and 
final velocities at infinity are (anti)parallel. (Ifyou release a particle from rest far 
away from a gravitational source, it will come back to you. Assuming it doesn’t 
bump into the source, of course.) If vo is large, then we see that do is small, that 
is, only a small area gets deflected backward. This makes sense, because the 
particle is more likely to fly past M without much deflection if it is moving fast, 
because the force has hardly any time to act. The particle needs to start with a 
very small b value (which corresponds to a very small area) in order to get close 
enough to M to allow there to be a large enough force to swing it around. 

Another special case is ¢ ~ 0, that is, negligible deflection. In this case, 
Eq. (7.95) tells us that the amount of area that gets scattered into a nearly forward 
solid angle of dQ is dø ~ oo. This makes sense, because if the impact parameter 
bis large (and there is an infinite cross-sectional area for which this is true), then 
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the particle will hardly feel the mass M, so it will continue to move essentially 
in a straight line. !4 

What if we consider the electrostatic force, instead of the gravitational force? 
What is the differential cross section in this case? To answer this, note that we 
can rewrite y as 


v 2(mvp/2) _ 2E 
GM GMm a` 


E (7.96) 
In the case of electrostatics, the force takes the form, Fe = kqıq2 /r?. This 
looks like the gravitational force, Fg = Gmjmy /r*, except that the constant 
a is now kqiqz2, instead of Gm mz. Therefore, the y in Eq. (7.96) becomes 
Ye = 2E/(kgiqz). Substituting this into Eq. (7.94), or equivalently replacing 
GMm by kqıq in Eq. (7.95), gives the differential cross section for electrostatic 
scattering, 


do _ aig 
dQ  16E2 sin4(¢/2) 


(7.97) 


This is the Rutherford-scattering differential-cross-section formula. Around 
1910, Rutherford and his students bombarded metal foils with alpha particles. 
Their results for the distribution of scattering angles were consistent with the 
above formula. In particular, they observed backward scattering of the alpha 
particles. Since the above formula is based on the assumption of a point source 
for the potential, this led Rutherford to his theory that atoms contained a dense 
positively charged nucleus, as opposed to being made of a spread-out “plum 
pudding” distribution of charge, which (as a special case of not yielding the 
correct distribution of scattering angles in general) doesn’t yield backward 
scattering. & 


14 Remember, all we care about is the angle here. So when you’ re picturing the large sphere of radius 
R centered at M, don’t say, “If b is large (for example, R/2), then a straight-line trajectory will hit 
the sphere at a large angle up above the x axis (for example, 30°).” This is incorrect. If you want 
to think in terms of a physical sphere of radius R, it is understood that R is infinitely large. Or 
more precisely, R >> b, for any impact parameter b you might choose. So even if b is “large,” it is 
still small compared with R, so a straight-line trajectory will hit the sphere of radius R at an angle 
that is essentially zero. Alternatively, you can just think in terms of angles, and not visualize an 
actual large sphere centered at M. 


Chapter 8 
Angular momentum, Part I 
(Constant L) 


The angular momentum of a point mass, relative to a given origin, is defined by 
L=rxp. (8.1) 


For a collection of particles, the total L is simply the sum of the L’s of 
all the particles. The vector r x p is a useful thing to study because it has 
many nice properties. One of these is the conservation law presented in The- 
orem 7.1, which allowed us to introduce the “effective potential” in Section 
7.2. And later in this chapter we will introduce the concept of torque, Tt, 
which appears in the bread-and-butter statement, t = dL/dt (analogous to 
Newton’s F = dp/dt law). 

There are two basic types of angular momentum problems in the world. Since 
the solution to any rotational problem invariably comes down to using t = dL /dt, 
as we will see, we must determine how L changes in time. And since L is a vector, 
it can change because (1) its length changes, or (2) its direction changes (or 
through some combination of these effects). In other words, if we write L = IL 
where L is the unit vector in the L direction, then L can change because L changes, 
or because L changes, or both. 

The first of these cases, that of constant L, is the easily understood one. 
Consider a spinning record, with the center chosen as the origin. The vector 
L = `r x pis perpendicular to the record, because every term in the sum has 
this property. If we give the record a tangential force in the proper direction, then 
it will speed up (in a precise way which we will soon determine). There is nothing 
mysterious going on here. If we push on the record, it goes faster. L points in the 
same direction as before, but now simply with a larger magnitude. In fact, in this 
type of problem, we can completely forget that L is a vector. We can deal just 
with its magnitude L, and everything will be fine. This first case is the subject of 
the present chapter. 

The second case, however, where L changes direction, can get rather con- 
fusing. This is the subject of the following chapter, where we will talk about 
twirling tops and other kinds of spinning objects that have a tendency to make 
one’s head spin too. In these situations, the entire point is that L is actually a 
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vector. And unlike in the constant-L case, we really have to visualize things in 
three dimensions to see what’s going on.! 

The angular momentum of a point mass is given by the simple expression in 
Eq. (8.1). But in order to deal with setups in the real world, which invariably 
consist of many particles, we must learn how to calculate the angular momentum 
of an extended object. This is the task of the Section 8.1. In this chapter, we’ll 
deal only with rotations around the z axis, or an axis parallel to the z axis. We’ll 
save the general 3-D motion for Chapter 9. 


8.1 Pancake object in x-y plane 


Consider a flat rigid body undergoing arbitrary motion (both translating and 
spinning) in the x-y plane; see Fig. 8.1. What is the angular momentum of this 
body, relative to the origin of the coordinate system?” If we imagine the body to 
consist of particles of mass m;, then the angular momentum of the entire body is 
the sum of the angular momenta of each m;, which is L; = r; x p;. So the total 
angular momentum is 


L=} r; x pi (8.2) 


For a continuous distribution of mass, we would have an integral instead of a 
sum. L depends on the locations and momenta of the masses. The momenta in 
turn depend on how fast the body is translating and spinning. Our goal here is 
to find the dependence of L on the distribution and motion of the constituent 
masses. The result will involve the geometry of the body in a specific way, as we 
will show. 

In this section, we will deal only with pancake-like objects that move in the 
x-y plane. We will calculate L relative to the origin, and we will also derive 
an expression for the kinetic energy. We will deal with non-pancake objects in 
Section 8.2. Note that since both the r and p ofall the masses in our pancake-like 
objects always lie in the x-y plane, the vector L = ` r x p always points in 
the z direction. As mentioned above, this fact is what makes these pancake cases 
easy to deal with. L changes only because its length changes, not its direction. 
So when we eventually get to the t = dL/dt equation, it will take on a simple 
form. Let’s first look at a special case, and then we’ll look at general motion in 


the x-y plane. 
' The difference between these two cases is essentially the same as the difference between the two 
basic F = dp/dt cases. The vector p can change because its magnitude changes, in which case 
we have F = ma (assuming that m is constant). Or, p can change because its direction changes, 
in which case we have the centripetal-acceleration statement, F = mv? /r. (Or there could be a 
combination of these effects.) The former case seems a bit more intuitive than the latter. 
Remember, L is defined relative to a chosen origin, because it has the vector r in it. So it makes 
no sense to ask what L is, without specifying what origin we’ve chosen. 


8.1 Pancake object in x-y plane 


8.1.1 Rotation about the z axis 


The pancake in Fig. 8.2 is pivoted at the origin and rotates with angular speed w 
around the z axis, in the counterclockwise direction (as viewed from above). 
Consider a little piece of the body, with mass dm and position (x,y). This 
little piece travels in a circle around the origin with speed v = wr, where 
r = x? + y*. Therefore, the angular momentum of this piece (relative to the 
origin) equals L = r x p = r(udm)z = dmr?w2. The 2 direction arises from 
the cross product of the (orthogonal) vectors r and p. The angular momentum of 
the entire body is therefore 


L= f Petan = fe + y?)w2z dm, (8.3) 


where the integration runs over the area of the body. If the density of the object 
is constant, as is usually the case, then we have dm = p dx dy. If we define the 
moment of inertia around the z axis to be 


L= f Pdm= [+n (8.4) 
then the z component of L is 
Lz = Lo, (8.5) 


and both Ly and L, are zero. In the case where the rigid body is made up of a 
collection of point masses m; in the x-y plane, the moment of inertia in Eq. (8.4) 
takes the discretized form, 


L=} mr}. (8.6) 
l 
Given any rigid body in the x-y plane, we can calculate /,. And given w, we 
can then multiply it by Z; to find Lz. In Section 8.3.1, we’ll get some practice 
calculating various moments of inertia. 
What is the kinetic energy of our object? We need to add up the energies of 
all the little pieces. A little piece has energy dm v? /2 = dm(rw)? /2. So the total 
kinetic energy is 


ra 
T= / dm. (8.7) 
2 
With our definition of Z; in Eq. (8.4), this becomes 
r ee (8.8) 
ar ee . 


This is easy to remember, because it looks a lot like the kinetic energy of a point 
mass, mv? /2. 


Fig. 8.2 
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8.1.2 General motion in x-y plane 


How do we deal with general motion in the x-y plane? For the motion in Fig. 8.3, 
where the object is both translating and spinning, the various pieces of mass don’t 
travel in circles around the origin, so we can’t write v = wr as we did above. 
It turns out to be highly advantageous to write the angular momentum, L, and 
the kinetic energy, T, in terms of the center-of-mass (CM) coordinates and the 
coordinates relative to the CM. The expressions for L and T take on very nice 
forms when written this way, as we now show. 

Let the position of the CM relative to a fixed origin be R = (X, Y). And let 
the position of a given point relative to the CM be r’ = (x’, y’). Then the position 
of the given point relative to the fixed origin is r = R + r’ (see Fig. 8.4). Let 
the velocity of the CM be V, and let the velocity relative to the CM be v’. Then 
v = V + v’. Let the body rotate with angular speed w’ around the CM (around 
an instantaneous axis parallel to the z axis, so that the pancake remains in the x-y 
plane at all times).? Then v’ = w'r’. 

Let’s look at L first. Let M be the total mass of the pancake. The angular 
momentum relative to the origin is 


L= frxvdm 


= | R+) x (V + v^) dm 


= / Rx Vdm+ | r’ x v'dm (cross terms vanish; see below) 


=MRxV+ (fwan) 


=RxP+ (1a) 2. (8.9) 


In going from the second to third line above, the cross terms, fr’ x Vdm and 
{Rx v'dm, vanish by definition of the CM, which says that fr’ dm = 0. 
(See Eq. (5.58); basically, the position of the CM in the CM frame is zero.) This 
implies that f v’ dm = d (f r' dm) /dt also equals zero. And since we can pull the 
constant vectors V and R out of the above integrals, we are therefore left with 
zero. The quantity J in the final result is the moment of inertia around an axis 
through the CM, parallel to the z axis. Equation (8.9) is a very nice result, and it 
is important enough to be called a theorem. In words, it says: 


Theorem 8.1 The angular momentum (relative to the origin) of a body can be 
found by treating the body like a point mass located at the CM and finding the 


3 What we mean here is the following. Consider a coordinate system whose origin is the CM and 
whose axes are parallel to the fixed x and y axes. Then the pancake rotates with angular speed w’ 
with respect to this system. 
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angular momentum of this point mass relative to the origin, and by then adding 
on the angular momentum of the body relative to the CM.* 


Note that if we have the special case where the CM travels around the origin 
in a circle with angular speed Q (so that V = QR), then Eq. (8.9) becomes 
L = (MR Q + IMa')3. 

Now let’s look at 7. The kinetic energy is 


1 
r= [5am 


1 
= | 5IV+¥Pan 


1 1 
=5 f V? dm + 5 f v dm (cross term vanishes; see below) 


1 
MV” + Ma. (8.10) 


In going from the second to third line above, the cross term f V > v'dm = 
V - f v' dm vanishes by definition of the CM, as in the above calculation of L. 
Again, Eq. (8.10) is a very nice result. In words, it says: 


Theorem 8.2 The kinetic energy of a body can be found by treating the body 
like a point mass located at the CM, and by then adding on the kinetic energy of 
the body due to the motion relative to the CM.° 


To calculate £, my dear class, 

Just add up two things, and you’ll pass. 
Take the CM point’s E, 

And then add on with glee, 

The £ ’round the center of mass. 


Example (Cylinder on a ramp): A cylinder of mass m, radius r, and moment of 
inertia Z = (1 /2)mr? (this is the J for a solid cylinder around its center, as we’ll see 
in Section 8.3.1) rolls without slipping down a plane inclined at an angle 6. What is 
the acceleration of the center of the cylinder? 


4 This theorem works only if we use the CM as the location of the imagined point mass. True, in 
the above analysis we could have chosen a point P other than the CM, and then written things 
in terms of the coordinates of P and the coordinates relative to P (which could also be described 
by a rotation). But then the cross terms in Eq. (8.9) wouldn’t vanish, and we’d end up with an 
unenlightening mess. 

> We already knew this from Section 5.6.2. It’s just that now we know that the kinetic energy in the 
CM frame takes the form of [™Mw? /2. 
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Solution: We’ll use conservation of energy to determine the speed v of the center 
of the cylinder after it has moved a distance d down the plane, and then we’ll read 
off a from the standard constant-acceleration kinematic relation, v = f 2ad. 

The loss in potential energy of the cylinder is mgd sin 0. This shows up as kinetic 
energy, which equals mv? /2+J2 /2 from Theorem 8.2. But the nonslipping condition 
is v = ar. Therefore, œ = v/r, and conservation of energy gives 


mgd sin 0 


La 13 
= pay 
mu” + lw 


| 
N 
3 
© 
N 
+ 
NI = 
Ae 
NI = 
3 
i 
N 
i n 
ea 
i a 
—" 
N 


2 (8.11) 


l 
3 
= 
Se 


So the speed as a function of distance is v = ,/(4/3)gd sin@. Hence, v = /2ad 
gives a = (2/3)g sin 0, which is independent of r. 


REMARKS: Our answer is 2/3 of the g sin @ result for a block sliding down a frictionless 
plane. It is smaller because there is kinetic energy “wasted” in the rotational motion. 
Alternatively, it is smaller because there is a friction force pointing up the plane (to provide 
the torque necessary to get the cylinder rotating, but we’ll talk about torque in Section 8.4), 
so this decreases the net force down the plane. 

If we let Z take the general form J = Bmr?, where £ is a numerical factor, then you can 
show that the acceleration becomes a = yg sin@/(1 + £). So if 6 = 0 (all the mass is at 
the center), then we simply have a = g sin, so the cylinder behaves just like a sliding 
block. If 6 = 1 (all the mass is on the rim), then a = (1/2)gsin@. If B —> o,° then 
a — 0. After reading Section 8.4, you can think about these special cases alternatively in 
terms of the forces and torques involved. 

Although this problem can also be solved by using force and torque (the task of Exer- 
cise 8.37), the conservation-of-energy method is generally quicker in more complicated 
problems of this type, as you can see by doing, for example, Exercises 8.28 and 8.46. & 


8.1.3 The parallel-axis theorem 


Consider the special case where the CM rotates around the origin at the same 
rate as the body rotates around the CM. This can be achieved, for example, by 
gluing a stick across the pancake and pivoting one end of the stick at the origin; 
see Fig. 8.5. In this special case, we have the simplified situation where all points 
in the pancake travel in circles around the origin. Let their angular speed be w. 
Then the speed of the CM is V = oR, so Eq. (8.9) gives the angular momentum 
around the origin as 


L: = (MR? + I™ yo. (8.12) 


é This can be obtained by adding on long spokes protruding from the cylinder and having them pass 
through a deep groove in the plane, or by having a spool with a very small inner radius roll down 
a thin plane with only its inner “axle” rolling on the plane. 
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In other words, the moment of inertia around the origin is 
I, = MR? + I, (8.13) 


This is the parallel-axis theorem. It says that once you’ ve calculated the moment 
of inertia of an object around the axis passing through the CM (namely rE, 
then if you want to calculate the moment of inertia around a parallel axis, you 
simply have to add on MR’, where R is the distance between the two axes, and 
M is the mass of the object. Note that since the parallel-axis theorem is a special 
case of the result in Eq. (8.9), it is valid only with the CM, and not with any other 
point. The parallel-axis theorem actually holds for arbitrary nonplanar objects 
too, as we’ll see in Section 8.2. And we’ll also derive a more general form of the 
theorem in Chapter 9. 

We can also look at the kinetic energy in this special case where the CM 
rotates around the origin at the same rate as the body rotates around the CM. 
Using V = œR in Eq. (8.10), we find 


1 1 
T=; (ur? +1) o? = shea. (8.14) 


Example (A stick): Let’s verify the parallel-axis theorem for a stick of mass m 
and length £, in the case where we want to compare the moment of inertia around an 
axis through an end (perpendicular to the stick) with the moment of inertia around an 
axis through the CM (perpendicular to the stick). 

For convenience, let p = m/£ be the density. The moment of inertia around an axis 
through an end is 


£ £ 1 1 1 
end =| x? dm -f xp dx = ze = x (oye = qe’. (8.15) 


The moment of inertia around an axis through the CM is 


CM see e2 a Se eae a oe 
I =f x am = | x“ pdx = — pl = — (pL) = —mé*. (8.16) 
—£/2 —£/2 12 12 12 


This is consistent with the parallel-axis theorem, Eq. (8.13), because 
g 2 
pend m (5) + IM. (8.17) 


Remember that this works only with the CM. If we instead want to compare 7 end 
with the / around a point, say, £/6 from that end, then we cannot say that they differ 
by m(£/6)?. But we can compare each of them to J™ and say that they differ by 
(€/2)? — (€/3)? = 5€2/36. 
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8.1.4 The perpendicular-axis theorem 


This theorem is valid only for pancake objects. Consider a pancake object in the 
x-y plane (see Fig. 8.6). Then the perpendicular-axis theorem says that 


L=k+ly, (8.18) 


where J, and J, are defined analogously to the J, in Eq. (8.4). That is, to find Zy, 
imagine spinning the object around the x axis at angular speed w, and then define 
I, = Ly/æ. (Only the distance from the x axis matters in calculating the speed of 
a given point. So the fact that the object has extent along the x direction and is 
therefore not a pancake in the y-z plane is irrelevant. The following section has 
further discussion of this.) Likewise for J,,. In other words, 


i= fo? +2?)dm, p= fetan, L= [tan (8.19) 


To prove the perpendicular-axis theorem, we simply use the fact that z = 0 
for our pancake object. Equation (8.19) then gives J, = Jy + J. In the limited 
number of situations where this theorem is applicable, it can save you some 
trouble. A few examples are given in Section 8.3.1. 


8.2 Nonplanar objects 


In Section 8.1, we restricted the discussion to pancake objects in the x-y plane. 
However, nearly all the results we derived carry over to nonplanar objects, pro- 
vided that the axis of rotation is parallel to the z axis, and provided that we are 
concerned only with Lz, and not Ly or Ly. So let’s drop the pancake assumption 
and run through the results we obtained above. 

First, consider an object rotating around the z axis. Let the object have exten- 
sion in the z direction. If we imagine slicing the object into pancakes parallel to 
the x-y plane, then Eqs. (8.4) and (8.5) correctly give the L, for each pancake. 
And since the L, of the whole object is the sum of the L,’s of all the pancakes, 
we see that the Z, of the whole object is the sum of the /,’s of all the pancakes. 
The difference in the z values of the pancakes is irrelevant. Therefore, for any 
object rotating around the z axis, we have 


L= | @? +y?) dm, and L,=Lo, (8.20) 
y 


where the integration runs over the entire volume of the body. We’ll calculate the 
I, for many nonplanar objects in Section 8.3.1. Note that even though Eq. (8.20) 
gives the L, for an arbitrary object, the analysis in this chapter is still not com- 
pletely general because (1) we are restricting the axis of rotation to be the (fixed) 
z axis, and (2) even with this restriction, an object outside the x-y plane might 
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have nonzero x and y components of L, but we found only the z component in 
Eq. (8.20). This second fact is strange but true. We’ll deal with it in great detail 
in Section 9.2. 

As far as the kinetic energy goes, the T for a nonplanar object rotating around 
the z axis is still given by Eq. (8.8), because we can obtain the total T by adding 
up the T’s of all the pancake slices. 

Also, Eqs. (8.9) and (8.10) continue to hold for a nonplanar object in the 
case where the CM is translating while the object is spinning around it (or more 
precisely, spinning around an axis parallel to the z axis and passing through 
the CM). The velocity V of the CM can actually point in any direction, and 
these two equations are still valid. But we’ll assume in this chapter that all 
velocities are in the x-y plane. 

Lastly, the parallel-axis theorem still holds for a nonplanar object (the deriva- 
tion using Eq. (8.9) is the same). But as mentioned in Section 8.1.4, the 
perpendicular-axis theorem does not hold. This is the one instance where we 
need the planar assumption. 


Finding the CM 

The center of mass has come up repeatedly in this chapter. For example, when we 
used the parallel-axis theorem, we needed to know where the CM was. In some 
cases, such as with a stick or a disk, the location is obvious. But in other cases, 
it isn’t so clear. So let’s get a little practice calculating the location of the CM. 
Depending on whether the mass distribution is discrete or continuous, the position 
of the CM is defined by (see Eq. (5.58)) 


Yrm; frdm 
R; = 
m °> X «(Rem w` 


Rem = (8.21) 


where M is the total mass. We’ll do an example with a continuous mass distribu- 
tion here. As is often the case with problems involving an integral, the main step 
in the solution is deciding how you want to slice up the object to do the integral. 


Example (Hemispherical shell): Find the location of the CM of a hollow 
hemispherical shell, with uniform mass density and radius R. 


Solution: By symmetry, the CM is located on the line above the center of the base. 
So our task reduces to finding the height, ycm. Let the mass density be ø. We’ll slice 
the hemisphere up into horizontal rings, described by the angle 6 above the horizontal, 
as shown in Fig. 8.7. If the angular thickness of a ring is d0, then its mass is 


dm = o dA = o (length) (width) = øo (27Rcos6)(Rdé). (8.22) 
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All points on the ring have a y value of R sin 0. Therefore, 


m/2 
vem = 5 z frar = Í (R sin 6) (2 R20 cos 6 d0) 


m/2 
=r sin 0 cos 0 dé 
0 


Rsin26 |7/2 


a 


0 

R 

= 8.23 

; (8.23) 
The simple factor of 1/2 here is nice, but it’s not all that obvious. It comes from the 
fact that each value of y is represented equally. If you solved the problem by doing a 
dy integral instead of a d0 one, you would find that there is the same area (and hence 
the same mass) in each ring of vertical height dy. In short, as y increases, the larger 
tilt of the surface cancels out the smaller radius of the rings, yielding the same area. 
You are encouraged to work this out. 


The calculation of a CM is very similar to the calculation of a moment of 
inertia. Both involve an integration over the mass of an object, but the former 
has one power of a length in the integrand, whereas the latter has two powers. 


8.3 Calculating moments of inertia 
8.3.1 Lots of examples 


Let’s now calculate the moments of inertia of various objects, around specified 
axes. We’ll use p to denote the mass density (per unit length, area, or volume, as 
appropriate), and we’ll assume that this density is uniform throughout the object. 
For the more complicated objects in the list below, it is generally a good idea to 
slice them up into pieces for which Z is already known. The problem then reduces 
to integrating over these known /’s. There is usually more than one way to do this 
slicing. For example, a sphere can be looked at as a series of concentric shells 
or a collection of disks stacked on top of each other. In the examples below, you 
may want to play around with slicings other than the ones given. Consider at least 
a few of these examples to be problems and try to work them out for yourself. 


1. Aring of mass M and radius R (axis through center, perpendicular to plane; Fig. 8.8): 


20 
rajy m= f R2pRd0 = (27 Rp)R? =| MR? |, (8.24) 
0 


as expected, because all of the mass is a distance R from the axis. 


8.3 Calculating moments of inertia 319 


2. Aring of mass M and radius R (axis through center, in plane; Fig. 8.8): The distance 
from the axis is (the absolute value of) R sin 9. Therefore, 


2n 1 
I= f r? dm = Í (R sin 0)? pR d0 = 5 2m Rp)R? =| ime? |, (8.25) 


where we have used sin2@ = (1 — cos 20)/2. You can also find J by using the 
perpendicular-axis theorem. In the notation of Section 8.1.4, we have Jy = Iy, by 
symmetry. Therefore, Iz = 21y. Using = MR? from Example | then gives Jy = 
MR?/2. 

3. Adisk of mass M and radius R (axis through center, perpendicular to plane; Fig. 8.9): 


2m pR 
1 
I= J” dm a f r pr dr dO = (R*/4)2xp = 5 (Om RR’ =| AMR? |. 
0 0 


(8.26) 


You can save the (trivial) step of integrating over 0 by considering the disk to be 
made up of many concentric rings, and invoking Example 1. The mass of each ring is 
p2rr dr. Integrating over the rings gives Z = re (p2ar dr)r? = mR* p/2 = MR? /2, 
as above. Slicing up the disk is fairly inconsequential in this example, but it will save 


up into rings, and use Example 2. 


you some trouble in others. 
4. Adisk of mass M and radius R (axis through center, in plane; Fig. 8.9): Slice the disk i 


R 
1 a s 
I= Í (1/2 (o2rr dr)r? = (R? /4) pr = gOTROR? =| MR?|. (8.27) 
0 
; : > Fig. 8.9 
Or just use Example 3 and the perpendicular-axis theorem. 
5. A thin uniform rod of mass M and length LZ (axis through center, perpendicular to 
rod; Fig. 8.10): We already found this Z and the next one in Section 8.1.3, but we’ll : 
include them here for completeness. LL 
L/2 1 
I= [Pan 5) xp dx = — (PDI? =| PME |. (8.28) 
<12 12 : 
i L 
6. A thin uniform rod of mass M and length L (axis through end, perpendicular to rod; i 
Fig. 8.10): ' 
Fig. 8.10 


L 
1 
I= I x?dm = [ pdx = zD? =| IME |, (8.29) 


7. A spherical shell of mass M and radius R (any axis through center; Fig. 8.11): Let’s 
slice the sphere into horizontal ring-like strips. In spherical coordinates, the radius 
of a ring is given by r = Rsin0, where 0 is the angle down from the north pole. 
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The area of a strip is then 27 (R sin 0)R d0. Using f sin?@ = f sinod — cos?0) = 
— cos + cos?0/3, we have 


T T 
I= / r? dm = ) (Rsin 0)? 2xp(R sin 0)R d0 = 2xpR* f sin°6 
0 0 


2 
= 2r pR (4/3) = 3 OR? p)R? =| $MR? |. (8.30) 


An alternate and slick way of deriving this is to use the following result, similar in 
spirit to the perpendicular-axis theorem: Adding up the three moments of inertia in 
Eq. (8.19) gives 


+ht+k =2for+y?+22)dm=2 | dm. (8.31) 


a> If we apply this to a spherical shell, where r always equals the radius R, then the 
right-hand side is 2MR?. And since the J’s are all equal by symmetry, they must all 
Ver be 2MR?/3. 
i 8. A solid sphere of mass M and radius R (any axis through center; Fig. 8.11): A sphere 
is made up of concentric spherical shells. The volume of a shell is 4xr?dr. Using 
Fig. 8.11 Example 7, we have 


R 
I= f (2/3)(4pr-dr)r? = (R°/5)(82p/3) = £4 p/3)° =| 2MR? |. 
0 


(8.32) 
L The task of Exercise 8.33 is to derive this result by slicing the sphere into horizontal 
disks. 
9. An infinitesimally thin triangle of mass M and length Z (axis through tip, perpendicular 
L to plane; Fig. 8.12): Let the base have length a, where a is infinitesimally small. Then 
a tiny vertical slice a distance x from the tip has length a(x/L). Ifthe slice has thickness 
dx, then it is essentially a point mass of mass dm = pax dx/L. Therefore, 
L 2 Za l ZST 
T= f> dm = [ x“ pax/L dx = z pal /2)L =| 7ML* |, (8.33) 
Fig. 8.12 because aL/2 is the area of the triangle. This has the same form as the disk in 


Example 3, because a disk is made up of many of these triangles. 

10. An isosceles triangle of mass M, vertex angle 28, and common-side length L (axis 
through tip, perpendicular to plane; Fig. 8.12): Let A be the altitude of the triangle 
(so h = Lcos £). Slice the triangle into thin strips parallel to the base. Let x be the 
distance from a strip to the vertex. Then the length of a strip is £ = 2x tan £, and its 
mass is dm = p(2x tan £ dx). Using Example 5 above, along with the parallel-axis 


theorem, we have 


g2 h 2x t 2 
l= [am — 437} = (p2x tan B dx) seta a + 
12 0 12 


h tan? B 3 tan2B \ h4 
= 2petan B 1+ 3 x” dx =2ptang | 1+ 3 7° (8.34) 
0 
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But the area of the entire triangle is h? tan B, so we have I = (Mh? /2)(1 + 
(1/3) tan”). In terms of L = h/ cos £, this is 


I = (ML?/2) (cos? 6 + sin? 6/3) = Mr (1 = 3 sin?) . (8.35) 


11. A regular V-gon of mass M and “radius” R (axis through center, perpendicular to 
plane; Fig. 8.13): The N-gon is made up of N isosceles triangles, so we can use 
Example 10, with 6 = x/N. The masses of the triangles simply add, so if M is the 


mass of the whole N-gon, we have 


I =| 5MR?(1 — 3 sin?Z,) |. (8.36) 


We can list the values of J for a few N. With the shorthand notation (N, T /MR2), 
Eq. (8.36) gives (3, D, (4, 3) (6, 5), (co, 5). These values of / forma nice arithmetic 
progression. 

12. A rectangle of mass M and sides of length a and b (axis through center, perpendicular 
to plane; Fig. 8.13): Let the z axis be perpendicular to the plane. We know that 
k= Mb?/12 and Jy = Ma?/12 (because the extent of an object along an axis doesn’t 
affect the moment around that axis, when written in terms of the total M). So the 


perpendicular-axis theorem gives 


L=Iet+ly =| GM@ +?) |. (8.37) 


8.3.2 A neat trick 


For some objects with certain symmetries, it’s possible to calculate the moment 
of inertia without doing any integrals. The only things we need are a scaling 
argument and the parallel-axis theorem. We’ll illustrate this technique by finding 
the 7 for a stick around its center (Example 5 above). Yov’ll find other applications 
in the problems for this chapter. 

In the present example, the basic trick is to compare the / for a stick of length L 
with the / for a stick of length 2L (and same density p). A quick scaling argument 
shows that the latter is eight times the former. This is true because the integral 
J x? dm = f x? p dx has three powers of x in it (yes, the dx counts). So a change 
of variables, x = 2y, brings in a factor of 2° = 8. Equivalently, if we imagine 
expanding the smaller stick into the larger one, then a corresponding piece in the 
larger stick will be twice as far from the axis, and also twice as massive. The 
integral f x? dm therefore increases by a factor of 2? - 2 = 8. 

The technique is most easily illustrated with pictures. If we denote the moment 
of inertia of an object by a picture of the object, with a dot signifying the axis, 


Fig. 8.13 
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then we have: 


L L L 
—_e——— = 5 —eo— 
———e—— = 2 ea 


— = —— +mu(Z) 


The first line comes from the scaling argument, the second line comes from the 
fact that moments of inertia simply add (the left-hand side is two copies of the 
right-hand side, attached at the pivot), and the third line comes from the parallel- 
axis theorem. Equating the right-hand sides of the first two equations gives 


— = 4 —e— 


Plugging this expression for e——— into the third equation gives the desired 
result, 


——— = L mr? 


12 


Note that sooner or later we must use real live numbers, which enter here through 
the parallel-axis theorem. Using only scaling arguments isn’t sufficient, because 
they provide only linear equations homogeneous in the /’s, and therefore give 
no means of picking up the proper dimensions. (For an interesting account of 
Galileo’s discovery of scaling laws, see Peterson (2002).) 

Once you’ve mastered this trick and applied it to the fractal objects in Prob- 
lem 8.8, you can impress your friends by saying that you know how to “use 
scaling arguments, along with the parallel-axis theorem, to calculate moments of 
inertia of objects with fractal dimension.” And you never know when that might 
come in handy! 


8.4 Torque 


We will now show that under certain conditions (stated below), the rate of change 
of angular momentum is equal to a certain quantity, t, which we call the torque. 
That is, t = dL/dt. This is the rotational analog of our old friend F = dp/dt 
involving linear momentum. The basic idea here is straightforward, but there are 
two subtle issues. One deals with internal forces within a collection of particles. 
The other deals with the possible acceleration of the origin (the point relative to 
which the torque and angular momentum are calculated). To keep things straight, 
we’ll prove the general result by dealing with three increasingly complicated 
situations. 


8.4 Torque 


Our derivation of t = dL/dt here holds for completely general motion, so 
we can take the result and use it in the following chapter, too. If you wish, you 
can construct a more specific proof of t = dL/dt for the special case where the 
axis of rotation is parallel to the z axis. But since the general proof is no more 
difficult, we'll present it here in this chapter and do it once and for all. 


8.4.1 Point mass, fixed origin 


Consider a point mass at position r relative to a fixed origin (see Fig. 8.14). The 
time derivative of the angular momentum, L = r x p, is 


dL 
FA gex 
dr p dp 
= — x x — 
dt P dt 
=vx(mv)+rxF 
=04rxF, (8.38) 


where F is the force acting on the particle. This is the same calculation as in 
Theorem 7.1, except that here we are considering an arbitrary force instead of a 
central one. If we define the torque on the particle as 


t=rxF, (8.39) 
then Eq. (8.38) becomes 
3 (8.40) 
TS ; 


It is understood that the r in the torque is measured with respect to the same 
origin as the r in the angular momentum. 


8.4.2 Extended mass, fixed origin 


In an extended object, there are internal forces acting on the various pieces of the 
object, in addition to whatever external forces exist. For example, the external 
force on a given molecule in a body might come from gravity, while the internal 
forces come from the adjacent molecules. How do we deal with these different 
types of forces? 

In what follows, we will deal only with internal forces that are central forces, 
so that the force between two objects is directed along the line between them. 
This is a valid assumption for the pushing and pulling forces between molecules 
in a solid. (It isn’t valid, for example, when dealing with magnetic forces. But 
we won’t be interested in such things here.) We will invoke Newton’s third law, 


Fig. 8.14 
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which says that the force that particle 1 applies to particle 2 is equal and opposite 
to the force that particle 2 applies to particle 1. 

For concreteness, let’s assume that we have a collection of N discrete particles 
labeled by the index i (see Fig. 8.15). In the continuous case, we would need to 
replace the following sums with integrals. The total angular momentum of the 
system is 


N 
L=} r; x pi (8.41) 
i=l 


The force acting on each particle is F ex + pint = dp;/dt. Therefore, 


dL d 
Tg 
dri dpi 
E aa A 
=} v; x (mvj) + È` r; x (Ft + F™) 
i i 
=0+) r; x Fe 


i 
= yor (8.42) 
i 


The second-to-last line follows because v; x v; = 0, and also because `; rj x 
Fint = 0, as you can show in Problem 8.9. In other words, the internal forces 
provide no net torque. This is quite reasonable. It basically says that a rigid object 
with no external forces won’t spontaneously start rotating. 

Note that the right-hand side involves the total external torque acting on the 
body, which may come from forces acting at many different points. Note also 
that nowhere did we assume that the particles are rigidly connected to each other. 
Equation (8.42) still holds even if there is relative motion among the particles. 
But in that case, it’s usually hard to get a handle on L, because it doesn’t take the 
nice Jw form. 


8.4.3 Extended mass, nonfixed origin 


Let the position of the origin be rọ (see Fig. 8.16), and let the positions of the 
particles be r;. The vectors rọ and r; are measured with respect to a given fixed 
coordinate system. The total angular momentum of the system, relative to the 


8.4 Torque 


(possibly accelerating) origin ro, is’ 
L= SG — ro) x m;(ři — fo). (8.43) 
i 
Therefore, 
dL d : ; 
qr T HA Di o) x mCi — to) 
t 


= Di — fo) x mi(t; — řo) + DY — ro) x mi(ř; — fo) 


=0+ Xoor — ro) x (F™ + F™ — mip), (8.44) 


l 


because m;ř; is the net force (namely F°*t + F int) acting on the ith particle. But 
a quick corollary to Problem 8.9 is that the term involving F int vanishes (as you 
should check). And since X` m;r; = MR (where M = }` mi; is the total mass, 
and R is the position of the center of mass), we have 


dL R 

Pi (De — ro) x re) — M(R — ro) x Fo. (8.45) 
L 

The first term here is the external torque, measured relative to the origin ro. The 

second term is something we wish would go away. And indeed, it usually does. 

It vanishes if any of the following three conditions is satisfied. 


1. R = rọ, that is, the origin is the CM. 
2. Fo = 0, that is, the origin is not accelerating. 
3. (R — ro) is parallel to Fg. This condition is rarely invoked. 


If any of these conditions is satisfied, then we are free to write 


dL 


Gp T-ri) xE =) e". (8.46) 
i i 


In other words, we can equate the total external torque with the rate of change of 
the total angular momentum. An immediate corollary of this result is: 


Corollary 8.3 Jf the total external torque on a system is zero, then its angular 
momentum is conserved. In particular, the angular momentum of an isolated 
system (one that is subject to no external forces) is conserved. 


7 More precisely, we are calculating the angular momentum relative to a coordinate system whose 
origin is rọ and whose axes remain parallel to the fixed axes. If we allowed for a rotation of 
the axes, then we would have to deal with all the fictitious-force issues that are the subject 
of Chapter 10. As it is, we will still end up dealing with one fictitious force (see the remark 
below). 
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Everything up to this point is valid for arbitrary motion. The particles can 
be moving relative to each other, and the various L;’s can point in different 
directions, etc. But let’s now restrict the motion. In the present chapter, we are 
dealing only with cases where L is constant (taken to point in the z direction). 
Therefore, dL/dt = d (LL) /dt = (aL / dt)L. If in addition we consider only rigid 
objects (where the relative distances among the particles are fixed) that undergo 
pure rotation around a given point, then L = Jw, which gives dL/dt = Iw = Ia. 
Taking the magnitude of both sides of Eq. (8.46) then gives 


Text = Ia. (8.47) 


Invariably, we will calculate angular momentum and torque around either the 
CM ora fixed point (or a point that moves with constant velocity, but this doesn’t 
come up often). These are the “safe” origins, in the sense that Eq. (8.46) holds. As 
long as you always use one of these safe origins, you can simply apply Eq. (8.46) 
and not worry much about its derivation. 


REMARK: You'll probably never end up invoking the third condition above, but it’s interesting 
to note that there’s a simple way of understanding it in terms of accelerating reference frames. 
This is the topic of Chapter 10, so we’re getting a little ahead of ourselves here, but the reasoning 
is as follows. Let ro be the origin of a reference frame that is accelerating with acceleration 
¥o. Then all objects in this accelerating frame feel a mysterious fictitious force of —mřo. For 
example, on a train accelerating to the right with acceleration a, you feel a strange force of 
ma pointing to the left. If you don’t counter this with another force (by grabbing a handle, 
for example), then you will fall over. This fictitious force acts just like a gravitational force, 
because it is proportional to the mass. Therefore, it effectively acts at the CM, producing a 
torque of (R — ro) x (—M yo). This is the second term in Eq. (8.45). This term vanishes if the 
CM is directly “above” or “below” (as far as the fictitious gravitational force is concerned) the 
origin, in other words, if (R — ro) is parallel to fo. See Problem 10.8 for further discussion of 
this in terms of fictitious forces. 

There is one common situation where the third condition can be invoked. Consider a wheel 
rolling without slipping on the ground. Mark a dot on the rim. At the instant this dot is in 
contact with the ground, it is a valid choice for the origin. This is true because (R — rg) points 
vertically. And rp also points vertically, because a dot on a rolling wheel traces out a cycloid. 
Right before the dot hits the ground, it is moving straight downward. And right after it hits the 
ground, it is moving straight upward. But having said this, there’s usually nothing to be gained 
by picking such an origin. So the safe thing to do is to always pick your origin to be either the 
CM or a fixed point, even if the third condition holds. 


For conditions that number but three, 
We say, “Torque is dL by dt.” 

But though they’re all true, 

Pl stick to just two; 

It’s CM’s and fixed points forme! & 


Example: A string wraps around a uniform cylinder of mass M, which rests on a 
fixed plane. The string passes up over a massless pulley and is connected to a mass m, 
as shown in Fig. 8.17. Assume that the cylinder rolls without slipping on the plane, 
and that the string is parallel to the plane. What is the acceleration of the mass m? 
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What is the condition on the ratio M /m for which the cylinder accelerates down the 
plane? 


First solution: The friction, tension, and gravitational forces are shown in 
Fig. 8.18. Define positive a], a2, and œ as shown. These three accelerations, along 


with T and F, are five unknowns. We therefore need to produce five equations. 


They are: 
1. F=maonm => T-—mg=may. 
2. F=maonM => Mgsin0 -T —F = Maj. 
3. t = Iæ on M (around the CM) = FR- TR = (MR?/2)a. 
4. Nonslipping condition => a=aj/R. 
5. Conservation of string => a= 2a]. 


A few comments on these equations: The normal force and the gravitational force 
perpendicular to the plane cancel, so we can ignore them. We have picked positive F 
to point up the plane, but if it happens to point down the plane and thereby turn out 
to be negative, that’s fine (but it won’t); we don’t need to worry about which way it 
really points. In (3), we are using the CM of the cylinder as our origin, but we can 
also use a fixed point; see the second solution below. In (5), we have used the fact that 
the top of a rolling wheel moves twice as fast as the center. This is true because the 
top has the same speed relative to the center as the center has relative to the ground. 

We can go about solving these five equations in various ways. Three of the equations 
involve only two variables, so it’s not so bad. (3) and (4) give F— T = Maj, /2. Adding 
this to (2) gives Mg sin 8 — 2T = 3May/2. Using (1) to eliminate T, and using (5) to 
write aq in terms of a2, then gives 


3Maz _ M sin — 2m)g 


Mg sin @ — 2(mg + maz) = 2 =~ G/4)M + 2m 


(8.48) 


And a; = a2 /2. We see that a is positive (that is, the cylinder rolls down the plane) 
if M/m > 2/sin@. If — 0, then this gives M/m — oo, which makes sense. 
If@ — 2/2, then M/m — 2. The basic reason for this is that the friction force F, in 
addition to the tension 7, is holding the cylinder up (the coefficient of friction must 
be very large in this case if no slipping is to occur). 


Second solution: Inusing t = dL /dt, we can also pick a fixed point as our origin, 
instead of the CM. The most sensible point is one that is located somewhere along 
the plane. The Mg sin 6 force now provides a torque, but the friction does not. And 
the lever arm for the tension is now 2R. The angular momentum of the cylinder with 
respect to a point on the plane is L = Jw+ Mv 1R, where the second term comes from 
the angular momentum due to the object being treated like a point mass at the CM. 
So dL/dt = Ia + Ma,R, and t = dL /dt gives 


(Me sin 0)R — T(2R) = (MR2/2)a + MayR. (8.49) 


This turns out to be the sum of the third equation plus R times the second equation in 
the first solution. We therefore obtain the same result. 


Fig. 8.18 
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8.5 Collisions 


In Section 5.7, we looked at collisions involving point particles or otherwise 
nonrotating objects. The fundamental ingredients there that enabled us to solve 
problems were conservation of linear momentum and conservation of energy 
(if the collision was elastic). With conservation of angular momentum now at 
our disposal, we can extend our study of collisions to ones with rotating objects. 
The additional fact of conservation of L is compensated for by the new degree of 
freedom of the rotation. Therefore, provided that the problem is set up properly, 
we will still have the same number of equations as unknowns. 

Conservation of energy can be used in a collision only if it is elastic (by defini- 
tion). But conservation of angular momentum is similar to conservation of linear 
momentum, in that it can always be used (see the remark below), assuming that 
the system is isolated. However, conservation of L is a little different from con- 
servation of p, because we have to pick an origin before we can proceed. In view 
of the three conditions that are necessary for Corollary 8.3 to hold, we must pick 
our origin to be either a fixed point or the CM of the system (we’ll ignore the third 
condition, since it’s rarely used). If we unwisely choose an accelerating point, 
then t = dL/dt does not hold, so we have no right to claim that dL/dt equals 
zero just because the torque is zero (as it is for an isolated system). In collision 
problems, it is easy to fall into the trap of picking an accelerating point as your 
origin. For example, you might choose the center of a stick as your origin. But if 
another object collides with the stick, then the center will accelerate, making it 
an invalid choice for the origin. 


Remark: As far as conservation goes, the way that E differs from p and L is that energy 
can be hidden in the microscopic motion of molecules in the body, in the form of heat. This 
motion consists of little vibrations with small amplitudes but high speeds. The energy of these 
vibrations can be large enough to be on the same order of magnitude as the overall energy 
of the system. But because the vibrations are too small to see, it appears that energy is lost. 
However, note that even though they’re too small to see, you can still feel them with your hand, 
as heat. 

Linear momentum, however, can’t be hidden. Ifan object (not necessarily rigid) has nonzero 
momentum, then it has to be moving as a whole, and there’s no way around that. In short, since 
P = MVcm, we see that if P is nonzero, then Vcy is also. So the motion must be on a 
macroscopic scale; there’s no way for it to be hidden on a microscopic scale. 

With angular momentum, things are a little trickier. If the object is rigid, then it can’t 
have hidden angular momentum, for reasons similar to those in the linear momentum case; 
since L = Jw, we see that if L is nonzero, then w is also. However, if the object isn’t 
rigid (consider, say, a gas of particles), then it turns out that it can theoretically have hid- 
den angular momentum in microscopic motion, although in practice it ends up being too small 
to notice. This hidden angular momentum can arise from little swirling regions throughout 
the system. In contrast with linear momentum, it is possible to have angular momentum with- 
out any overall motion. So in this respect, this microscopic swirling motion is similar to the 
microscopic vibrational motion that yields hidden energy. There are, however, three main 
differences. 

First, if we’re assuming that the swirling motion takes place on a microscopic scale, then the 
r in L = mrv is very small, and this leads to a negligible L. This argument doesn’t hold for £, 
because the energy of the vibrations doesn’t involve r. Instead, it involves only v, in the form of 
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mv” /2, so it can end up being large. Second, there’s no easy way to start up the circular motion 
of many little swirls by means of a collision, in contrast with the easily started random linear 
motion that makes up the energy of heat; you just smash two things together. And third, in the 
case where the object is rigid, the molecules can easily vibrate, but they can’t rotate indefinitely 
because this would involve ripping apart the bonds between adjacent molecules. The issue is 
that in vibrational motion all coordinates remain small, whereas in rotational motion they don’t, 
because 0 eventually becomes large. 

There is, however, one very common phenomenon, namely magnetism, that (in a sense) is 
an exception to all three of the above points. Although magnetism isn’t an angular momentum, 
it does come (roughly speaking) from the “circular” motion of electrons around the nuclei in 
atoms. (In general, it actually comes more from the “spin” of the electrons than their orbital 
motion around the nuclei, but let’s not worry about that here. To get everything right, we’d 
have to think in terms of quantum mechanics, anyway. Let’s just work in a rough classical 
approximation.) Electrons throughout a magnetic material move in tiny little correlated loops. 
We can escape the above three points of reasoning because first, the magnetic field involves the 
electric charge e, and this is large enough (on the scale of things) to cancel out the smallness of 
the r factor. (The actual angular momentum of the electrons in a magnetic material is negligible 
because their mass is so small. There isn’t a large quantity like e to save the day.) Second, it 
is quite easy to get the electrons moving in correlated circles by means of magnetic forces; 
there’s no need to smash things together. And third, the electrons are free to move around in 
little circles (in a classical sense) in atoms without ripping things apart. & 


It is important to remember that you are free to choose your origin from the 
legal possibilities of fixed points or the CM. Since it is generally the case that one 
choice is better than others (in that it makes the calculations easier), you should 
take advantage of this freedom. Let’s do two examples. First an elastic collision, 
and then an inelastic one. 


Example (Elastic collision): A mass m travels perpendicular to a stick of mass 
m and length £, which is initially at rest. At what location should the mass collide 
elastically with the stick, so that the mass and the center of the stick move with equal 
speeds after the collision? 


Solution: Let the initial speed of the mass be vg. We have three unknowns in the 
problem (see Fig. 8.19), namely the desired distance from the middle of the stick, 
h; the final (equal) speeds of the stick and the mass, v; and the final angular speed 
of the stick, w. We can solve for these three unknowns by using our three available 
conservation laws: 

e Conservation of p: 
— vo 


mvo = mv + mvo => v= 3“ 


e Conservation of E: Remembering that the energy of the stick equals the energy 


(8.50) 


of the rotational motion around the center, plus the energy of the effective point 


mass at the center, we have 


2 2 
mv A). Zay e ) 2 _ 6x9 
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(8.51) 


Fig. 8.19 
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e Conservation of L: Let’s pick our origin to be the fixed point in space that 
coincides with the initial location of the center of the stick. Then conservation 
of L gives 


muph = m(2)h eh (is) Bo | (8.52) 


The zero here comes from the fact that the CM of the stick moves directly away 
from the origin, so there is no contribution to L from the first of the two parts in 
Theorem 8.1. Plugging the w from Eq. (8.51) into Eq. (8.52) gives 


1 
= h= 
5 mvo 


12 


ej 2s. pee, (8.53) 


You are encouraged to solve this problem again with a different choice of origin, 
for example, the fixed point that coincides with the spot where the mass hits the 
stick, or the CM of the entire system. 


Remark: After the stick makes half of a revolution, it will hit the backside of m. The 
resulting motion will have the stick sitting at rest (both translationally and rotationally) 
and the mass moving with its initial speed vo. You can show this by working though the 
second collision, using the quantities we found above. Or you can just use the fact that this 
scenario certainly satisfies conservation ofp, E£, and L with the initial conditions, so it must 
be what happens (because the quadratic conservation statements have only two solutions, 
and the other one corresponds to the intermediate motion). Note that the time it takes the 
stick to make half of a revolution is 7/w = m £/ /6v0. So the stick travels a distance of 
(vp /2) (1 £// 6v0) = (1 l/2/6) before it ends up at rest. This distance is independent of 
vo (which follows from dimensional analysis). & 


Let’s now look at an inelastic collision, where one object sticks to another. We 


won’t be able to use conservation of E now. But conservation of p and L will be 
sufficient since there is one fewer degree of freedom in the final motion, because 
the objects don’t move independently. 


Example (Inelastic collision): A mass m travels at speed vp perpendicular to a 
stick of mass m and length £, which is initially at rest. The mass collides completely 
inelastically with the stick at one of its ends and sticks to it. What is the resulting 
angular velocity of the system? 


Solution: The first thing to note is that the CM of the system is £/4 from the end, 
as shown in Fig. 8.20. After the collision, the system rotates about the CM as the CM 
moves in a straight line. Conservation of momentum quickly tells us that the speed 
of the CM is vg /2. Also, using the parallel-axis theorem, the moment of inertia of the 
system around the CM is 


P mée2 £\2 EN 2 5 
Ten, = [stick 4. pmass _ (=) (=) = me. 8.54 
cM =I¢M_+ ICM Dae © t” 24” (8:30) 


8.6 Angular impulse 


There are now many ways to proceed, depending on what point we choose as our 
origin. 

First METHOD: Choose the origin to be the fixed point that coincides with the location 
of the CM right when the collision happens (that is, the point £/4 from the end of the 


stick). Conservation of L says that the initial L of the ball must equal the final L of 
the system. This gives 


_ vo 


=z (8.55) 


£ 5 4 
Z) =(—me ) 0 
mug ( 7 ) ( 74 me jw + => 
The zero here comes from the fact that the CM of the stick moves directly away from 
the origin, so there is no contribution to Z from the first of the two parts in Theorem 


8.1. Note that we didn’t need to use conservation of p in this method. 


SECOND METHOD: Choose the origin to be the fixed point that coincides with the 
initial center of the stick. Then conservation of L gives 


meo(5) = (Syme?) + em (H) (£) => o= - (8.56) 


The right-hand side is the angular momentum of the system relative to the CM, plus 
the angular momentum (relative to the origin) of a point mass of mass 2m located 
at the CM. 


THIRD METHOD: Choose the origin to be the CM of the system. This point moves to 
the right with speed vg/2, along the line a distance 4/4 below the top of the stick. 
Relative to the CM, the mass m moves to the right, and the stick moves to the left, 
both with speed vg /2. Conservation of L gives 


m(2)(É) + [o+(2) G) = (Zm?) => v= ae (8.57) 


The zero here comes from the fact that the stick initially has no Z around its center. 
A fourth reasonable choice for the origin is the fixed point that coincides with the 
initial location of the top of the stick. You can work this one out for practice. 


8.6 Angular impulse 


In Section 5.5.1, we defined the impulse, which we’ll label as Z, to be the time 
integral of the force applied to an object. From Newton’s second law, F = dp/dt, 
the impulse is therefore the net change in linear momentum. That is, 


t 
T= f i F(t) dt = Ap. (8.58) 
ti 


We now define the angular impulse, Tọ, to be the time integral of the torque 
applied to an object. From t = dL/dt, the angular impulse is therefore the net 
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change in angular momentum. That is, 
to 
To = f t(t)dt = AL. (8.59) 
ti 


These are just definitions, devoid of any content. The place where the physics 
comes in is the following. Consider a situation where F(t) is always applied 
at the same position relative to the origin around which f(t) is calculated (this 
origin must be a legal one, of course). Let this position be R. Then we have 
T(t) = R x F(¢). Plugging this into Eq. (8.59), and taking the constant R outside 
the integral, gives Za = R x Z. In other words, 


AL =R x (Ap) (for F(¢) applied at one position). (8.60) 


This is a very useful result. It gives the relation between the net changes in L 
and p, as opposed to the individual values of each. Even if F(t) is changing in 
some arbitrary manner as time goes by, so that we have no idea what AL and 
Ap themselves are, we still know that they are related by Eq. (8.60). In many 
cases, we don’t have to worry about the cross product in Eq. (8.60), because the 
lever arm R is perpendicular to the change in momentum Ap. In such cases, 
we have 


|AL| = R| Apl. (8.61) 


Also, in many cases the object starts at rest, so we don’t have to bother with 
the A’s. The following example is a classic application of angular impulse 
and Eq. (8.61). 


Example (Striking a stick): A stick of mass m and length £, initially at rest, is 
struck with a hammer. The blow is made perpendicular to the stick at one end. Let 
the blow occur quickly, so that the stick doesn’t have time to move much while the 
hammer is in contact. If the CM of the stick ends up moving at speed v, what are the 
velocities of the ends right after the blow? 


Solution: Although we have no hope of knowing exactly what F (t) looks like, or 
the length of time it is applied, we still know from Eq. (8.61) that AL = (€/2)Ap, 
where we have chosen our origin to be the CM, which gives a lever arm of 
£/2. Therefore, (me? /12)@ = (€/2)mv, so the final v and œ are related by 
w = 6v/¢. 

The velocities of the ends right after the blow are obtained by adding (or subtracting) 
the rotational motion to the CM’s translational motion. The rotational velocities of the 
ends relative to the CM are tw(€/2) = +(6v/€)(€/2) = +3v. Therefore, the end 
that is hit moves with velocity v + 3v = 4v, and the other end moves with velocity 
v — 3v = —2v (that is, backward). 


8.7 Problems 


What L was, he just couldn’t tell. 
And p? He was clueless as well. 


But despite his distress, 


He wrote down the right guess 


For their quotient: the lever arm’s £. 


Impulse is also useful for “collisions” that occur over extended times. See, for 
example, Problem 8.24. 


8.7 Problems 


Section 8.1: Pancake object in x-y plane 


8.1. 


8.2. 


8.3. 


8.4. 


Massive pulley x 


Consider the Atwood’s machine shown in Fig. 8.21. The masses are m 
and 2m, and the pulley is a uniform disk of mass m and radius r. The 
string is massless and does not slip with respect to the pulley. Find the 
acceleration of the masses. Use conservation of energy. 


Leaving the sphere «x 


A ball with moment of inertia mr? rests on top of a fixed sphere. 
There is friction between the ball and the sphere. The ball is given an 
infinitesimal kick, and it rolls down without slipping. Assuming that 
r is much smaller than the radius of the sphere, at what point does 
the ball lose contact with the sphere? How does your answer change 
if the size of the ball is comparable to, or larger than, the size of the 
sphere? You may want to solve Problem 5.3 first, if you haven’t already 
done so. 


Sliding ladder +s» 


A ladder of length £ and uniform mass density stands on a frictionless 
floor and leans against a frictionless wall. It is initially held motionless, 
with its bottom end an infinitesimal distance from the wall. It is then 
released, whereupon the bottom end slides away from the wall, and the 
top end slides down the wall (see Fig. 8.22). When it loses contact with 
the wall, what is the horizontal component of the velocity of the center 
of mass? 


Leaning rectangle x+» 


A rectangle of height 2a and width 2b rests on top of a fixed cylinder of 
radius R (see Fig. 8.23). The moment of inertia of the rectangle around 
its center is J. The rectangle is given an infinitesimal kick and then 
“rolls” on the cylinder without slipping. Find the equation of motion for 
the tilt angle of the rectangle. Under what conditions will the rectangle 
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8.5. 


m M 


Fig. 8.24 


fall off the cylinder, and under what conditions will it oscillate back and 
forth? Find the frequency of these small oscillations. 


Mass in a tube xxx 


A tube of mass M and length £ is free to swing around a pivot at one end. 
A mass m is positioned inside the (frictionless) tube at this end. The tube 
is held horizontal and then released (see Fig. 8.24). Let 0 be the angle 
of the tube with respect to the horizontal, and let x be the distance the 
mass has traveled along the tube. Find the Euler-Lagrange equations for 
0 and x, and then write them in terms of 9 and n = x/£ (the fraction of 
the distance along the tube). 

These equations can only be solved numerically, and you must pick 
a numerical value for the ratio r = m/M in order to do this. Write a 
program (see Section 1.4) that produces the value of n when the tube is 
vertical (9 = 2/2). Give this value of 7 for a few values ofr. 


Section 8.3: Calculating moments of inertia 


CI n 8.6. 


8.7. 


8.8. 


Fig. 8.26 


Minimum / « 


Amoldable blob of matter of mass M is to be situated between the planes 
z = Oandz = 1 (see Fig. 8.25) so that the moment of inertia around the 
z axis is as small as possible. What shape should the blob take? 


Slick calculations of Z «« 


In the spirit of Section 8.3.2, find the moments of inertia of the following 
objects (see Fig. 8.26). 
(a) A uniform square of mass m and side £ (axis through center, 
perpendicular to plane). 
(b) A uniform equilateral triangle of mass m and side £ (axis through 
center, perpendicular to plane). 


Slick calculations of J for fractal objects xxx 


In the spirit of Section 8.3.2, find the moments of inertia of the following 
fractal objects. Be careful how the mass scales. 


(a) Takeastick of length £, and remove the middle third. Then remove 
the middle third from each of the remaining two pieces. Then 
remove the middle third from each of the remaining four pieces, 
and so on, forever. Let the final object have mass m, and let the axis 
be through the center, perpendicular to the stick; see Fig. 8.27.° 


8 This object is the Cantor set, for those who like such things. It has no length, so the density of 


the remaining mass is infinite. If you suddenly develop an aversion to point masses with infinite 


density, simply imagine the above iteration being carried out only, say, a million times. 
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(b) Take an equilateral triangle of side £, and remove the “middle” 
triangle (1/4 of the area). Then remove the “middle” triangle from 
each of the remaining three triangles, and so on, forever. Let the 
final object have mass m, and let the axis be through the center, 
perpendicular to the plane; Fig. 8.28. 

Take a square of side £, and remove the “middle” square (1/9 
of the area). Then remove the “middle” square from each of the 
remaining eight squares, and so on, forever. Let the final object 
have mass m, and let the axis be through the center, perpendicular 
to the plane; see Fig. 8.29. 


(c 


wm 


Section 8.4: Torque 


8.9. 


8.10. 


8.12. 


Zero torque from internal forces +s 


Given a collection of particles with positions rj, let the force on the ith 
particle due to all the others be F int, Assuming that the force between 
any two particles is directed along the line between them, use Newton’s 
third law to show that 7; r; x Fi" = 0. 


Removing a support x 


(a) Auniform rod of length £ and mass m rests on supports at its ends. 
The right support is quickly removed (see Fig. 8.30). What is the 
force from the left support immediately thereafter? 

(b) A rod of length 27 and moment of inertia mr? rests on top of 
two supports, each of which is a distance d away from the center. 
The right support is quickly removed (see Fig. 8.30). What is the 
force from the left support immediately thereafter? 


. Falling stick « 


A massless stick of length b has one end pivoted on a support and the 
other end glued perpendicular to the middle of a stick of mass m and 
length £. 


(a) If the two sticks are held in a horizontal plane (see Fig. 8.31) and 
then released, what is the initial acceleration of the CM? 

(b) If the two sticks are held in a vertical plane (see Fig. 8.31) and 
then released, what is the initial acceleration of the CM? 


Pulling a cylinder «x 


In Exercise 8.50 below, the cylinder moves directly to the right. The fact 
that it doesn’t have any transverse motion follows from the fact that the 
two segments of the string pull only to the right and therefore cannot 
supply a transverse force. Demonstrate this result again by explicitly 


Fig. 8.28 
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Fig. 8.32 


8.14. 


Fig. 8.33 


G 8.16. 


integrating the string’s force on the cylinder over the semicircle of con- 
tact. (The N = T d0 result from the “Rope wrapped around a pole” 
example in Section 2.1 will come in handy.) 


. Oscillating ball +x 


A small ball with radius r and uniform density rolls without slipping 
near the bottom of a fixed cylinder of radius R (see Fig. 8.32). What is 
the frequency of small oscillations? Assume r < R. 


Oscillating cylinders +» 


A hollow cylinder of mass M, and radius R; rolls without slipping on 
the inside surface of another hollow cylinder of mass M2 and radius 
R2. Assume Rı < R2. Both axes are horizontal, and the larger cylin- 
der is free to rotate about its axis. What is the frequency of small 
oscillations? 


. Lengthening the string «« 


A mass hangs from a massless string and swings around in a hori- 
zontal circle, as shown in Fig. 8.33. The length of the string is then 
very slowly increased (or decreased). Let 0, £, r, and A be defined as 
shown. 


(a) Assuming that 0 is very small, how does r depend on £? 
(b) Assuming that 0 is very close to 2/2, how does A depend on £? 


A triangle of cylinders *«« 


Three identical cylinders with moments of inertia = BmR? are situated 
ina triangle as shown in Fig. 8.34. Find the initial downward acceleration 
of the top cylinder for the following two cases. Which case has a larger 
acceleration? 


(a) There is friction between the bottom two cylinders and the ground 
(so they roll without slipping), but there is no friction between 
any of the cylinders. 

(b) There is no friction between the bottom two cylinders and the 
ground, but there is friction between the cylinders (so they don’t 
slip with respect to each other). 


. Falling chimney *xxx 


A chimney initially stands upright. It is given a tiny kick, and it topples 
over. At what point along its length is it most likely to break? In doing this 
problem, work with the following two-dimensional simplified model of 
a chimney. Assume that the chimney consists of boards stacked on top 
of each other, and that each board is attached to the two adjacent ones 
with tiny rods at each end (see Fig. 8.35). The goal is to determine 
which rod in the chimney achieves the maximum tension. Work in the 
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approximation where the width of the chimney is very small compared 
with the height. 


Section 8.5: Collisions 


8.18. 


8.19. 


Ball hitting stick «+ 


A ball of mass M collides with a stick with moment of inertia I = Bm€? 
(relative to its center, which is its CM). The ball is initially traveling 
at speed Vp perpendicular to the stick. The ball strikes the stick at a 
distance d from the center (see Fig. 8.36). The collision is elastic. Find 
the resulting translational and rotational speeds of the stick, and also the 
resulting speed of the ball. 


A ball and stick theorem +x 


Consider the setup in Problem 8.18. Show that the relative speed 
of the ball and the point of contact on the stick is the same before 
and immediately after the collision. (This result is analogous to 
the “relative speed” result for a 1-D collision, Theorem 5.3 in 
Section 5.7.1) 


Section 8.6: Angular impulse 


8.20. 


8.21. 


8.22. 


The superball «« 

A ball with radius R and J = (2/5)mR? is thrown through the air. It 
spins around the axis perpendicular to the (vertical) plane of the motion. 
Call this the x-y plane. The ball bounces off a floor without slipping 
during the time of contact. Assume that the collision is elastic, and that 
the magnitude of the vertical v, is the same before and after the bounce. 
Show that v’. and w’ after the bounce are related to vy and œ before the 


bounce by 
vl al 3 -4 Vy 
eo. ae Bete. een 


where positive vy is to the right, and positive w is counterclockwise. 


Many bounces « 

Using the result of Problem 8.20, describe what happens over the course 
of many superball bounces. 

Rolling over a bump ++ 


A ball with radius R and moment of inertia J = (2/5)mR? rolls with 
speed Vo without slipping on the ground. It encounters a step of height 
h and rolls up over it. Assume that the ball sticks to the corner of 
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the step briefly (until the center of the ball is directly above the cor- 
ner). Show that if the ball is to climb over the step, then Vo must 


satisfy 
[0gh 5A\ T! 
Vo > 4 —— | 1 — — : 8. 
02> ( m7) (8.63) 


After buttering your toast (assumed to be a uniform rigid square of 
side length £, for the sake of having a doable problem) one morning and 
holding it in a horizontal position with buttered side up, you accidentally 
drop it from a height H above a counter, which itself is a height h above 
the ground. The toast is oriented “parallel” to the counter, and as it falls, 
one edge barely clips the counter (elastically), causing the toast to spin. 
What value of H, in terms of h and £, yields the unfortunate scenario 
where the toast makes half of a revolution, landing buttered side down 
on the floor? There is a special value of £ in terms of h. What is it, and 
why is it special? 


. Falling toast xx 


Sliding to rolling «« 
A ball initially slides without rotating on a horizontal surface with friction 


(see Fig. 8.37). The initial speed is Vo, and the moment of inertia around 
the center is I = BmR?. 


(a) Without knowing anything about the nature of the friction force, 
find the speed of the ball when it begins to roll without slipping. 
Also, find the kinetic energy lost while sliding. 

(b) Now consider the special case where the coefficient of kinetic 
friction is u, independent of position. At what time, and at what 
distance, does the ball begin to roll without slipping? Verify that 
the work done by friction equals the energy loss calculated in part 
(a). (Be careful on this.) 


. Lots of sticks xxx 


Consider a collection of rigid sticks of length 2r, masses m;, and 
moments of inertia Bmir’, with mj) >> m > m > ---. The CM 
of each stick is located at its center. The sticks are placed on a horizon- 
tal frictionless surface, as shown in Fig. 8.38. The ends overlap a tiny 
distance in the y direction and are a tiny distance apart in the x direc- 
tion. The first (heaviest) stick is given an instantaneous blow (as shown) 
which causes it to translate and rotate. The first stick will strike the 
second stick, which will then strike the third stick, and so on. Assume 
that all the collisions are elastic. Depending on the size of £, the speed 
of the nth stick will either (1) approach zero, (2) approach infinity, or 
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(3) be independent of n, as n —> oo. Show that the special value of £ 
corresponding to the third of these three scenarios is 8 = 1/3, which 
happens to correspond to a uniform stick. 
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Section 8.1: Pancake object in x-y plane 


8.26. 


8.27. 


8.28. 


8.29. 


Swinging stick +» 

A uniform stick of length LZ is pivoted at its bottom end and is initially 
held vertical. It is given an infinitesimal kick, and it swings down around 
the pivot. After three-quarters of a turn (in the horizontal position shown 
in Fig. 8.39), the pivot is somehow vaporized, and the stick flies freely 
up in the air. What is the maximum height of the center of the stick in 
the resulting motion? At what angle is the stick tilted when the center 
reaches this maximum height? 


Atwood’s with a cylinder +» 


A massless string of negligible thickness is wrapped around a uniform 
cylinder of mass m and radius r. The string passes up over a massless 
pulley and is tied to a block of mass m at its other end, as shown in 
Fig. 8.40. The system is released from rest. What are the accelerations 
of the block and the cylinder? Assume that the string does not slip with 
respect to the cylinder. Use conservation of energy (after applying a 
quick F = ma argument to show that the two objects move downward 
with the same acceleration). 


Board and cylinders ++ 


A board lies on top of two uniform cylinders that lie on a fixed plane 
inclined at an angle 0, as shown in Fig. 8.41. The board has mass m, and 
each of the cylinders has mass m/2. The system is released from rest. If 
there is no slipping between any of the surfaces, what is the acceleration 
of the board? Use conservation of energy. 


Moving plane xxx 


A ball of mass m and moment of inertia Z = mr? is held motionless ona 
plane of mass M and angle of inclination 0 (see Fig. 8.42). The plane rests 
on a frictionless horizontal surface. The ball is released. Assuming that 
it rolls without slipping on the plane, what is the horizontal acceleration 
of the plane? Hint: You might want to do Problem 3.8 first. But as 
with all the exercises in this section, use conservation of energy instead 
of force and torque; this problem gets extremely messy with the latter 
strategy. 
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Section 8.2: Nonplanar objects 


8.30. 


Semicircle CM x 


A wire is bent into a semicircle of radius R. Find the CM. 


. Hemisphere CM « 


Find the CM of a solid hemisphere. 


Section 8.3: Calculating moments of inertia 


8.32. 


8.34. 


Fig. 8.43 


A cone x 


Find the moment of inertia of a solid cone (mass M, base radius R) 
around its symmetry axis. 


. A sphere « 


Find the moment of inertia of a solid sphere (mass M, radius R) around 
a diameter. Do this by slicing the sphere into disks. 


A triangle, the slick way «« 


In the spirit of Section 8.3.2, find the moment of inertia of a uniform 
equilateral triangle of mass m and side £, around a line joining a vertex 
and the opposite side; see Fig. 8.43. 


. Fractal triangle «« 


Take an equilateral triangle of side £, and remove the “middle” trian- 
gle (1/4 of the area). Then remove the “middle” triangle from each 
of the remaining three triangles, and so on, forever. Let the final 
fractal object have mass m. In the spirit of Section 8.3.2, find the 
moment of inertia around a line joining a vertex and the opposite side; 
see Fig. 8.44. 


Section 8.4: Torque 


Fig. 8.44 


8.36. 


Swinging your arms « 


You are standing on the edge of a step on some stairs, facing up the 
stairs. You feel yourself starting to fall backward, so you start swinging 
your arms around in vertical circles, like a windmill. This is what people 
tend to do in such a situation, but does it actually help you not to fall, or 
does it simply make you look silly? Explain your reasoning. 


. Rolling down the plane « 


A ball with moment of inertia Bmr? rolls without slipping down a plane 
inclined at an angle 6. What is its linear acceleration? 


8.39. 


8.40. 


8.41. 


8.42. 


8.43. 


8.44. 


8.45. 


8.46. 
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. Coin ona plane x 


Auniform coin rolls down a plane inclined at an angle 0. If the coefficient 
of static friction between the coin and the plane is u, what is the largest 
angle 6 for which the coin doesn’t slip? 


Accelerating plane « 

A ball with J = (2/5)MR? is placed ona plane inclined at an angle 6. The 
plane is accelerated upwards (along its direction) with acceleration a; 
see Fig. 8.45. For what value of a does the CM of the ball not move? 
Assume that there is sufficient friction so that the ball doesn’t slip with 
respect to the plane. 


Bowling ball on paper « 


A bowling ball sits on a piece of paper on the floor. You grab the paper 
and pull it horizontally along the floor, with acceleration ao. What is the 
acceleration of the center of the ball? Assume that the ball does not slip 
with respect to the paper. 


Spring and cylinder « 

The axle of a solid cylinder of mass m and radius r is connected to a 
spring with spring constant k, as shown in Fig. 8.46. If the cylinder rolls 
without slipping, what is the frequency of the oscillations? 

Falling quickly « 

A massless stick of length L is pivoted at one end and has a mass m 
attached to its other end. It is held in a horizontal position, as shown in 
Fig. 8.47. Where should a second mass m be attached to the stick, so 
that the stick falls as fast as possible when dropped? 

Maximum frequency « 

A pendulum is made of a uniform stick of length L. It is allowed to swing 
in a vertical plane. Where should the pivot be placed on the stick so that 
the frequency of (small) oscillations is maximum? 

Massive pulley x 

Solve Problem 8.1 again, but now use force and torque instead of 
conservation of energy. 

Atwood’s with a cylinder +x 

Solve Exercise 8.27 again, but now use force and torque instead of 
conservation of energy. 

Board and cylinders «++ 


Solve Exercise 8.28 again, but now use force and torque instead of 
conservation of energy. 


Fig. 8.45 


Fig. 8.46 


Fig. 8.47 
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8.47. The spool «« 


A spool of mass m and moment of inertia Z is free to roll without slipping 
on a table. It has an inner radius r, and an outer radius R. If you pull 
= on the string with tension T at an angle 0 (see Fig. 8.48), what is the 
acceleration of the spool? Which way does it move? 


N 


8.48. Stopping the coin +x 


Fig. 8.48 A coin stands vertically on a table. It is projected forward (in the plane 
of itself) with speed v and angular speed w, as shown in Fig. 8.49. The 
coefficient of kinetic friction between the coin and the table is u. What 
should v and w be so that the coin comes to rest (both translationally 
and rotationally) a distance d from where it started? 


— 8.49. Measuring g +x 


(a) Consider an extended pendulum whose CM is a distance £ from 
H the pivot, and whose moment of inertia around the pivot is /. Show 
that the frequency of small oscillations is œ = ./mg@/T, which 
Fig. 8.49 gives T = 27 /w = 2r ~T /mgt, and hence g = 47°I /(mlT?). 
Therefore, by measuring J, m, £, and T, you can determine g. 
However, if the pendulum has an odd shape, it may be difficult 
to determine /. Consider, then, the following alternative method 
of measuring g. 

(b) For simplicity, assume that the pendulum is planar. Pick an arbi- 
trary point as the pivot and measure the period, 7, of small 
oscillations. Then with the pendulum at rest, draw a vertical line 
through this pivot. By trial and error, find another pivot point 
on this line on the same side of the CM° (you may need to 
extend the line with a massless extension) that yields the same 
period T. Let L be the sum of the lengths from these two points 
to the CM.!° Show that g is given by g = 42°L/T?, which is 
independent of m and 7. 


8.50. Pulling a cylinder «+ 


Asolid cylinder of mass m and radius r lies flat ona frictionless horizontal 
table, with a massless string running halfway around it, as shown in 


The CM can be found by hanging the pendulum from a point not on the drawn line, and then 
drawing a vertical line through this point. The intersection of the two lines is the CM. 

There are also two points on the other side of the CM that yield the same period (except in the 
special case where the two points coincide and produce the minimum period, as in Exercise 8.43, 
but don’t worry about this). These two other points are the same distances from the CM as the 
original two points, as you can show. So if you happen to find all four points, you can obtain L by 
instead measuring the distance from the “inside” point on one side to the “outside” point on the 
other. This method doesn’t require knowing the location of the CM. 


8.52. 


8.54. 
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Fig. 8.50. A mass also of mass m is attached to one end of the string, 
and you pull on the other end with a force T. The circumference of the 
cylinder is sufficiently rough so that the string does not slip with respect 
to it. What is the acceleration of the mass m attached to the end of the 
string? 


. Coin and plank +» 


A coin of mass M and radius R stands vertically on the right end of 
a horizontal plank of mass M and length L, as shown in Fig. 8.51. 
The system starts at rest. The plank is then pulled to the right with a 
constant force F. Assume that the coin does not slip with respect to 
the plank. What are the accelerations of the plank and coin? How far 
to the right does the coin move by the time the left end of the plank 
reaches it? 


Cylinder, board, and spring +» 

A board of mass m, which is free to slide on a frictionless floor, is 
connected by a spring (with spring constant k) to a wall. A cylinder, 
also of mass m (and J = mR*/2), rests on top of the board, as shown in 
Fig. 8.52, and is free to roll without slipping on the board. If the board 
and the cylinder are pulled some distance to the left and then released 
from rest, what is the frequency of the resulting oscillatory motion? 


. Swirling around a cone x» 


A fixed hollow frictionless cone is positioned with its tip pointing down. 
A particle is released from rest on the inside surface. After it has slid 
halfway down to the tip, it bounces elastically off a platform. The plat- 
form is positioned at a 45° angle along the surface of the cone, so the 
particle ends up being deflected horizontally along the surface (in other 
words, into the page in Fig. 8.53). The particle then swirls up and 
around the cone before coming down. Measured from the tip of the 
cone, show that the ratio of the particle’s maximum swirling height to 
the height of the platform is (v5 + 1)/2, which just happens to be the 
golden ratio. 

Raising a hoop ++ 

A bead of mass m is positioned at the top of a frictionless hoop of mass 
M and radius R, which stands vertically on the ground. A wall touches 
the hoop on its left, and a short wall of height R touches the hoop on 
its right, as shown in Fig. 8.54. All surfaces are frictionless. The bead 
is given a tiny kick, and it slides down the hoop, as shown. What is the 
largest value of m/M for which the hoop never rises up off the ground? 
Note: It’s possible to solve this problem by using only force, but solve 
it here by using torque. 
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Block and cylinder x» 


A block and a cylinder (with J = BmR?), both of mass m, lie on a plane 
(inclined at an angle 0), touching each other as shown in Fig. 8.55. 
There is sufficient friction between the cylinder and the plane so that 
it rolls without slipping. But the bottom of the block is coated with 
grease, so there is no friction between it and the plane. However, there 
is a coefficient of kinetic friction u = 1 between the block’s right face 
and the cylinder. What is the acceleration of the block? How does your 
answer compare with the result for a lone cylinder rolling down a plane? 
Assume that 6 is small enough so that the block’s bottom face remains in 
contact with the plane at all times; what is the condition on 0 for which 
this is true? 


Falling and sliding stick ««« 


One end of a uniform stick is attached to a pivot, and the pivot is free 
to slide along a frictionless horizontal rail. The stick is held at an initial 
angle 69 with respect to the (upward) vertical direction and then released; 
see Fig. 8.56. Assume that the stick can somehow swing down below 
the horizontal position without running into the rail (perhaps by having 
the pivot attached to the side of the rail, so that the stick is shifted 
horizontally a small distance from the rail). 


(a) Show that when the stick is horizontal, the normal force N from 
the rail equals mg/4, independent of 6p. 

(b) If 9 = 0 (a tiny kick is allowed), show that N = 13mg when the 
stick is at the bottom of its motion (at 9 = 7). 

(c) If) = 0, show that the minimum N occurs at 9 ~ 61.5° and that 
the value is Nmin © (0.165)mg. You will obtain a cubic; feel free 
to solve it numerically. 


. Tower of cylinders x*** 


Consider the infinitely tall system of massless planks and identical mas- 
sive cylinders shown in Fig. 8.57. The moment of inertia of the cylinders 
is Z = MR?/2. There are two cylinders at each level, and the number of 
levels is infinite. The cylinders do not slip with respect to the planks, 
but the bottom plank is free to slide on a table. If you pull on the bottom 
plank so that it accelerates horizontally with acceleration a, what is the 
horizontal acceleration of the bottom row of cylinders? 


Section 8.5: Collisions 


8.58. 


Pendulum collision « 


A stick of mass m and length £ is pivoted at an end. It is held horizontal 
and then released. It swings down, and when it is vertical the free end 


8.59. 


8.60. 


8.61. 


8.62. 


8.63. 
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elastically collides with a ball, as shown in Fig. 8.58. (Assume that 
the ball is initially held at rest and then released a split second before 
the stick strikes it.) If the stick loses half of its angular velocity during 
the collision, what is the mass of the ball? What is the speed of the ball 
right after the collision? 


No final rotation x 


A stick of mass m and length £ spins around on a frictionless horizontal 
table, with its CM at rest (but not fixed by a pivot). A ball of mass M is 
placed on the table, and one end of the stick collides elastically with it, 
as shown in Fig. 8.59. What should M be so that after the collision the 
stick has translational motion, but no rotational motion? 


Same final speeds « 


A stick slides perpendicular to itself (without rotating) across a friction- 
less horizontal table and collides elastically at one of its ends with a 
stationary ball. Both the stick and the ball have mass m. The mass of the 
stick is distributed in such a way that the moment of inertia around the 
CM (which is at the center of the stick) is Z = Am€?, where A is some 
number. What should A be so that the ball moves at the same speed as 
the center of the stick after the collision? 


Perpendicular deflection ++ 


A mass M moves at speed Vo perpendicular to a dumbbell at rest on 
a frictionless horizontal table, as shown in Fig. 8.60. The dumbbell 
consists of two masses m at the ends of a massless rod of length £. 
The mass M collides elastically with one of the masses (not head-on), 
and afterwards it is observed that M moves perpendicular to its original 
direction, with speed u. What is u in terms of Vo, m, and M? What is the 
smallest value of m (in terms of M) for which this scenario is possible? 


Glancing off a stick xx 


A frictionless stick of mass m and length £ lies at rest on a frictionless 
horizontal table. A mass km (where k is some number) moves with speed 
vo at a 45° angle to the stick and collides elastically with it very close 
to an end; see Fig. 8.61. What should k be so that the mass ends up 
moving in the y-direction, as shown? Hint: Remember that the stick is 
frictionless. 


Sticking to a dumbbell « 


A mass m moves at speed v perpendicular to a dumbbell at rest on 
a frictionless horizontal table, as shown in Fig. 8.62. The dumbbell 
consists of two masses also of mass m at the ends of a massless rod 
of length £. The moving mass collides and sticks to one of the masses. 
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What is the resulting w of the system? What is the velocity of the end of 
the rod that has the two masses on it, after the rod has made one half of 
a revolution? 


. Colliding sticks +» 


On a frictionless horizontal table, a stick of mass m and length £ spins 
around a pivot at one of its ends with angular frequency œ. It collides and 
sticks to an identical stick, with an overlap length equal to x, as shown 
in Fig. 8.63. Immediately before the collision, the pivot is removed. 
What should x be so that after the collision the double-stick system has 
translational but no rotational motion? 


. Lollipop +» 


A hockey puck of mass m and radius R slides across frictionless ice, as 
shown in Fig. 8.64 (the view is from above). It has translational speed 
v to the right and rotational speed w clockwise. It grazes the “top” end 
of a rod of mass m and length 2R which is initially at rest on the ice. 
It sticks to the rod, forming a rigid object that looks like a lollipop. 


(a) In the special case of v = Rw, what is the resulting angular speed 
of the lollipop? 

(b) How much energy is lost during the collision? How do you explain 
the fact that energy is lost, given that the v = Rw condition implies 
that the contact point on the puck touches the rod with zero relative 
speed (and thus doesn’t crash into it, as is commonly the case with 
inelastic collisions)? 

(c) Given w, show that v should equal 6Rw/5 if you want the 
minimum amount of energy to be lost. 


Pencil on a plane xxx» 


This exercise deals with the terminal velocity of a “pencil” rolling down 
a plane. To simplify things, we'll assume that the pencil has all its mass 
on its axis. And to avoid messy complications, we’ll assume that the 
cross section of the pencil looks like a wheel with six equally spaced 
massless spokes and no rim (see Fig. 8.65).!! Let the length of the spokes 
be r, and let the plane be inclined at an angle 0. Assume that there is 
sufficient friction to prevent the spokes from slipping on the plane, and 
assume that the pencil does not bounce when a spoke hits the plane. 


(a) Explain qualitatively why the pencil reaches a terminal (average) 
velocity, assuming that it remains in contact with the plane at all 
times. 


1L Tf the pencil instead looks like a hexagon with flat sides, then it is impossible to say how it 


behaves, because if the sides bow outward an infinitesimal amount then the system conserves 


energy, whereas if they bow inward an infinitesimal amount then it does not (for reasons you will 


figure out). 
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(b) Assume that conditions have been set up so that the pencil 
eventually reaches a nonzero terminal (average) velocity, while 
remaining in contact with the plane at all times. Describe this 
terminal velocity. You may do this by stating the maximum speed 
of the axis in the limiting steady state. 

What is the minimum value of 6 for which a nonzero terminal 
velocity exists? An initial kick to the pencil is allowed. 

What is the maximum value of 8 for which the pencil remains in 
contact with the plane at all times? As a check on your answer, 
the difference between the answers in parts (c) and (d) is about 
5.09°. 


(c 
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. Striking a pool ball « 


At what height should you horizontally strike a pool ball so that it 
immediately rolls without slipping? 


Center of percussion + 


You hold one end of a uniform stick of length L. The stick is struck 
with a hammer. Where should this blow occur so that the end you are 
holding doesn’t move (immediately after the blow)? In other words, 
where should the blow occur so that you don’t feel a “sting” in your 
hand? This point is called the center of percussion. 


Another center of percussion « 


You hold the top vertex ofa solid equilateral triangle of side length L. The 
plane of the triangle is vertical. It is struck with a hammer, somewhere 
along the vertical axis. Where should this blow occur so that the point 
you are holding doesn’t move (immediately after the blow)? Use the 
fact that the moment of inertia about any axis through the CM of an 
equilateral triangle is mL7/24. 


Not hitting the pole «x 


A (possibly non-uniform) stick of mass m and length £ lies on frictionless 
ice. Its midpoint (which is also its CM) touches a thin pole sticking out 
of the ice. One end of the stick is struck with a quick blow perpendicular 
to the stick, as shown in Fig. 8.66, so that the CM moves away from the 
pole. What is the minimum value of the stick’s moment of inertia that 
allows the stick not to hit the pole? 


Pulling the paper «x 


A ball sits at rest on a piece of paper on a table. You pull the paper 
in a straight line out from underneath the ball. You are free to pull the 
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paper in an arbitrary (straight line) manner, frontward or backward. You 
may even give it abrupt, jerky motions, so that the ball slips with respect 
to it. After the ball comes off the paper, it will eventually roll without 
slipping on the table. Show (and you are encouraged to experimentally 
verify this) that the ball in fact ends up at rest. (The generalization of 
this fact is given in Problem 9.29.) Is it possible to pull the paper in such 
a way that the ball ends up exactly where it started? 


. Up, down, and twisting +» 


A uniform stick is held horizontally and then released. At the same 
instant, one end is struck with a quick upwards blow. If the stick ends 
up horizontal when it returns to its original height, what are the possible 
values for the maximum height to which the center rises? 


. Doing work +» 


(a) A pencil of mass m and length £ lies at rest on a frictionless table. 
You push on it at its midpoint (perpendicular to it), with a constant 
force F for a time t. Find the final speed and the distance traveled. 
Verify that the work you do equals the final kinetic energy. 

(b) Assume that you apply the same force F for the same time t 
as above, but that you now apply it at one of the pencil’s ends 
(perpendicular to the pencil). Assume that ¢ is small, so that the 
pencil doesn’t have much time to rotate (this means that you can 
assume that your force is always essentially perpendicular to the 
pencil, as far as the torque is concerned). Find the final CM speed, 
the final angular speed, and the distance your hand moves. Verify 
that the work you do equals the final kinetic energy. 


Bouncing between bricks ««« 


A stick of length £ slides on frictionless ice. It bounces elastically 
between two parallel fixed bricks, a distance L apart, in such a way 
that the same end touches both bricks, and the stick hits the bricks at 
an angle 0 each time. See Fig. 8.67. What is 0 in terms of L and £ 
(an implicit equation is fine)? Draw a reasonably accurate picture of 
what the situation looks like in the limit L « £. 

What should 6 be in terms of L and £ (again, an implicit equation 
is fine), if the stick makes an additional n half revolutions between the 
bricks? What is the minimum value of L/£ for which n half revolutions 
are possible? 


. Repetitive bouncing « 


Using the result of Problem 8.20, what must the relation between vy and 
Rw be so that a superball continually bounces back and forth between 
the same two points of contact on the ground? 
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8.76. Bouncing under a table +x 


You throw a superball so that it bounces off the floor, then off the under- 
side of a table, then off the floor again. What must the initial relation 
between vy and Ro be so that the ball returns to your hand, with the 
outward and return paths the same? Use the result of Problem 8.20, and 
modifications thereof. !? 


8.77. Bouncing under a table again «x«« 


Consider the setup in the previous exercise, where we assumed that the 
outward and return paths were the same. Is this trajectory (where the 
path retraces itself) the only possible one for which the ball returns to 
your hand? Show!? that the answer is “yes” unless t; = 7t2, where tı 
is the time the ball spends between your hand and the floor (which is 
the same out and back, because the magnitude of v, isn’t changed by 
a bounce), and f is the time the ball spends between the floor and the 
table (again, the same out and back). For the special case where f = 7f2, 
you will find that the ball returns to your hand for any initial relation 
between vy and Ro. !4 

For a ball with a general moment of inertia J = Bmr*, show that 
the answer is always “yes,” without exception, if 6 < 1/3 (which 
corresponds to a wheel with massive spokes and a massless rim). Also, 
show that if 6 = 1 (ahoop), then the ti = 7f condition becomes ft; = h2. 
In other words, if you throw a “super-hoop” from the same height as 
the table, then no matter how you throw it (as long as the throw is 
downward and in the plane of the hoop), it will return to your hand. This 
is a little more believable if you look at the remark in the solution to 
Problem 8.20. 


8.9 Solutions 


8.1. Massive pulley 
The two masses have equal speeds at all times. Let v be their common speed after they 
have each moved a distance d. If 2m falls a distance d (so that m rises a distance d), 
then the change in potential energy is —2mgd + 2mgd = —mgd. The total kinetic 
energy is 


1 1 1 
K ==mv 4 mw? a 
2 2 2 


12 You are strongly encouraged to bounce a ball in such a manner and have it magically come back 
to your hand. It turns out that the required value of w is small, so a natural throw with w ~ 0 
essentially gets the job done. 

13 You will want to use Mathematica or some other aid to keep track of the matrix multiplications, 
especially in the second part of this exercise, which would be completely intractable otherwise. 

'4 I learned of this extremely bizarre fact from Howard Georgi. 
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7 
= sme, (8.64) 


where we have used the nonslipping condition, v = rw. Conservation of energy 
therefore gives 
2 4 


7 
0= g” mgd v= gd. (8.65) 


The usual kinematic result v = /2ad holds here, so we obtain a = 2g/7. 


Leaving the sphere 
In this setup, as in Problem 5.3, the ball leaves the sphere when the normal force 
becomes zero, that is, when 


mv? 
F mg cos 0. (8.66) 


The only change from the solution to Problem 5.3 comes in the calculation of v. The 
ball now has rotational energy, so conservation of energy gives mgR(1 — cos 0) = 
mv? /2 + Iw? /2 = mv? /2 + Bmr?o*/2. But the nonslipping condition is v = rw, so 
we have 


la + B)mv? = mgR(1 — cos 0) a= a. (8.67) 


Plugging this into Eq. (8.66), we see that the ball leaves the sphere when 
2 
3+8. 


cos = (8.68) 


REMARKS: For 6 = 0, this equals 2/3, as in Problem 5.3. For a uniform ball with 
B = 2/5, we have cos 0 = 10/17, so 8 ~ 54°. For B —> oo (for example, a spool with 
a very thin axle rolling down the rim of a circle), we have cos > 0, so 0 © 90°. 
This makes sense because v is always very small, since most of the energy takes the 
form of rotational energy. The coefficient of friction would have to be very large in 
this case, of course, to keep the spool from slipping near 0 ~ 90°. & 


If the size of the ball is comparable to, or larger than, the size of the sphere, then we 
must take into account the fact that the CM of the ball does not move along a circle of 
radius R. Instead, it moves along a circle of radius R + r, so Eq. (8.66) becomes 


mv? 


R+r 


= mg_cosé@. (8.69) 


Also, the conservation of energy equation takes the form, mg(R + r)(1 — cos@) = 
mv? /2 + Bmr?w?/2. But v still equals rw (because the ball can be considered to be 
instantaneously rotating around the contact point with angular speed w), so the kinetic 
energy still equals (1 + 6)mv*/2. The conservation of energy statement is thus 


La } B)mv? = mg(R+r)(1 — cos 0). (8.70) 


We therefore have the same equations as above, except that R is replaced everywhere by 
R +r. But R didn’t appear in the result for 6 in Eq. (8.68), so the answer is unchanged. 


REMARK: The method of the second solution to Problem 5.3 does not work in this 
problem, because there is a force available to make v, decrease, namely the friction 


8.3. 


8.9 Solutions 


force. And indeed, vy does decrease before the rolling ball leaves the sphere. At a given 
value of 6, the v in the present problem is simply 1/ V/T + £ times the v in Problem 
5.3, so the maximum vy is achieved here at cos@ = 2/3, just as in Problem 5.3. But 
the angle in Eq. (8.68) is larger than this, so vy decreases while the ball is between 
these two angles. (However, see the following problem for a setup involving rotations 
where the max vy is relevant.) & 


Sliding ladder 


The important point to realize in this problem is that the ladder loses contact with the 
wall before it hits the ground. So we need to find where this loss of contact occurs. Let 
r = €/2, for convenience. While the ladder is in contact with the wall, its CM moves 
in a circle of radius r. This follows from the fact that the median to the hypotenuse of 
a right triangle has half the length of the hypotenuse. Let @ be the angle between the 
wall and the radius from the corner to the CM (see Fig. 8.68). This is also the angle 
between the ladder and the wall. 

We’ll solve this problem by assuming that the CM always moves in a circle, and 
then determining the position at which the horizontal CM speed starts to decrease, that 
is, the point at which the normal force from the wall would have to become negative. 
Since the normal force of course can’t be negative, this is the point where the ladder 
loses contact with the wall. 

By conservation of energy, the kinetic energy of the ladder equals the loss in potential 
energy, which is mgr(1 — cos@). This kinetic energy can be broken up into the CM 
translational energy plus the rotation energy. The CM translational energy is mr?6?/2, 
because the CM travels in a circle of radius r. The rotational energy is /67/2. The 
same Ê applies here as in the CM translational motion, because @ is the angle between 
the ladder and the vertical, and is thus the angle of rotation of the ladder. Letting 
I = mr? to be general (6 = 1/3 for our ladder), the conservation of energy statement 
is (1 +B)mr262 /2 = = mgr(1—cos8@). Therefore, the speed of the CM, which is v = rÅ, 


equals 
2gr(1 — cos 0 
ie 2gr(l — cos0) f (8.71) 
1+£ 


The horizontal component of this is 


Sy JV — cos8) cos 6. (8.72) 


Taking the derivative of ./(1 — cos@)cos@, we see that the horizontal speed is 
maximum when cos @ = 2/3. Therefore the ladder loses contact with the wall when 


2 
cos9=3 = 048.2", (8.73) 


which is independent of £. This means that, for example, a dumbbell (two masses at 
the ends of a massless rod, with 6 = 1) loses contact with the wall at the same angle. 
Plugging the value of 0 from Eq. (8.73) into Eq. (8.72), and using 6 = 1/3, we obtain 
a final horizontal speed of 

v2ar = vt . (8.74) 

3 3 

Note that this is 1/3 of the ./2gr horizontal speed that the ladder would have if it 
were arranged (perhaps by having the top end slide down a curve) to eventually slide 
horizontally along the ground. You are encouraged to compare various aspects of this 
problem with those in Problem 8.2 and Problem 5.3. 


vy = 


Remark: The normal force from the wall is zero at the start and zero at the finish, so 
it must reach a maximum at some intermediate value of 0. Let’s find this 0. Taking the 
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derivative of vy in Eq. (8.72) to find the CM’s horizontal acceleration ay, and then using 
0 x »/1 — cos @ from Eq. (8.71), we see that the force from the wall is proportional to 
sin 0 (3 cos — 2)6 
ay x ———— 

i ~ 1 — cos 


Taking the derivative of this, we find that the force from the wall is maximum when 
cos? = (1 + v19)/6 => 0%26.7° & 


œ sin (3 cos — 2). (8.75) 


Leaning rectangle 

We must first find the position of the rectangle’s CM when it has rotated through an 
angle 0. Using Fig. 8.69, we can obtain this position (relative to the center of the 
cylinder) by adding up the distances along the three shaded triangles. Because there 
is no slipping, the contact point has moved a distance RO along the rectangle. We find 
that the position of the CM is 


(x,y) = R(sin 0, cos 0) + RO(— cos 6, sin 0) + a(sin 0, cos 0). (8.76) 


We’ll now use the Lagrangian method to find the equation of motion and the frequency 
of small oscillations. Using Eq. (8.76), you can show that the square of the speed of 
the CM is 


war 477 = (eC + R070". (8.77) 


The simplicity of this result suggests that there is a quicker way to obtain it. And 
indeed, the CM instantaneously rotates around the contact point with angular speed 
6, and from Fig. 8.69, the distance to the contact point is a? + R262. Therefore, the 
speed of the CM is wr = V'a? + R267. 

The Lagrangian is 


I 4 292\42 . 1742 : 
L=T-V= ame + R°@-)O~ 4 ai mg((R + a) cos@ + RO sin 8). (8.78) 
The equation of motion is, as you can check, 


(ma? + mR76? + DË + mR*06* = mgasin 6 — mgRO cos 0. (8.79) 


Let us now consider small oscillations. Using the small-angle approximations, 
sin@ © @ and cos@ ~ 1 — 67/2, and keeping terms up to first order in 0, we obtain 


(ma? + DË +mg(R — a)0 = 0. (8.80) 


The coefficient of @ is positive ifa < R. Therefore, oscillatory motion occurs ifa < R. 
Note that this condition is independent of b. The frequency of small oscillations is 


/mg(R — a) 
= ,/ ———. 8.81 
s ma? +I (5:31) 


Ifa > R, then the rectangle falls off the cylinder. 


REMARKS: Let’s look at some special cases. If J = 0 (that is, all of the rectangle’s 
mass is located at the CM), we have œw = \/g(R — a)/a?. If in addition a < R, then 
@ ~ ,/gR/a*. You can also derive these results by considering the CM to be a point 
mass sliding in a parabolic potential. If the rectangle is instead a uniform horizontal 
stick, so that a « R, a < b, and I ~% mb? /3, then we have w © ,/3gR/b?. If the 
rectangle is a vertical stick (satisfying a < R), so that b < a and I ~ ma?/3, then we 
have w © \/3g(R — a) /4a?. If in addition a < R, then w © /3gR/4a?. 

Without doing much work, there are two other ways we can determine the condition 
under which there is oscillatory motion. The first is to look at the height of the CM 
(although this is essentially what we ended up doing in the solution above). Using 
small-angle approximations in Eq. (8.76), the height of the CM is y © (R +a) + 


8.5. 


8.9 Solutions 


(R — a)6*/2. Therefore, if a < R, the potential energy increases with 6, so the 
rectangle wants to decrease its 6 and fall back down to the middle. But if a > R, the 
potential energy decreases with 0, so the rectangle wants to increase its 0 and fall off 
the cylinder. 

The second way is to look at the horizontal positions of the CM and the contact 
point. Small-angle approximations in Eq. (8.76) show that the former equals a0 and 
the latter equals RO. Therefore, if a < R then the CM is to the left of the contact point, 
so the torque from gravity (relative to the contact point) makes 0 decrease, and the 
motion is stable. But if a > R then the torque from gravity makes @ increase, and the 
motion is unstable. & 


Mass in a tube 


The Lagrangian is 


c= > (Me) 62 + (imei + Lm? oriel in (8.82) 
=-ļ|- + mx + -mx + mex sin — }sing. ` 
2\3 2 2 ae Elz’ 


The Euler-Lagrange equations are then 


d (ƏL aL i 

-]= mł = mx6? 4 mgsinð, 
dt \ ax ax 
d (IL\ aL i ee Mgt 

-=)= M06 + mx’6) = Ø (8.83 
dt (5) 38 di (; — ) (mex 2 ) eos (8.83) 

1 as i Met 
(jue } mx?) d + 2mxx0 = (mex + 5 ) cose, 


In terms of n = x/£, these equations become 


ij = nô? + sind, 


ee 3 n 
(1+. 3rn2)6 = (sa Z) cose 6rniid, (8.84) 


where r = m/M and g = g/€. Below is a Maple program that numerically finds the 
value of n when @ equals 7/2, in the case where r = 1. As mentioned in Problem 
1.2, this value of n does not depend on g or £, and hence not g. In the program, we’ll 
denote g by g, which we’ll give the arbitrary value of 10. We’ll use q for 6, and n for 
n. Also, we'll denote 6 by q1 and Ö by q2, etc. Even if you don’t know Maple, this 
program should still be understandable. See Section 1.4 for more discussion of solving 
differential equations numerically. 


n:=0: initial n value 

n1:=0: initial n speed 

q:=0: initial angle 

ql:=0: initial angular speed 

e:=.0001: small time interval 

g:=10: value of g/l 

r:=1: value of m/M 

while q<1.57079 do do this process until 
the angle is pi/2 

n2:=n*ql1l*2+g*sin(q): the first E-L equation 

q2:=((3* r*g*n+3*g/2) *cos (q) 

—6*r*n*n1*g1) / (14+3*r*n*2) : the second E-L equation 
n:=nte*nl: how n changes 
nl:=nlt+e*n2: how nl changes 
q:=qte*ql: how q changes 
ql:=ql+e*q2: how ql changes 
end do: stop the process 
n; print the value of n 
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8.7. 


The resulting value of 7 is 0.378. If you actually run this program on Maple with 
different values of g, you will find that the result for n doesn’t depend on g, as stated 
above. A few results for ņ for various values of r are, in (r, n) notation: (0, 0.349), 
(1, 0.378), (2, 0.410), (10, 0.872), (20, 3.290). It turns out that r ~ 11.25 yields n © 1. 
That is, the mass m gets to the end of the tube right when the tube becomes vertical. 

For 7 values larger than 1, we could imagine attaching a massless tubular extension 
on the end of the given tube. It turns out that n —> oo as r —> ov. In this case, the 
mass m drops nearly straight down, causing the tube to quickly swing down to a nearly 
vertical position. But m ends up slightly to one side and then takes a very long time to 
move over to become directly below the pivot. 


Minimum J 

The shape should be a cylinder with the z axis as its symmetry axis. A quick proof 
(by contradiction) is as follows. Assume that the optimal blob is not a cylinder, and 
consider the surface of the blob. If the blob is not a cylinder, then there exist two 
points on the surface, Pı and P2, that are located at different distances, rı and ro, 
from the z axis. Assume rı < r2 (see Fig. 8.70). If we move a small piece of the 
blob from P2 to P1, then we decrease the moment of inertia, f r? dm. Therefore, the 
proposed noncylindrical blob cannot be the one with the smallest J. In order to avoid 
this contradiction, all points on the surface must be equidistant from the z axis. The 
only blob with this property is a cylinder. 


Slick calculations of J 


(a) We claim that the J for a square of side 2£ is 16 times the Z for a square of 
side £, assuming that the axes pass through any two corresponding points. 
This is true because dm is proportional to the area, which is proportional to 
length squared, so the corresponding dm’s differ by a factor of 4. And then 
there are the two powers of r in the integrand. Therefore, when changing vari- 
ables from one square to the other, there are four powers of 2 in the integral 
J? dm= fr pdxdy. 

As in Section 8.3.2, we can express the relevant relations in terms of pictures: 


21 
I 
-sE 


BERE] 

= + m{— 

L z 

The first line comes from the scaling argument, the second comes from the fact 


that moments of inertia simply add, and the third comes from the parallel-axis 
theorem. Equating the right-hand sides of the first two, and then using the third 


to eliminate [ | gives 


[Jie 


This agrees with the result of Example 12 in Section 8.3.1, with a = b = £. 


8.9 Solutions 


(b) This is again a two-dimensional object, so the / for a triangle of side 2€ is 16 
times the / for a triangle of side £, assuming that the axes pass through any two 
corresponding points. With pictures, we have: 


21 
= 16/N 
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The first line comes from the scaling argument, the second comes from the fact 


that moments of inertia simply add, and the third comes from the parallel-axis 
theorem. Equating the right-hand sides of the first two, and then using the third 


to eliminate e gives 


L\= yn" 
12 


l 


This agrees with the result of Example 11 in Section 8.3.1, with N = 3. The 
“radius” R used in that example equals ¢/./3 in the present notation. 


8.8. Slick calculations of J for fractal objects 


(a) The scaling argument here is a little trickier than the one in Section 8.3.2. Our 
object is self-similar to an object 3 times as big, so let’s increase the length by a 
factor of 3 and see what happens to 7. In the integral f x? dm, the x’s pick up a 
factor of 3, so this gives a factor of 9. But what happens to the dm? Well, tripling 
the size of our object increases its mass by a factor of 2, because the new object 
is made up of two of the smaller ones, plus some empty space in the middle. 
So the dm picks up a factor of 2. Therefore, the J for an object of length 32 is 
18 times the / for an object of length £, assuming that the axes pass through 
any two corresponding points. With pictures, we have (the following symbols 
denote our fractal object): 


31 l 
. = 18 -e— 
1/2 
e = 2(-— -) 
e — — = —0e— + ml? 


The first line comes from the scaling argument, the second comes from the 
fact that moments of inertia simply add, and the third comes from the parallel- 
axis theorem. Equating the right-hand sides of the first two, and then using the 
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third to eliminate e — — gives 


This is larger than the / for a uniform stick, namely m£? /12, because the mass 
here is generally farther away from the center. 


Remark: When we increase the length of our object by a factor of 3, the factor 
of 2 in the dm is larger than the factor of 1 relevant to a zero-dimensional object, 
but smaller than the factor of 3 relevant to a one-dimensional object. So in some 
sense our object has a dimension between 0 and 1. It is reasonable to define the 
fractal dimension, d, of an object as the number for which r@ is the increase in 
“volume” when the dimensions are increased by a factor of r. In this problem, 
we have 34 = 2, so d = log; 2 ~ 0.63. & 


(b) Again, the mass scales in a strange way. Let’s increase the dimensions of our 
object by a factor of 2 and see what happens to 7. In the integral f x? dm, the x’s 
pick up a factor of 2, so this gives a factor of 4. But what happens to the dm? 
Doubling the size of our object increases its mass by a factor of 3, because the 
new object is made up of three of the smaller ones, plus an empty triangle in the 
middle. So the dm picks up a factor of 3. Therefore, the J for an object of side 
2£ is 12 times the / for an object of side £, assuming that the axes pass through 
any two corresponding points. With pictures, we have: 


21 


e = ry +m Gy 


The first line comes from the scaling argument, the second comes from the fact 
that moments of inertia simply add, and the third comes from the parallel-axis 
theorem. Equating the right-hand sides of the first two, and then using the third 


to eliminate e gives 


1 


ry = gi 


This is larger than the / for the uniform triangle in Problem 8.7, namely m€? /12, 
because the mass here is generally farther away from the center. Increasing the 
size of our object by a factor of 2 increases the “volume” by a factor of 3, so the 
fractal dimension is given by 27 = 3 —> d = log, 3 © 1.58. 

(c) Again, the mass scales in a strange way. Let’s increase the dimensions of our 
object by a factor of 3 and see what happens to 7. In the integral f x? dm, the x’s 
pick up a factor of 3, so this gives a factor of 9. But what happens to the dm? 


8.9 Solutions 357 


Tripling the size of our object increases its mass by a factor of 8, because the 
new object is made up of eight of the smaller ones, plus an empty square in the 
middle. So the dm picks up a factor of 8. Therefore, the J for an object of side 
3£ is 72 times the J for an object of side £, assuming that the axes pass through 
any two corresponding points. With pictures, we have: 


31 


1 - 6a) +40) 


° = © + ml? 


2 + mZ)? 


The first line comes from the scaling argument, the second comes from the fact 
that moments of inertia simply add, and the third and fourth come from the 
parallel-axis theorem. Equating the right-hand sides of the first two, and then 


using the third and fourth to eliminate » and gives 


This is larger than the J for the uniform square in Problem 8.7, namely m£? /6, 
because the mass here is generally farther away from the center. Increasing the 
size of our object by a factor of 3 increases the “volume” by a factor of 8, so the | Fit 
fractal dimension is given by 3f = 8 => d = log; 8 ~ 1.89. y 


8.9. Zero torque from internal forces 


Let Fint be the force that the ith particle feels due to the jth particle (see Fig. 8.71). 


Then int 
is 


DD i (8.85) 
J 


The total internal torque on all the particles, relative to the chosen origin, is therefore Fig. 8.71 
m=) rx" =>) r x Fy. (8.86) 
i ij 
But if we interchange the indices (which were labeled arbitrarily), we have 


qint — Yy x Fint =a X N x Fi, (8.87) 
P i 


J J 
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where we have used Newton’s third law, Fint = -Fi Adding the two previous 
equations gives l 


2r™* = > Xoor: — rj) x Fi". (8.88) 
J 


i 


But F int is parallel to (r; — rj), by assumption. Therefore, each cross product in the 
sum equals zero. 

The above sums might make this solution look a bit involved. But the idea is simply 
that the torques cancel in pairs. This is clear from Fig. 8.71, because the two forces 
shown are equal and opposite, and they have the same lever arm relative to the origin. 


8.10. Removing a support 


(a) First soLUTION Let the desired force from the left support be F, and let the 
downward acceleration of the stick’s CM be a. Then the F = ma and t = Ia 
(relative to the fixed support; see Fig. 8.72) equations, along with the circular- 
motion relation between a and qa, are, respectively, 


pa £ me? £ (8.89) 
mg =ma, ME =| 3 ]o Ga 5 o. ; 
The second equation gives a = 3g/2€. The third equation then gives a = 3g/4. 
And the first equation then gives F = mg/4. Note that the right end of the stick 
accelerates at 2a = 3g/2, which is larger than g. 

SECOND SOLUTION Looking at torques around the CM, and also torques around 
the fixed support, we have, respectively, 


pe (me d me- me? ear 
5 ha i m53 je (8:20) 


Dividing the first of these equations by the second gives F = mg/4. 
FIRST SOLUTION As in the first solution above, we have (using the parallel-axis 
theorem; see Fig. 8.73) 


(b 


YS 


mg—F=ma, mgd = (Bmr? + md?)a, a=ad. (8.91) 


Solving for F gives F = mg/(1 + d?/r?). For d =r and B = 1/3, we obtain 
the answer in part (a). 

SECOND SOLUTION As in the second solution above, looking at torques around 
the CM, and also torques around the fixed support, we have, respectively, 


Fd = (Bmr*)a, and mgd = (Bmr* + ma?)a. (8.92) 


Dividing the first of these equations by the second gives F = mg/(1+d7/Br7). 


Remarks: For the special case d = r, we have the following: If 8 = 0 (point 
mass in the middle) then F = 0; if 8 = 1 (dumbbell with masses at the ends) 
then F = mg/2; and if 8 = oo (masses at the ends of long massless extensions 
of the stick) then F = mg. These all make intuitive sense. In the limit d = 0, we 
have F = mg. And in the limit d = oo (using a massless extension), we have 
F = 0. Technically, we should be writing d « ./Brandd>./Brhere. & 


8.11. Falling stick 


(a) Let’s calculate t and L relative to the pivot point. The torque is due to gravity, 
which effectively acts on the CM and has magnitude mgb. The moment of inertia 
of the stick around the horizontal axis through the pivot (and perpendicular to 
the massless stick) is simply mb?. So when the stick starts to fall, the t = dL/dt 


8.12. 


8.13. 


8.9 Solutions 


equation is mgb = (mb)a. Therefore, the initial acceleration of the CM, namely 
ba, is 


ba = g, (8.93) 


which is independent of £ and b. This answer makes sense. The stick initially 
falls straight down, and the pivot provides no force because it doesn’t know right 
away that the stick is moving. 

(b) The only change from part (a) is the moment of inertia of the stick around the 
horizontal axis through the pivot (and perpendicular to the massless stick). From 
the parallel-axis theorem, this moment is mb? + mé/12. So when the stick starts 
to fall, the t = dL /dt equation is mgb = (mb? +m€? /12)æ. Therefore, the initial 
acceleration of the CM is 

——— 
1 + (€2/12b2) 
For £ < b, this goes to g, as it should. And for £ >> b, it goes to zero, as it 
should. In this case, a tiny movement of the CM corresponds to a very large 
movement of the points far out along the stick. Therefore, by conservation of 
energy, the CM must be moving very slowly. 


(8.94) 


Pulling a cylinder 

Consider the net force in the y direction (the transverse direction) on an infinitesimal 
arclength of the cylinder. Measure 0 clockwise from the bottom, as shown in Fig. 8.74. 
The tension increases as 6 increases (assuming Tı > T2), and the friction force Ff on 
the cylinder equals the difference dT in the tension from one end of the small arc to the 
other (using Newton’s third law, and noting that there can be no net force on the string 
because it is massless). The friction force on the cylinder therefore produces a force 
component dT sin @ in the y direction. From the example in Section 2.1, the normal 
force on the cylinder is N = T d0, which yields a force component T dé cos @ in the 
y direction. The net y-force on the cylinder at the little arc is therefore 


dF, = dT sin6 + T d0 cos@ = d(T sin 0). (8.95) 
The total F, on the cylinder is thus 


Fy = fa, = f aT sino) = A(T sin8). (8.96) 


But @ runs from 0 to z, which means that sin 0 starts and ends at zero. Therefore, the 
total change in T sin@ equals zero, and so F, = 0, as desired. Note that nowhere in 
this solution did we assume anything about T. The string might be slipping, and the 
cylinder might be rough in some places and smooth in others, and it doesn’t matter. 
The total F, is still zero (as we know it must be, from the simpler reasoning that 7; 
and T> pull only in the x direction). Physically, what happens is that N is larger in the 
top half of the semicircle than in the bottom half, and the resulting net downward force 
cancels the upward force from the friction. 

The above F, = A(7'sin@) result holds in general, and not just when the string 
wraps around a semicircle. Because of our convention for 0, the signs work out so that 
A(T sin @) is simply the sum of the y components of the tensions 7; and T2, which is 
the answer we expect. 


Oscillating ball 


Let the angle from the bottom of the cylinder to the ball be 6 (see Fig. 8.75), and let 
Fr be the friction force. Then the tangential F = ma equation is 


Ff — mg sin 0 = ma, (8.97) 
where we have chosen rightward to be the positive direction for a and Ff. Also, the 
t = /a@ equation (relative to the CM) is 


2 
—rF; = zma, (8.98) 
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where we have chosen clockwise to be the positive direction for œ. Using the non- 
slipping condition ra = a, the torque equation becomes Fr = —(2/5)ma. Plugging 
this into Eq. (8.97), and using sin ~ 6, we obtain mg@ + (7/5)ma = 0. Under the 
assumption r < R, the center of the ball moves along a circle with radius essentially 
equal to R, so we have a © RÖ, We therefore arrive at 


b+ (ž)o= . (8.99) 


This is the equation for simple harmonic motion with frequency 


/5g 
=,/—. 8.100 
w FR ( ) 


In general, if the ball has a moment of inertia equal to Bmr?, you can show that 
the frequency of small oscillations is ,/g/(1 + 8)R. Note that we needed to use two 
different expressions for a in this solution, namely ra and RÖ. 


Remarks: The answer in Eq. (8.100) is slightly smaller than the ./g/R answer for 
the case where the ball slides. In terms of forces, the reason for this is that the friction 
force causes there to be a smaller net tangential force. In terms of energy, the reason 
is that energy is “lost” in the rotational motion, so the ball ends up moving slower. 

If we omit the r < R assumption, then the ra = a relation still holds, because 
we can consider the ball to be instantaneously rotating around the contact point. But 
the a = RÖ relation is replaced by a = (R — r)@, because the center of the ball 
moves along a circle of radius R — r. Therefore, the exact result for the frequency is 


@ = \/5g/7(R — r). This goes to infinity asr > R.  & 
Oscillating cylinders 


The moments of inertia of the cylinders are J; = Mı R? and h = MpR3. Let F be the 
force between the two cylinders, defined with rightward on the small cylinder being 
positive. Let 0; and 62 be the angles of rotation of the cylinders, with counterclockwise 
positive, relative to the position where the small cylinder is at the bottom of the big 
cylinder. Then the torque equations are 


FR, = M,Ri6;, and FR = —M>R36>. (8.101) 


We are not so much concerned with 6; and 62 as we are with the angular position that 
M, makes with the vertical. Let this angle be 6 (see Fig. 8.76). In the approximation 
Ri < Ro, the nonslipping condition says that R20 ~ R202 — R141, since both sides 
of this equation are expressions for the arclength away from the bottom of the big 
cylinder. Adding Eqs. (8.101) after dividing through by the masses then gives 


1 1 m 
F | — + — | = —R208. 8.102 
(Ge +7) 2 ( ) 


The tangential force equation on My is 
F — Mig sind = M (R26). (8.103) 


Substituting the F from (8.102) into this equation gives, with sin 0 ~ 0, 


1 \, (mM 
(m — )i ( #)o=0. (8.104) 
m + Uh Ro 


After simplifying, the frequency of small oscillations is 


M M: 
ae (2 oe. (8.105) 
Ry \V Mı +2M2 


8.15. 


8.9 Solutions 


Remarks: In the limit M) < Mj, we obtain œ ~ ./g/Ro. In this case, there is 
essentially no friction force between the cylinders, because otherwise the “massless” 
Mhz would have infinite angular acceleration. So there is only a normal force, and the 
small cylinder essentially acts like a pendulum of length R2. In the limit M, < M2, we 
obtain w ~ ./g/2R2. In this case, the big cylinder is essentially fixed, so we simply 
have the setup mentioned in the solution to Problem 8.13, with 8 = 1. & 


Lengthening the string 

Consider the angular momentum relative to the support point P. The forces on the mass 
are the tension in the string and gravity. The former provides no torque around P, and 
the latter provides no torque in the z direction. Therefore, L; is constant. The motion 
is always approximately circular because the length of the string changes very slowly, 
so if we let wy be the frequency of the circular motion when the string has length £, 
then we can say that 


L: = mr’ ar (8.106) 
is constant. The frequency w, can be obtained by using F = ma for the circular motion. 


The tension in the string is essentially mg/ cos 0 (to make the forces in the y direction 
cancel), so the horizontal radial force is mg tan 0. Therefore, 


mg tan = mros = m(£sin Oar o =, 7 T = n (8.107) 


Plugging this into Eq. (8.106), we see that the constant value of L, is 


L: = m? fS (8.108) 
“i 


This holds at all times, so the quantity r2//h is constant. Let’s look at the two cases. 


(a) For@ © 0, we have A ~ £, so Eq. (8.108) says that [Jb is constant. Therefore, 
ro llt, (8.109) 
which means that r grows very slowly as you let the string out when 6 ~ 0. 


(b) For 6 © 2/2, we have r ~ £, so Eq. (8.108) says that 2/s/h is constant. 
Therefore, 


hatt, (8.110) 


which means that h grows very quickly as you let the string out when 6 ~ 7/2. 

Note that Eq. (8.108) says that h œ 74 for any value of 0. So no matter what 
@ is, if you slowly lengthen the string so that r doubles, then h increases by a 
factor of 16. Equivalently, if you draw the string in, the envelope of the motion 


of the mass is a surface of revolution generated by a curve of the form y œ —x4. 


8.16. A triangle of cylinders 


(a) Let N be the normal force between the cylinders, and let F be the friction force 
from the ground (see Fig. 8.77). Let ay be the initial horizontal acceleration of 
the right bottom cylinder (so œ = a,/R is its angular acceleration), and let ay be 
the initial vertical acceleration of the top cylinder, with downward taken to be 
positive. 

If we consider the torque around the center of one of the bottom cylinders, 
then the only relevant force is F, because N, gravity, and the normal force from 
the ground all point through the center. The equations for Fy = ma, on the 
bottom right cylinder, Fy = may on the top cylinder, and t = Ja on the bottom 
right cylinder are, respectively, 


N cos 60° — F = max, 
mg — 2N sin 60° = may, (8.111) 
FR = (mR?) (ax/R). 
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We have four unknowns, N, F, ax, and ay, so we need one more equation. 
Fortunately, a, and a, are related. The contact surface between the top and 
bottom cylinders lies (initially) at an angle of 30° with the horizontal. Therefore, 
if the bottom cylinders move a distance d to the side, then the top cylinder moves 
a distance d tan 30° downward. Hence, 


a, = V3ay. (8.112) 
We now have four equations and four unknowns. Solving for a, by your method 
of choice gives 


g 


a 8.113 
T+ 6B oe 


(b) Let N be the normal force between the cylinders, and let F be the friction 


force between the cylinders, with positive directions shown in Fig. 8.78. Let 
ax be the initial horizontal acceleration of the right bottom cylinder, and let a, 
be the initial vertical acceleration of the top cylinder, with downward taken to 
be positive. Let æ be the angular acceleration of the right bottom cylinder, with 
counterclockwise taken to be positive. Note that a is not equal to a, /R, because 
the bottom cylinders slip on the ground. 

If we consider the torque around the center of one of the bottom cylinders, 
then the only relevant force is F. And from the same reasoning as in part (a), we 
have ay = V3ay. Therefore, the four equations analogous to Eqs. (8.111) and 
(8.112) are 


N cos 60° — F sin 60° = max, 


mg — 2N sin 60° — 2F cos 60° = may, 


8.114 
FR = (BmR?)a, ( ) 


l= V3 ay. 


We have five unknowns, N, F, ax, ay, and a, so we need one more equation. 
The tricky part is relating a to ay. One way to do this is to ignore the y motion 
of the top cylinder and imagine the bottom right cylinder to be rotating up and 
around the top cylinder, which is held fixed. In this rotational motion, the center 
of the bottom cylinder moves at an angle of 30° with respect to the horizontal 
(at the start). So ifit moves an infinitesimal distance d to the right, then its center 
moves a distance d/ cos 30° up and to the right. So the angle through which the 
bottom cylinder rotates is 0 = (d/ cos 30°)/R = (2//3)(d/R). Bringing back 
in the vertical motion of the top cylinder doesn’t change this result. Therefore, 
taking two derivatives of this relation gives 


2 ay 
a=. 
V3R 
We now have five equations and five unknowns. Solving for ay by your method 
of choice gives 


(8.115) 


g 
& 


7+ 8B 


dy = 


(8.116) 


Remarks: If = 0, that is, if all the mass is at the center of the cylinders, then 
the results in both parts (a) and (b) reduce to g/7. This 6 = 0 case is equivalent 
to the case of frictionless cylinders (of any mass distribution), because then 
nothing rotates. If 6 # 0, then the result in part (b) is smaller than that in 
part (a). This isn’t so obvious, but the basic reason is that the bottom cylinders in 
part (b) take up more energy because they have to rotate slightly faster, because 
a= (2/3) (a/R) instead ofa =a,/R. & 


8.9 Solutions 363 


8.17. Falling chimney 


Let 0 be the angle through which the chimney has fallen. Before we start dealing with 
the forces in the rods, let’s first determine Ô as a function of @. Let £ be the height 
of the chimney. Then the moment of inertia around the pivot point on the ground is 
mé? /3 (if we ignore the width). And the torque (around the pivot point) due to gravity 
is t = mg(€/2)sin@. Therefore, t = dL/dt gives mg(€/2)sin@ = (1/3)me26, 
and so 

nE 3g sin 0 l 

2£ 


Let’s now determine the forces in the rods. Our strategy will be to imagine that the 
chimney consists of a chimney of height A, with another chimney of height £ — h placed 
on top of it. We’ll find the forces in the rods connecting these two “‘sub-chimneys,” 
and then we’ll maximize one of these forces (77, defined below) as a function of A. 

The forces on the top piece are gravity and also the forces from the two rods at each 
end of the bottom board. Let’s break these latter forces up into transverse and longi- 
tudinal forces along the chimney. Let Tı and Tz be the two longitudinal components, 
and let F be the sum of the transverse components, as shown in Fig. 8.79. We have 
picked the positive directions for Tı and 7 so that positive Tı corresponds to com- 
pression in the left rod, and positive T) corresponds to tension in the right rod (which 
is what the forces will turn out to be, as we’ll see). It turns out that if the width (which 
we’ll call 27) is much less than the height, then T2 >> F (as we’ll see below), so the 
tension in the right rod is essentially equal to T2. We will therefore be concerned with 
maximizing 75. 

In writing down the force and torque equations for the top piece, we have three 
equations (the radial and tangential F = ma equations, and t = dL /dt around the 
CM), and three unknowns (F, Tı, and T2). If we define the fraction f = h/¢, then 
the top piece has length (1 — f)é and mass (1 — f )m, and its CM travels in a circle of 
radius (1 + f)£/2, Therefore, our three force and torque equations are, respectively, 


(8.117) 


Fig. 8.79 


Tə — Tı + (1 — f)mg cos = (1 —f)m (£ +4) 62, 


2 
F ' , {0+ 
+- f)mgsinð = (1 —f)m 3 0, (8.118) 
= = f\2p2 
Ti + T2)r ae _ =(1 rm (S - £ a 


At this point, we could plow forward and solve this system of three equations in three 
unknowns. But things simplify greatly in the limit r < £. The third equation says that 
Tı + T is of order 1/r, and the first equation says that 7, — T; is of order 1. These 
imply that Ti ~ T2, to leading order in 1 /r. Therefore, we can set Ti + Tz © 27T in the 
third equation. Using this approximation, along with the value of Ë from Eq. (8.117), 
the second and third equations become 


3 
F + (1 —f)mg sind = qu —f?)mg sin 0, 
. (8.119) 
(=F 1 s3 . 
2rT, — F = gl fy mgé sind. 
This first of these equations gives 
inĝ 
= +y- 7), (8.120) 
and then the second gives 
mg sing. : 
ne seers. (8.121) 
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8.18. 


8.19. 


As stated above, this is much greater than F (because £/r >> 1), so the tension in the 
right rod is essentially equal to T2. Taking the derivative of T) with respect to f, we 
see that it is maximum at 


of = 8.122 
J= an (8.122) 


Therefore, the chimney is most likely to break at a point one-third of the way up 
(assuming that the width is much less than the height). Interestingly, f = 1/3 makes 
the force F in Eq. (8.120) exactly equal to zero. For more on the falling chimney, see 
Madsen (1977) and Varieschi and Kamiya (2003). 


Ball hitting stick 


Let V, v, and w be the speed of the ball, the speed of the stick’s CM, and the angular 
speed of the stick, respectively, after the collision. Then conservation of momentum, 
angular momentum (around the fixed point that coincides with the initial center of the 
stick), and energy give 


MVo = MV + mv, 
MVod = MVd + Bme?a, (8.123) 


MV2 = MV? + mv? + pml’. 


We must solve these three equations for V, v, and w. The first two equations quickly 
give vd = £7. Solving for V in the first equation and plugging the result into the 
third, and then eliminating w through vd = B£?w gives 


d 
2V; 2z Vo 
ee — i tE (8.124) 
1+4¢4+5 1404+ 
MT Be MT pe 
Having found v, the first equation above gives V as 
q- myg È 
p= %y— <= (8.125) 
1+ y +per 


You are encouraged to check various limits of these answers. Another solution to 
Eq. (8.123) is of course V = Vo, v = 0, and w = 0. The initial conditions certainly 
satisfy conservation ofp, L, and E with the initial conditions (a fine tautology, indeed). 
Nowhere in Eq. (8.123) does it say that the ball actually hits the stick. 


A ball and stick theorem 
As in the solution to Problem 8.18, we have 


MVo = MV + m, 
MVod = MVd + Ia, (8.126) 


MVé = MV? + m? + Io. 


The speed of the contact point on the stick right after the collision equals the speed 
of the CM plus the rotational speed relative to the CM. In other words, it equals v+ ad. 
The desired relative speed is therefore (v + wd) — V. We can determine the value of 
this relative speed by solving the above three equations for V, v, and œw. Equivalently, 
we can just use the results of Problem 8.18. There is, however, a much more appealing 
method, which is the following. 

The first two equations quickly give mvd = Iw. The last equation may then be 
written as, using TIa = (lw)w = (mvd)a, 


M (Vo —-V)\Vo+ V) = mv(v + od). (8.127) 


8.9 Solutions 


If we now write the first equation as 


M(Vo — V) = m, (8.128) 
we can divide Eq. (8.127) by Eq. (8.128) to obtain Vo + V = v + wd, or 
Vo = (v + œd) - V, (8.129) 


as we wanted to show. In terms of velocities, the correct statement is that the final 
relative velocity is the negative of the initial relative velocity. In other words, Vo — 0 = 
—(V —(v+oa)). 


8.20. The superball 


Since we are told that |v,| is unchanged by the bounce, we can ignore it when applying 
conservation of energy. And since the vertical impulse from the floor provides no 
torque around the ball’s CM, we can completely ignore the y motion in this problem. 
The horizontal impulse from the floor is responsible for changing both vy and w. 
With positive directions defined as in the statement of the problem, Eq. (8.61) gives 


AL = RAp 
Ila —0) = Rm, Uy). (8.130) 
And conservation of energy gives 
sme slo” gmo + -Ia 
= Io? —@*) = mw =i), (8.131) 


Dividing this equation by Eq. (8.130) gives!° 


Rœ +a) = — (v + vy). (8.132) 
We can now combine this equation with Eq. (8.130) which can be rewritten as, using 
I = (2/5)mR?, 
2 i $ 
zO — @) = Vy — Uy. (8.133) 


Given vy and w, the previous two equations are two linear equations in the two 
unknowns, vi and w’. Solving for v! and w’, and writing the result in matrix notation, 


gives 
A 1 3 —4 Ux 
Uy ae x 
( Ro! )- 7 ( =i 23 I Ro ). oo 


as desired. Note that Eq. (8.132), when written in the form of vy + Rw = —(v..+ Ra’), 
says that the relative velocity of the ball’s contact point and the ground simply changes 
sign during the bounce. 


REMARK: For a ball with a general moment of inertia Z = mR, you can use the 
above procedure to show that the matrix in Eq. (8.134) takes the general form, 


1 (1-6 -2 
= D aoe (8.135) 


15 We have divided out the trivial œ = w and vl. = vy solution, which corresponds to slipping 
motion on a frictionless plane. The nontrivial solution we will find shortly is the nonslipping one. 
Basically, to conserve energy, there must be no work done by friction. And since work is force 
times distance, this means that either (1) the plane is frictionless, so that the force is zero, or (2) 
there is no relative motion between the ball’s contact point and the plane, so that the distance is 
zero. The latter case is the one we are concerned with here. 


365 


366 Angular momentum, Part | (Constant L) 


8.21. 


8.22. 


For £ = 1 (a hoop), this becomes 


0 -=i 
ee é J (8.136) 


which means that the bounce simply interchanges and negates the values of vy and Ro. 
In particular, if you throw a “‘super-hoop” sideways with no spin (that is, Rw = 0), 
then it will bounce straight up in the air (that is, v. = 0) while spinning. & 


Many bounces 
Equation (8.62) gives the result after one bounce, so the result after two bounces is 


uy = 3/7 —4/7 vi, 
Ro” ) \ —10/7 —3/7 Ro! 
_f 3 AN? T o 
= 10/7 =3/7 Ro 
E 1 0 Vy 
~\ 0 1 Ro 
Vy 
=( bo (8.137) 


The square of the matrix turns out to be the identity. Therefore, after two bounces, 
both vy and @ return to their original values. The ball then repeats the motion of 
the previous two bounces (and so on, after every two bounces). The only difference 
between successive pairs of bounces is that the ball may shift horizontally. You are 
strongly encouraged to experimentally verify this interesting periodic behavior. 


Rolling over a bump 


We will use the fact that the angular momentum of the ball with respect to the corner 
of the step (call this point P) is unchanged by the collision. This is true because any 
forces exerted at point P provide zero torque around P. (The torque from gravity 
will be relevant during the subsequent rising-up motion. But during the instantaneous 
collision, L does not change.) This fact will allow us to find the energy of the ball right 
after the collision, which we will then require to be greater than mgh. 

Breaking the initial Z into the contribution relative to the CM, plus the contribution 
from the ball treated like a point mass located at the CM, we see that the initial angular 
momentum is L = (2/ 5)mR? wo + mVo(R — h), where wo is the initial angular speed. 
But the nonslipping condition tells us that wọ = Vo/R, so we can write L as 


2 TR 
L zRo + mVo(R — h) = mVo 5 h). (8.138) 
Let w’ be the angular speed of the ball around point P immediately after the colli- 
sion. The parallel-axis theorem says that the moment of inertia around P is equal to 
(2/5)mR? + mR? = (7/5)mR?2. Conservation of L around P during the collision then 


gives 
TR 7 Vi Sh 
mVo | — —h) = =mR?o! o=2[1 ; (8.139) 
5 R TR 


The energy of the ball right after the collision is therefore 


1/7 7 5h\? 
E=-~(-—=mR?)o? = —mv?(1- >=). (8.140) 
2\5 10 TR 


The ball will climb up over the step if E > mgh, which gives 


10gh 5h\ T! 
nvzy = (l-z) (8.141) 
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Remarks: Itis possible for the ball to rise up over the step even if h > R, provided 
that the ball sticks to the corner, without slipping. (If h > R, the step would have to be 
“hollowed out” so that the ball doesn’t collide with the side of the step.) But note that 
Vo —> coash— 7R/5. For h > 7R/5, it is impossible for the ball to make it up over 
the step, no matter how large Vo is. The ball will get pushed down into the ground, 
instead of rising up, if h > 7R/5. 

For an object with a general moment of inertia J = mR? (so B = 2/5 in our 
problem), you can show that the minimum initial speed is 


Vo > |22" (: D (8.142) 
ve Vis 2B AFAR) ` i 


This decreases as £ increases. It is smallest when the “ball” is a wheel with all the 
mass on its rim (so that 6 = 1), in which case it is possible for the wheel to climb up 
over the step even if h is close to 2R. & 


8.23. Falling toast 


Let vo and v be the speeds of the CM right before and right after the collision (so we 
know that vo = ./2gH ). The J for a square is the same as the / for a stick, (1/12)mé?, 
so AL = (€/2)Ap gives!° 


ea n ) low (8.143) 
w= mv — mv v- v= : 
12 2 i j 6 
Conservation of energy during the collision gives 
1 1 1 (ml? La? 
5mo = zm” - 5 (= Jo? (vo + v) (vo — v) = 5 ; (8.144) 


Dividing this by Eq. (8.143) gives vo + v = €w/2. We now have two linear equations 
for v and æ. Solving gives v = vo/2 and w = 3u9/€. 

After the collision, the time to hit the ground is given by vt + gt*/2 = h. Solving 
for t, setting wt = 7 for half a rotation, and using the v and w we just found, yields 


3v 1 7 
Sy Of 4 26h | =x. (8.145) 
eel 2 va 


Factoring out a vw and then using vw = 2gH gives 


3H 4h 
14 1 T. (8.146) 
£ H 
Isolating the square root, and then squaring and solving for H gives 
n?e? 
= ——___—_., 8.147 
6(6h — m£) ( ) 


The special value of £ is £ = (6/7 )h (which would be a huge piece of toast), in which 
case H = oo. If £ is larger than (6/7 )h, then there isn’t enough time to make half a 
rotation before hitting the floor. The intuitive reason for this is that since w = 6v/£ 
(from above), there is no way to increase w without also increasing v. The toast has the 
best chance of making half a rotation if v is very large, because then gravity doesn’t 
have any time to increase v. In this limit, the toast makes half a rotation upon hitting 
the floor if 7/w = h/v. Plugging in w = 6v/£ gives 6h = x £, as desired. 


'6 The minus sign on the right-hand side comes from the fact that we’re defining the v’s to be positive 
downward. Equivalently, the force from the counter increases the angular speed but decreases the 
linear speed. 
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For the reasonable values of h = 1 m and £ = 10cm, we obtain H ~ 3 mm, which 
is quite small. You should try this with a pencil to convince yourself that even this 
small distance can yield the desired (or rather, undesired) half rotation. 


8.24. Sliding to rolling 


(a) Define all linear quantities to be positive to the right, and all angular quantities 
to be positive clockwise, as shown in Fig. 8.80. Then, for example, the friction 
force Ff is negative. The friction force slows down the translational motion and 
speeds up the rotational motion, according to 


Ff =ma, and —FfR = Ta. (8.148) 


Eliminating Fç, and using J = BmR?, gives a = —fRa. Integrating this over 
time, up to the time when the ball stops slipping, gives 


AV = —BRAa. (8.149) 


Note that we could have obtained this by simply using the impulse equation, 
Eq. (8.61). Using AV = Ve —Vo, and Aw = wf — wo = wp, and also wf = Vf/R 
(the nonslipping condition), Eq. (8.149) gives 


Vo 


Vp = — 
£ 1+8’ 


(8.150) 
independent of the nature of Ff. Ff can depend on position, time, speed, or 
anything else. The relation a = — Ra, and hence also Eq. (8.149), will still be 
true at all times. 


REMARK: We can also calculate t and L relative to a dot painted on the ground 
that is the contact point at a given instant. There is zero torque relative to this 
point. To find L, we must add the L of the CM and the ZL relative to the CM. 
Therefore, t = dL /dt gives 0 = (d/dt)(muR + BmR? w), and so a = —BRa, as 
above. & 


Using Eq. (8.150), and also the relation wp = V¢/R, the loss in kinetic energy is 


l l 1 
AK = -mVè mV? + -Io 
2 2 2 


E 1 2 1 B 
amo (1 GFA C Se) 
_! if 8 
=z (£5). (8.151) 


For 6 — 0, no energy is lost, which makes sense. And for 6 — oo (a spool 
sliding on its axle), all the energy is lost, which also makes sense, because we 
essentially have a sliding block which can’t rotate. 

(b) Let’s first find t. The friction force is Ff = — umg, so F = ma gives —ug = a. 
Therefore, AV = at = —ygt. But Eq. (8.150) says that AV = Vp — Vo = 
—VoB/(U + £). Therefore, 


Pao, Fo (8.152) 


i= : 5 
A+) ug 


For B — 0, we have t — 0, which makes sense. And for 8 —> ov, we have 
t > Vo/(ug), which equals the time a sliding block would take to stop. 
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Let’s now find d. We have d = Vot+(1/2)at*. Using a = — ug, and plugging 
in the ¢ from Eq. (8.152), we obtain 


2 
-CtP V (8.153) 
d+8? 2ug 


The two extreme cases for 6 check here. 

To calculate the work done by friction, we might be tempted to write down the 
product Ffd, with Ff = — umg and d given in Eq. (8.153). But the result doesn’t 
equal the loss in kinetic energy calculated in Eq. (8.151). What’s wrong with 
this reasoning? The error is that the friction force does not act over a distance d. 
To find the distance over which Ff acts, we must find how far the surface of 
the ball moves relative to the ground. The speed of a dot on the ball that is 
instantaneously the contact point is Vre ©) = V(t) — Ræ (t) = (Vo + at) — Rat. 
Using œ = —a/BR and a = — ug, this becomes 


Viat) = Vo — HE pg, (8.154) 


Integrating this from ¢ = 0 to the ¢ given in Eq. (8.152) gives 


p V 
1+8 2ug` 
The work done by friction is F¢d;ey = —umgdrei, which does indeed give the 
loss in kinetic energy given in Eq. (8.151). 


8.25. Lots of sticks 


Consider the collision between two sticks. Let V be the speed of the contact point on 
the heavy one. Since this stick is essentially infinitely heavy, we may consider it to 
be an infinitely heavy ball, moving at speed V. The rotational degree of freedom of 
the heavy stick is irrelevant, as far as the light stick is concerned. We can therefore 
invoke the result of Problem 8.19 to say that the relative speed of the contact points is 
the same before and after the collision. This implies that the contact point on the light 
stick picks up a speed of 2V, because the heavy stick is essentially unaffected by the 
collision and keeps moving at speed V. 

Let us now find the speed of the other end of the light stick. This stick receives an 
impulse from the heavy stick, so we can apply Eq. (8.61) to the light stick to obtain 


drel = f Vrei (t) dt = (8.155) 


UCM 
B 


The speed of the struck (top) end is vtop = rw + vcm, because the CM speed adds to the 
rotational speed. The speed of the other (bottom) end is vpot = rw — vey, because the 
CM speed subtracts from the rotational speed.'’ The ratio of these speeds is 


AL=rAp => Bmr?o =r(mvcu) => rw= (8.156) 


Vbot __ SRY UM T= (8 157) 
Vtop aM + Vem 1+ 8 i : 


This is a general result whenever you strike the end of a stick with any force. In the 
present problem, we have vtop = 2V. Therefore, 


2(1 — 
Ubot = V (EP). (8.158) 


17 Since B < 1 for any real stick, we have rw = vcm/B > vem. Therefore, rw — vcm is greater 
than or equal to zero. 
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The same analysis holds for all the other collisions. Therefore, the bottom ends of the 
sticks move with speeds that form a geometric progression with ratio 2(1 — B)/(1 + 
p). If this ratio is less than 1 (that is, if 6 > 1/3), then the speeds go to zero as 
n — oo. If it is greater than 1 (that is, if 6 < 1/3), then the speeds go to infinity 
as n — oo. If it equals 1 (that is, if 8 = 1/3), then the speeds remain equal to V and 
are thus independent of n, as we wanted to show. A uniform stick has 6 = 1/3 relative 
to its center (which is usually written in the form J = m€? /12, where £ = 2r). 


Chapter 9 
Angular momentum, Part Il 
(General L) 


In Chapter 8, we discussed situations where the direction of the vector L remained 
constant, and only its magnitude changed. In this chapter, we will look at more 
general situations where the direction of L is allowed to change. The vector 
nature of L will prove to be vital here, and we will arrive at all sorts of strange 
results for spinning tops and such things. This chapter is rather long, alas, but the 
general outline is that the first three sections cover general theory, then Section 9.4 
introduces some actual physical setups, and then Section 9.6 begins the discussion 
of tops. 


9.1 Preliminaries concerning rotations 
9.1.1 The form of general motion 


Before getting started, we should make sure we’re all on the same page concerning 
a few important things about rotations. Because rotations generally involve three 
dimensions, they can often be hard to visualize. A rough drawing on a piece of 
paper might not do the trick. For this reason, this chapter is one of the more 
difficult ones in this book. But to ease into it, the next few pages consist of some 
definitions and helpful theorems. This first theorem describes the general form 
of any motion. You might consider it obvious, but it’s a little tricky to prove. 


Theorem 9.1 (Chasles’ theorem) Consider a rigid body undergoing arbitrary 
motion. Pick any point P in the body. Then at any instant (see Fig. 9.1), the 
motion of the body can be written as the sum of the translational motion of P, 
plus a rotation around some axis (which may change with time) through P.! 


Proof: The motion of the body can be written as the sum of the translational 
motion of P, plus some other motion relative to P (this is true because relative 
coordinates are additive quantities). We must show that this latter motion is a 
rotation. This seems quite plausible, and it holds because the body is rigid; that is, 


' In other words, a person at rest with respect to a frame whose origin is P, and whose axes are 
parallel to the fixed-frame axes, sees the body undergoing a rotation around some axis through P. 


Fig. 9.1 


371 


372 


Fig. 9.2 


Angular momentum, Part II (General L) 


all points keep the same distances relative to each other. If the body weren’t rigid, 
then this theorem wouldn’t be true. 

To be rigorous, consider a spherical shell fixed in the body, centered at P. The 
motion of the body is completely determined by the motion of the points on this 
sphere, so we need only examine what happens to the sphere. Because distances 
are preserved in the rigid body, the points on the sphere must always remain the 
same radial distance from P. And because we are looking at motion relative to P, 
we have therefore reduced the problem to the following: In what manner can a 
rigid sphere transform into itself? We claim that any such transformation has the 
property that there exist two points that end up where they started.” These two 
points must then be diametrically opposite points (assuming that the whole sphere 
doesn’t end up back where it started, in which case every point ends up where it 
started), because distances are preserved; given one point that ends up where it 
started, the diametrically opposite point must also end up where it started, to 
maintain the distance of a diameter. 

If this claim is true, then we are done, because for an infinitesimal transfor- 
mation, a given point moves in only one direction, because there is no time to do 
any turning. So a point that ends up where it started must have remained fixed 
for the whole (infinitesimal) time. Therefore, all the points on the diameter join- 
ing the two fixed points must also have remained fixed the whole time, because 
distances are preserved. So we are left with a rotation around this axis. 

This “two points ending up where they started” claim is quite believable, 
but nevertheless tricky to prove. Claims with these properties are always fun to 
think about, so I’ve left this one as a problem (Problem 9.2). Try to solve it on 
your own. | 


We will invoke this theorem repeatedly in this chapter (often without bothering 
to say so). Note that we are assuming that P is a point in the body, because we 
used the fact that P keeps the same distances from other points in the body. 


Remark: A situation where this theorem isn’t so obvious is the following (this setup contains 
only rotation, with no translation of the point P). Consider an object rotating around a fixed 
axis, the stick shown in Fig. 9.2. But now imagine grabbing the stick and rotating it around 
some other axis (the dotted line shown). It isn’t immediately obvious that the resulting motion 
is (instantaneously) a rotation around some new axis through the point P (which remains fixed). 
But indeed it is. We’ll be quantitative about this in the “Rotating sphere” example later in this 
section. & 


9.1.2 The angular velocity vector 


It is extremely useful to introduce the angular velocity vector, w, which is defined 
as the vector that points along the axis of rotation, and whose magnitude equals 


2 This claim is actually true for any transformation of a rigid sphere into itself, but for the present 
purposes we are concerned only with infinitesimal transformations, because we are looking only 
at what happens at a given instant in time. 
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the angular speed. The choice of the two possible directions along the axis is given 
by the right-hand rule: if you curl your right-hand fingers in the direction of the 
spin, then your thumb points in the direction of w. For example, a spinning record 
has œ perpendicular to the record, through the center (as shown in Fig. 9.3),° with 
its magnitude equal to the angular speed, w. The points on the axis of rotation 
are the ones that (instantaneously) do not move. Of course, the direction of œ 
may change over time, so the points that were formerly on the axis may now be 
moving. 


REMARKS: 


1. If you want, you can break the mold and use the left-hand rule to determine @, as long as 
you use it consistently. The direction of œ will be the opposite, but that doesn’t matter, 
because w isn’t really physical. Any physical result (for example, the velocity ofa particle, 
given below in Theorem 9.2) will come out the same, independent of which hand you 
(consistently) use. 


When studying vectors in school, 
You'll use your right hand as a tool. 
But look in a mirror, 

And then you’ll see clearer, 

It’s just like the left-handed rule. 


2. The fact that we can specify a rotation by specifying a vector @ is a peculiarity to three 
dimensions. If we lived in one dimension, then there would be no such thing as a rotation. 
If we lived in two dimensions, then all rotations would take place in that plane, so we 
could label a rotation by simply giving its speed, w. In three dimensions, rotations take 
place in (3) = 3 independent planes. And we choose to label these, for convenience, by 


the directions orthogonal to these planes, and by the angular speed in each plane. If we 


lived in four dimensions, then rotations could take place in (5) = 6 planes, so we would 


have to label a rotation by giving 6 planes and 6 angular speeds. Note that a vector, which 
has four components in four dimensions, would not do the trick. & 


In addition to specifying the points that are instantaneously motionless, œ 
also easily produces the velocity of any point in the rotating object. Consider 
the situation where the axis of rotation passes through the origin, which we’ ll 
generally assume to be the case in this chapter, unless otherwise stated. Then we 
have the following theorem. 


Theorem 9.2 Given an object rotating with angular velocity œ, the velocity 
of a point at position r is given by 


V=oxr. (9.1) 


Proof: Drop a perpendicular from the point in question (call it P) to the axis w. 
Let Q be the foot of the perpendicular, and let r’ be the vector from Q to P 


3 It’s actually meaningless to say that w passes through the center of the record, because you can 
draw the vector anywhere, and it’s still the same vector, as long as it has the correct magnitude and 
direction. Nevertheless, it’s customary to draw w along the axis of rotation and to say things like, 
“An object rotates around @...” 
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(see Fig. 9.4). From the properties of the cross product (see Appendix B), v = 
æ x r is orthogonal to æ, r, and also r’ because r’ is a linear combination of œ 
and r. Therefore, the direction of v is correct; it is always orthogonal to w and 
r’, so it describes circular motion around the axis w. Also, by the right-hand rule 
in the cross product (or the left-hand rule, if you had chosen to be different and 
defined w that way), v has the proper orientation around œ, namely into the page 
at the instant shown. And since 


Iv| = |@||r| sind = wr’, (9.2) 


we see that v has the correct magnitude, because wr’ is the speed of the circular 
motion around w. So v is indeed the correct velocity vector. (If we have the 
special case where P lies along æ, then r is parallel to w, so the cross product 
gives a zero result for v, as it should.) E 


We’ll make good use of Eq. (9.1) and apply it repeatedly throughout this 
chapter. Even if it’s hard to visualize what’s going on in a given rotation, all you 
have to do to find the speed of any point is calculate the cross product œ x r. 
Conversely, if the speed of every point in a body is given by v = @ x r, then the 
body must be undergoing a rotation with angular velocity w, because all points 
on the axis w are motionless, and all other points move with the proper speed for 
this rotation. 

A very nice thing about angular velocities is that they simply add. Stated more 
precisely: 


Theorem 9.3 Let coordinate systems Sı, S2, and S3 have a common origin. 
Let S| rotate with angular velocity @\,2 with respect to S2, and let Sy rotate with 
angular velocity @2,3 with respect to S3. Then Sı rotates (instantaneously) with 
angular velocity 


@13 = @12 + @23 (9.3) 


with respect to 83. 


Proof: If #12 and w23 point in the same direction, then the theorem is clear; 
the angular speeds just add. If, however, they don’t point in the same direction, 
then things are a bit harder to visualize. But we can prove the theorem by making 
abundant use of the definition of w. 

Pick a point P; at rest in S1. Let r be the vector from the origin to P1. The 
velocity of P4 (relative to a very close point P2 at rest in Sy) due to the rotation of 
Sı around @)2 is Vp,p, = @1,2 x r. The velocity of P2 (relative to a very close 
point P3 at rest in S3) due to the rotation of S2 around @23 is Vp,p,; = @23 X T, 
because P3 is also located essentially at position r. Therefore, the velocity of Pı 
relative to P3 is Vp,p, + VPP, = (@1,2 + @23) x r. This holds for any point 
P; at rest in S1, so the frame Sı rotates with angular velocity (@1,2 + #23) with 
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respect to S3. We see that the proof basically comes down to the facts that (1) the 
linear velocities just add, as usual, and (2) the angular velocities differ from the 
linear velocities by a cross product with r. a 


If 1/2 is constant in S2, then the vector 13 = @1,2 + @23 will change with 
respect to $3 as time goes by, because @},2, which is fixed in S2, is changing with 
respect to $3 (assuming that 1,2 and 23 aren’t parallel). But at any instant, 
@ 3 may be obtained by adding the present values of w1,2 and w2,3. Consider the 
following example. 


Example (Rotating sphere): A sphere rotates with angular speed w3 around a 
stick that initially points in the z direction. You grab the stick and rotate it around the 
y axis with angular speed w2. What is the angular velocity of the sphere, with respect 
to the lab frame, as time goes by? 


Solution: Inthe language of Theorem 9.3, the sphere defines the S4 frame; the stick 
and the y axis define the S2 frame; and the lab frame is the S3 frame. The instant after 
you grab the stick, we are given that œ] 2 = w3Z, and 23 = œ2f. Therefore, the 
angular velocity of the sphere with respect to the lab frame is w1 3 = @1,2 + @23 = 
@3Z + 29, as shown in Fig. 9.5. Convince yourself that the combination of these 
two rotations yields zero motion for the points along the line of @1,3. As time goes 
by, the stick (and hence @ 7) rotates around the y axis, so @1 3 = @1,2 + @2,3 traces 
out a cone around the y axis, as shown. 


Remark: Note the different behavior of 1,3 for a slightly different statement of the 
problem: Let the sphere initially rotate with angular velocity wy around a stick, and then 
grab the stick and rotate it with angular velocity w3z. For this situation, w13 initially 
points in the same direction as in the original statement of the problem (it initially equals 
w29 + w32). But as time goes by, it is now the horizontal component (defined by the stick) 
of œ; 3 that changes, so w13 = @1,2 + @23 traces out a cone around the z axis, as shown 
in Fig. 9.6. & 


An important point concerning rotations is that they are defined with respect 
to a coordinate system. It makes no sense to ask how fast an object is rotating with 
respect to a certain point, or even a certain axis. Consider, for example, an object 
rotating with angular velocity œ = w32 with respect to the lab frame. Saying only, 
“The object has angular velocity œ = w32,” is not sufficient, because someone 
standing in the frame of the object would measure w = 0, and would therefore be 
very confused by your statement. Throughout this chapter, we’ Il try to remember 
to state the coordinate system with respect to which is measured. But if we 
forget, the default frame is the lab frame. 

This section was definitely a bit abstract, so don’t worry too much about it 
at the moment. The best strategy is perhaps to read on, and then come back 
for a second pass after digesting a few more sections. At any rate, we'll be 
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discussing many other aspects (probably more than you’d ever want to know) 
of w in Section 9.7.2, so you’re assured of getting a lot more practice with it. 
For now, if you want to strain some brain cells thinking about w vectors, you are 
encouraged to solve Problem 9.3, and also to look at the three given solutions. 


9.2 The inertia tensor 


Given an object undergoing general motion, the inertia tensor is what relates the 
angular momentum, L, to the angular velocity, œ. This tensor (which is just a 
fancy name for “matrix” in this context) depends on the geometry of the object, 
as we’ll see. In finding the L due to general motion, we’ll follow the strategy of 
Section 8.1. We’ll first look at the special case of rotation around an axis through 
the origin, then we’ll look at the most general possible motion. 


9.2.1 Rotation around an axis through the origin 


The three-dimensional object in Fig. 9.7 rotates with angular velocity œ. Consider 
a little piece of the body, with mass dm and position r. The velocity of this piece is 
v = œw XT, so its angular momentum (relative to the origin) isr xp = (dm)rxv = 
(dm)r x (œw x r). The angular momentum of the entire body is therefore 


L= fi x (@ x r)dm, (9.4) 


where the integration runs over the volume ofthe body. In the case where the rigid 
body is made up of a collection of point masses m;, the angular momentum is 


L= X mir; x (@ x ri). (9.5) 


The double cross product in Eqs. (9.4) and (9.5) looks a bit intimidating, but it’s 
actually not so bad. First, we have 


x y Z 
@xr=| 0o) a ? @3 
x yp Z 


= (wz — way)& + (œx — W12)9 + (oiy — Or~)J2. (0.6) 


We’re using the notation w; instead of wy, etc., because there are already enough 
x, y, z’s floating around here. The double cross product is then 


A A A 


x y Z 
rx(@xr)= x y z 
(@2z — w3y)  (w3x—@1Z) (@ıy — w2x) 


= (o O? +22) — wpxy — wszx) 
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+ (or? +x) — wsyz — o)§ 
+ (os (x? +y’) — W zx — wnyz (9.7) 


The angular momentum in Eq. (9.4) may therefore be written in the concise 
matrix form, 


Li [0+2 — fxy — fzx w1 
In |= — f xy TZ +x?) — fyz w2 
L3 -fzx -fyz f@+ty’) 3 
Lex Ly Lez (OJI 
=| bx ly hz w2 
Ix Ly Iz w3 
= lø. (9.8) 


For the sake of clarity, we have not bothered to write the dm part of each integral 
(and we’ll continue to drop it for most of the remainder of this section). The 
matrix I is called the inertia tensor. If the word “tensor” scares you, just ignore 
it. I is simply a matrix. It acts on a vector (the angular velocity) and produces 
another vector (the angular momentum). 


Example (Cube with origin at corner): Calculate the inertia tensor for a solid 
cube of mass M and side length L, with the coordinate axes parallel to the edges of 
the cube, and the origin at a corner (see Fig. 9.8). 


Solution: Due to the symmetry of the cube, there are only two integrals we need 
to calculate in Eq. (9.8). The diagonal entries are all equal to f o? + z°) dm, and the 
off-diagonal entries are all equal to — f xy dm. With dm = p dx dy dz, and p = M/L?, Fig. 9.8 
these two integrals are 


L rl pL L L 2 
I f f O? +2) dx dy dz = o | y dy + o | z? dz = =ML’, 
o Jo Jo 0 0 3 


L pL pl L L M 
-f f f wodrdydz = -pl | xax f ydy = ——_. (9.9) 
o Jo Jo 0 0 4 


Therefore, 


2/3 —1/4 —1/4 
I=MĽ?| -1/4 2/3 -1/4 J. (9.10) 
aif =1)/4 373 


Having found I, we can calculate the angular momentum associated with any given 
angular velocity. If, for example, the cube is rotating around the z axis with angu- 
lar speed w, then we can apply the matrix I to the vector (0,0, w) to find that the 
angular momentum is L = ML*w(—1/4, —1/4,2/3). Note the somewhat odd fact 
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that Ly and Ly are nonzero, even though the rotation is only around the z axis. We’ll 
discuss this issue after the following remarks. 


REMARKS: 


1. The inertia tensor in Eq. (9.8) is a rather formidable-looking object. You will therefore 
be very pleased to hear that you rarely have to use it. It’s nice to know that it’s there if 
you need it, but the concept of principal axes (discussed in Section 9.3) provides a way 
to avoid using the inertia tensor (or more precisely, to greatly simplify it) and is therefore 
much more useful in solving problems. 

2. Lis a symmetric matrix, which is a fact that will be important in Section 9.3. There are 
therefore only six independent entries, instead of nine. 

3. In the case where the rigid body is made up of a collection of point masses mj, the entries 
in the matrix are just sums. For example, the upper left entry is X m;(y? + 22). 

4. I depends only on the geometry of the object, and not on ø. 

5. To construct an I, you not only need to specify the origin, you also need to specify the x, y, 
z axes of your coordinate system. And the basis vectors must be orthogonal, because the 
cross product calculation above is valid only for an orthonormal basis. If someone else 
comes along and chooses a different orthonormal basis (but the same origin), then her I 
will have different entries, as will her œ, as will her L. But her œ and L will be exactly the 
same vectors as your w and L. They will appear different only because they are written 
in a different coordinate system. A vector is what it is, independent of how you choose to 
look at it. If you each point your arm in the direction of what you calculate L to be, then 
you will both be pointing in the same direction. 

6. For the case of a pancake object rotating in the x-y plane, we have z = 0 for all points in 
the object. And w = w32, so w; = œz = 0. The only nonzero term in the L in Eq. (9.8) 
is therefore L3 = J (x? + y?) dm w3, which is simply the L, = Z% result we found in 
Eq. (8.5). & 


This is all perfectly fine. Given any rigid body, we can calculate I (relative 
to a given origin, using a given set of axes). And given œw, we can then apply I 
to it to find L. But what do these entries in I really mean? How do we interpret 
them? Note, for example, that w3 appears not only in L3 in Eq. (9.8), but also in 
Lı and L2. But w3 is relevant to rotations around the z axis, so what in the world 
is it doing in Lı and L2? Consider the following examples. 


Example 1 (Point mass in the x-y plane): Consider a point mass m traveling 
in a circle of radius r (centered at the origin) in the x-y plane, with frequency w3, as 
shown in Fig. 9.9. Using œ = (0,0, @3), x2 +y = r2, and z = 0 in Eq. (9.8) (with 
a discrete sum of only one object, instead of the integrals), the angular momentum 
with respect to the origin is 


L = (0, 0, mr*a3). (9.11) 


The z component is mr(r@3) = mrv, as it should be. And the x and y components 


are zero, as they should be. This case where œ] = œ? = 0 and z = 0 is simply 
the case we studied in Chapter 8, as mentioned in Remark 6 above. 


Fig. 9.9 
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Example 2 (Point mass in space): Consider a point mass m traveling in a circle 
of radius r, with frequency w3. But now let the circle be centered at the point (0, 0, zo), 
with the plane of the circle parallel to the x-y plane, as shown in Fig. 9.10. Using 
w = (0,0,03), x2 +y? = r?, and z = zp in Eq. (9.8), the angular momentum with 
respect to the origin is 


L = mo3(—xz9, —yz0, r’). (9.12) 


The z component is mrv, as it should be. But surprisingly, we have nonzero Lı and 
Ly, even though the mass is just rotating around the z axis. L does not point along w 
here. What’s going on? 

Consider an instant when the mass is in the y-z plane, as shown in Fig. 9.10. The 
velocity of the mass is then in the —x direction. Therefore, the particle most certainly 
has angular momentum around the y axis, as well as the z axis. Someone looking at a 
split-second movie of the mass at this point can’t tell whether it’s rotating around the 
y axis, the z axis, or undergoing some complicated motion. But the past and future 
motion is irrelevant; at any instant in time, as far as the angular momentum goes, we 
are concerned only with what is happening at this instant. 

At this instant, the angular momentum around the y axis is L2 = —mzgv, because 
zo is the distance from the y axis, and the minus sign comes from the right-hand rule. 
Using v = w3r = w3y, we have L2 = —mzgw3y, in agreement with Eq. (9.12). Also, 
at this instant, Lı is zero, because the velocity is parallel to the x axis. This agrees 
with Eq. (9.12), since x = 0. As an exercise, you can check that Eq. (9.12) is also 
correct when the mass is at a general point (x, y, zg). 

We see that, for example, the z = — J yz entry in I tells us how much the %3 
component of the angular velocity contributes to the L2 component of the angular 
momentum. And due to the symmetry of I, the 4, = sy entry in I also tells us how 
much the w component of the angular velocity contributes to the L3 component of 
the angular momentum. In the former case, if we group the product of the various 
quantities as — f (w3y)z, we see that this is simply the appropriate component of the 
velocity times the distance from the y axis. In the latter case with — f (w2z)y, it is 
the opposite grouping. But in both cases there is one factor of y and one factor of z, 
hence the symmetry in I. 


REMARK: For a point mass, L is actually more easily obtained by just calculating L = 
r x p. The result for the instant shown in Fig. 9.10 is drawn in Fig. 9.11, where it is clear 
that L has both y and z components, and thus also clear that L doesn’t point along w. For a 
more complicated object, the tensor I is generally used, because it is necessary to perform 
the integral of the L = r x p contributions over the entire object, and the tensor has this 
integral built into it. At any rate, whatever method you use, you will find that except in 
special circumstances (see Section 9.3), L doesn’t point along øw. 


Consider the vector of L, 

And that of w as well. 

The erroneous claim 

That they must aim the same 

Is a view that you’ve got to dispel! & 
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Example 3 (Two point masses): Let’s now add another point mass m to the 
previous example. Let it travel in the same circle, at the diametrically opposite point, 
as shown in Fig. 9.12. Using œ = (0,0, w3), x2 +y? = r?, and z = zg in Eq. (9.8), 
you can show that the angular momentum with respect to the origin is 


L = 2ma3(0, 0,77). (9.13) 


Since v = w3r, the z component is 2mrv, as it should be. And Lı and L3 are zero, 
unlike in the previous example, because these components of the L’s of the two 
particles cancel. This occurs because of the symmetry of the masses around the z 
axis, which causes the /,, and /-y entries in the inertia tensor to vanish; they are each 
the sum of two terms, with opposite x values, or opposite y values. Alternatively, you 
can just note that adding on the mirror-image L vector in Fig. 9.10 produces canceling 
x and y components. 


Let’s now look at the kinetic energy of our object, which is rotating around an 
axis passing through the origin. To find this, we must add up the kinetic energies 
of all the little pieces. A little piece has energy (dm) v?/2 = dm|@ x r|*/2. 
Therefore, using Eq. (9.6), the total kinetic energy is 


1 


T= 5 f ((orz — wy)? + (@3x — wz)? + (@ıy — onx)°) dm. (9.14) 


Multiplying this out, we see (after a little work) that we can write T as 


| [O +27) -fx — fzx w1 
T = 5 (@1,02,03) - -fw  f@+x’) =f w2 
-fzx -fyz fœ +y 3 
1 1 
= 50: To = 50°L. (9.15) 


If æ = w3Z, then this reduces to T = L203 /2, which agrees with the result in 
Eq. (8.8), with a slight change in notation. 


9.2.2 General motion 


How do we deal with general motion in space? That is, what if an object is both 
translating and rotating? For the motion in Fig. 9.13, the various pieces of mass 
aren’t traveling in circles around the origin, so we can’t write v = œw x r, as we 
did prior to Eq. (9.4). 

To determine L (relative to the origin), and also the kinetic energy T, we will 
use Theorem 9.1 to write the motion as the sum of a translation plus a rotation. 
In applying the theorem, we may choose any point in the body to be the point 
P in the theorem. However, only in the case where P is the object’s CM can we 
extract anything useful, as we’ll see. The theorem then says that the motion of 
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the body is the sum of the motion of the CM plus a rotation around the CM. So 
let the CM move with velocity V, and let the body instantaneously rotate with 
angular velocity œ around the CM (that is, with respect to the frame whose origin 
is the CM, and whose axes are parallel to the fixed-frame axes).* 

Let the position of the CM relative to the origin be R = (X,Y,Z), and let 
the position of a given piece of mass relative to the CM be r’ = (x’, y’,z’). Then 
r = R+ r is the position of a piece of mass relative to the origin (see Fig. 9.14). 
Let the velocity of a piece of mass relative to the CM be v’ (so v’ = @”’ x r’). 
Then v = V + Vv is the velocity relative to the origin. 

Let’s look at L first. The angular momentum is 


L= [exvan= f Rr) x (V+ x1) dm 


= fe x Vydn+ fv x (@' x x’) dm 
= MR x V) + Ley, (9.16) 


where the cross terms vanish because the integrands are linear in r’. More pre- 
cisely, the integrals involve fr’ dm, which is zero by definition of the CM 
(because f r'dm/M is the position of the CM relative to the CM, which is zero). 
Lew is the angular momentum relative to the CM.° 

We see that as in the pancake case in Section 8.1.2, the angular momentum 
(relative to the origin) of a body can be found by treating the body like a point 
mass located at the CM and finding the angular momentum of this point mass 
relative to the origin, and by then adding on the angular momentum of the body 
relative to the CM. Note that these two parts of the angular momentum need not 
point in the same direction, as they did in the case of the pancake moving in the 
x-y plane. 

Now let’s look at T. The kinetic energy is 


1 1 
T = fi? am= [sv tv Pam 


MV? + 50 -Lom, (9.17) 


4 It’s not necessary to put the prime on the w here, because the angular velocity vector in the CM 
frame is the same as in the lab frame. But we’ll use the prime just because we’ll have primes on 
the other CM quantities below. 

2 By this, we mean the angular momentum as measured in the coordinate system whose origin is the 
CM, and whose axes are parallel to the fixed-frame axes. 
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where the last line follows from the steps leading to Eq. (9.15). The cross term 
fV -v'dm = [V-(@! x r’) dm vanishes because the integrand is linear in r’ 
and thus yields a zero integral, by definition of the CM. As in the pancake case 
in Section 8.1.2, the kinetic energy of a body can be found by treating the body 
like a point mass located at the CM, and by then adding on the kinetic energy of 
the body due to the rotation around the CM. 


9.2.3 The parallel-axis theorem 


Consider the special case where the CM rotates around the origin with the same 
angular velocity at which the body rotates around the CM (see Fig. 9.15), that is, 
V =o’ x R. This can be achieved, for example, by piercing the body with the 
base of a rigid “T” and then rotating the T and the body around the (fixed) line of 
the “upper” part of the T (the origin must pass through this line). We then have the 
nice situation where all points in the body travel in fixed circles around the axis 
of rotation. Mathematically, this follows from v = V+ v’ =o’ x R+% xr = 
æ x r. Dropping the prime on ø, Eq. (9.16) becomes 


L=MR x (@ xR) + |r x (ox ram (9.18) 


Expanding the double cross products as in the steps leading to Eq. (9.8), we can 
write this as 


Li Y?+Zz? -XY —ZX w 
L | =M =X Zx =Y7 a 
L3 EIK = Xy w3 
fo” +z?) _ f xy! = fax | 
+ _ f xy Se? 4 x?) = fyz w 
— fzx — fyz Jo +y”) o3 
= (Ir + Icm)ø. (9.19) 


This is the generalized parallel-axis theorem. It says that once you’ ve calculated 
Icm relative to the CM, then if you want to calculate I relative to another point, 
you simply have to add on the Ip matrix, obtained by treating the object like a 
point mass at the CM. So you have to compute six extra numbers (there are six, 
instead of nine, because the Ip matrix is symmetric) instead of just the one MR? 
in the parallel-axis theorem in Chapter 8, given in Eq. (8.13). Problem 9.4 gives 
another derivation of the parallel-axis theorem, without mentioning the angular 
velocity. 


REMARK: The name “parallel-axis” theorem is actually a misnomer here. The inertia tensor 
isn’t associated with one particular axis, as the moment of inertia in Chapter 8 was. The moment 
of inertia is just one of the diagonal entries (associated with a given axis) in the inertia tensor. 
The inertia tensor depends on the entire coordinate system. So in that sense we should call this 
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the “parallel-axes” theorem, because the coordinate axes in the CM frame are assumed to be 
parallel to the ones in the fixed frame. At any rate, the point is that the parallel-axis theorem 
in Chapter 8 dealt with shifting the axis, whereas the present theorem deals with shifting the 
origin (and hence all three axes in general). & 


As far as the kinetic energy goes, if @ and w’ are equal, so that V = w’ x R, 
then Eq. (9.17) gives (dropping the prime on æ) 


1 1 
T = 5Mlo x R|? +f 5l x r'|? dm. (9.20) 


Performing the steps leading to Eq. (9.15), this becomes 


1 1 
T= 5° - (Ir + Icy) = 5° -L. (9.21) 


9.3 Principal axes 


The cumbersome expressions in the previous section may seem a bit unsettling, 
but it turns out that usually we can get by without them. The strategy for avoiding 
all of the above mess is to use the principal axes of a body, which we will define 
below. 

In general, the inertia tensor I in Eq. (9.8) has nine nonzero entries, of which 
six are independent due to the symmetry of I. In addition to depending on the 
origin chosen, the inertia tensor depends on the set of orthonormal basis vectors 
chosen for the coordinate system; the x, y, z variables in the integrals in I depend, 
of course, on the coordinate system they’re measured with respect to. Given a 
blob of material, and given an arbitrary origin, any orthonormal set of basis 
vectors is usable, but there is one special set that makes all our calculations 
very nice. These special basis vectors are called the principal axes. They can be 
defined in various equivalent ways: 


e The principal axes are the orthonormal basis vectors for which I is diagonal, that is, 
for which’ 


L 0 0 
I=] 0 bh oO |. (9.22) 
0 0 h 


I, h, and J; are called the principal moments. For many objects, it is quite obvious 
what the principal axes are. For example, consider a uniform rectangle in the x-y plane. 
Pick the origin to be the CM, and let the x and y axes be parallel to the sides. Then the 


é The CM is often chosen to be the origin, but it need not be. There are principal axes associated 
with any origin. 

7 Technically, we should be writing J11 or J, instead of J1, etc., in this matrix, because the one- 
index object J; looks like the component of a vector, not a matrix. But the two-index notation gets 
cumbersome, so we’ll be sloppy and just use /1, etc. 
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principal axes are clearly the x, y, and z axes, because all the off-diagonal elements 
in the inertia tensor in Eq. (9.8) vanish, by symmetry. For example, Ixy = — f xy dm 
equals zero, because for every point (x, y) in the rectangle, there is a corresponding point 
(—x, y), so the contributions to f xy dm cancel in pairs. Also, the integrals involving z 


are identically zero, because z = 0. 


e A principal axis is an axis ô for which Iô = Jĝ. That is, a principal axis is a special 


direction with the property that if œ points along it, then so does L. The principal axes 
of an object are then the orthonormal set of three vectors 1, @2, @3 with the property 
that 


Iô; =e), Iô? = hâz, Iô; = hô3. (9.23) 


The three statements in Eq. (9.23) are equivalent to Eq. (9.22), because the vectors @1, 
2, and @3 are simply (1,0, 0), (0, 1,0), and (0, 0, 1) in the frame in which they are the 
basis vectors. 

Consider an object rotating around a fixed axis with constant angular speed. Then this 
axis is a principal axis if there is no need for any torque. So in some sense, the object is 
“happy” to spin around a principal axis. A set of three orthonormal axes, each of which 
has this property, is by definition what we call a set of principal axes. 

This definition of a principal axis is equivalent to the previous definition for the 
following reason. Assume that the object rotates around a fixed axis @; for which 
L = Iĝ; = 1, @}, as in Eq. (9.23). Then since ô; is assumed to be fixed, we see that L 
is also fixed. Therefore, t = dL/dt = 0. 

Conversely, if the object is rotating around a fixed axis @), and if t = dL/dt = 0, 
then we claim that L points along @ (that is, L = 7161). This is true because if L does 
not point along @ , then imagine painting a dot on the object somewhere along the line 
of L. A little while later, the dot will have rotated around the fixed vector ô]. But the 
line of L must always pass through the dot, because we could have rotated our axes 
around @ and started the process at a slightly later time (this argument relies on @1 
being fixed). Therefore, we see that L has changed, in contradiction to the assumption 
that dL/dt = 0. Hence, L must in fact point along 1. 

For a rotation around a principal axis ô, the lack of need for any torque means that 
if the object is pivoted at the origin, and if the origin is the only place where any 
force is applied (which implies that there is zero torque around it), then the object can 
undergo rotation with constant angular velocity w. If you try to set up this scenario with 


a nonprincipal axis, it won’t work. 


Example (Square with origin at corner): Consider the uniform square in 
Fig. 9.16. In Appendix E, we show that the principal axes are the dotted lines drawn 
(and also the z axis perpendicular to the page). But there is no need to use the tech- 
niques in the appendix to see this, because in this new basis it is clear by symmetry 
that the integral f x1x2 is zero; for every x4 in the integral, there is a —x,. And x3 = z 
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is identically zero, which makes all the other off-diagonal terms in I also equal to 
zero. Therefore, since I is diagonal in this new basis, these basis vectors are the 
principal axes. 

Furthermore, it is intuitively clear that the square will be happy to rotate around 
any one of these axes indefinitely. During such a rotation, the pivot will certainly be 
applying a force (if the axis is ô; or Z, but not if it is @2), to produce the centripetal 
acceleration of the CM in its circular motion. But it won’t be applying a torque relative 
to the origin (because the r inr x F is 0). This is good, because for a rotation around 
one of these principal axes, dL/dt = 0, so there is no need for any torque. 

In contrast with the off-diagonal zeros in the new basis, the integral f xy in the 
old basis is not zero, because every point gives a positive contribution. So the inertia 
tensor is not diagonal in the old basis, which means that x and Ẹ are not principal axes. 
Consistent with this, it is reasonably clear that it is impossible to make the square 
rotate around, say, the x axis, assuming that its only contact with the outside world 
is through a pivot (for example, a ball and socket) at the origin. The square simply 
doesn’t want to remain in this circular motion. Mathematically, L (relative to the 
origin) doesn’t point along the x axis, so it therefore precesses around the x axis along 
with the square, tracing out the surface of a cone. This means that L is changing. But 
there is no torque available (relative to the origin) to provide for this change in L. 
Hence, such a rotation cannot exist. 


At the moment, it is not at all obvious that an orthonormal set of principal axes 
exists for an arbitrary object. But this is indeed the case, as stated in Theorem 9.4 
below. Assuming for now that principal axes do exist, then in this basis the L and 
T in Eqs. (9.8) and (9.15) take on the particularly nice forms, 


L = (jo), ha, 3), 


1 


(9.24) 
T= 5 (no? + hos + 10%) F 


The quantities w1, w2, and w3 here are the components of a general vector œw 
written in the principal-axis basis; that is, @ = @1@, + @2@2 + @3@3. Equation 
(9.24) is a vast simplification over the general formulas in Eqs. (9.8) and (9.15). 
We will therefore invariably work with principal axes in the remainder of this 
chapter. 

Note that the directions of the principal axes (relative to the body) depend only 
on the geometry of the body. They may therefore be considered to be painted on 
(or in) it. Hence, they will generally move around in space as the body rotates. 
For example, if the object is rotating around a principal axis, then that axis 
stays fixed while the other two principal axes rotate around it. In relations like 
w = (@1, 07,3) and L = (a1, a2, 103), the components w; and J;a; are 
measured along the instantaneous principal axes @;. Since these axes change 
with time, it is quite possible that the components w; and I;w; change with time, 
as we’ll see in Section 9.5 (and onward). 
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Let’s now state the theorem that implies that a set of principal axes does indeed 
exist for any body and any origin. The proof of this theorem involves a useful 
but rather slick technique, but it’s slightly off the main line of thought, so we’ ll 
relegate it to Appendix D. Take a look at the proof if you wish, but if you want 
to just accept the fact that principal axes exist, that’s fine. 


Theorem 9.4 Given a real symmetric 3 x 3 matrix, I, there exist three 
orthonormal real vectors, ©, and three real numbers, Ip, with the property 
that 


Ið; = ôk. (9.25) 
Proof: See Appendix D. E 


Since the inertia tensor in Eq. (9.8) is indeed symmetric for any body and any 
origin, this theorem says that we can always find three orthogonal basis vectors 
that satisfy Eq. (9.23). Or equivalently, we can always find three orthogonal basis 
vectors for which I is a diagonal matrix, as in Eq. (9.22). In other words, principal 
axes always exist. Problem 9.7 gives another way to demonstrate the existence 
of principal axes in the special case of a pancake object. 

Invariably, it is best to work in a coordinate system that has principal axes as 
its basis, due to the simplicity of Eq. (9.24). And as mentioned in Footnote 6, the 
origin is generally chosen to be the CM, because from Section 8.4.3 the CM is 
one of the origins for which t = dL/dt is a valid statement. But this choice is 
not necessary; there are principal axes associated with any origin. 

For an object with a fair amount of symmetry, the principal axes are usually 
the obvious choices and can be written down by simply looking at the object 
(examples are given below). If, however, you are given an unsymmetrical body, 
then the only way to determine the principal axes is to pick an arbitrary basis, 
then find I in this basis, and then go through a diagonalization procedure. This 
diagonalization procedure basically consists of the steps at the beginning of the 
proof of Theorem 9.4 (given in Appendix D), with the addition of one more step 
to get the actual vectors, so we’ll relegate it to Appendix E. There’s no need 
to worry much about this method. Virtually every system you encounter will 
involve an object with sufficient symmetry to enable you to just write down the 
principal axes. 

Let’s now prove two very useful (and very similar) theorems. 


Theorem 9.5 [ftwo principal moments are equal (I, = h = 1), then any axis 
(through the chosen origin) in the plane of the corresponding principal axes is a 
principal axis, and its moment is also I. Similarly, if all three principal moments 
are equal (I, = h = h = 1), then any axis (through the chosen origin) in space 
is a principal axis, and its moment is also I. 


Proof: The first part was already proved at the end of the proof in Appendix D, 
but we’ll do it again here. Since 4 = h = J, we have Iu; = Ju,, and Iu? = /w, 


9.3 Principal axes 


where the w’s are the principal axes. Hence, I(au, + bu2) = Z (au; + bug), for 
any a and b. Therefore, any linear combination of u; and up (that is, any vector 
in the plane spanned by u; and uz) is a solution to Iu = Zu and is thus a principal 
axis, by definition. 

The proof of the second part proceeds in a similar manner. Since J) = h = 
h = I, we have Iu; = Juy, Iu? = 7w, and Iu3 = Juz. Hence, I(au, + buz + 
cu3) = I (au; + buz + cuz). Therefore, any linear combination of uy, u2, and u3 
(that is, any vector in space) is a solution to Iu = Ju and is thus a principal axis, 
by definition. 

In short, if J} = h = J, then I is the identity matrix (up to a multiple) in the 
space spanned by u; and ug. Andif J; = h = h = /, then I is the identity matrix 
(up to a multiple) in the entire space. Note that it isn’t required that the various 
u; vectors be orthogonal. All we need is that they span the relevant space. MH 


If two or three moments are equal, so that there is freedom in choosing the 
principal axes, then it is possible to pick a nonorthogonal group of them. We 
will, however, always choose ones that are orthogonal. So when we say “a set of 
principal axes,” we mean an orthonormal set. 


Theorem 9.6 [fa pancake object is symmetric under a rotation through an 
angle 0 + 180° in the x-y plane (such as a hexagon), then every axis in the 
x-y plane (with the origin chosen to be the center of the symmetry rotation) is a 
principal axis with the same moment. 


Proof: Let ôo be a principal axis in the plane, and let ôg be the axis obtained 
by rotating @ through the angle 6. Then @g is also a principal axis with the same 
principal moment, due to the symmetry of the object. Therefore, Io = J@o, and 
Ið = [@@. 

Now, any vector w in the x-y plane can be written as a linear combination of 
ôo and g, provided that 9 Æ 180° (or zero, of course). That is, ôo and p span 
the x-y plane. Therefore, any vector w can be written as œ = ao + b@g, and so 


Iw = I(a@o + b@g) = alo + blg = Io. (9.26) 


Hence, œ is also a principal axis. Problem 9.8 gives another proof of this 
theorem. | 


The theorem actually holds even without the “pancake” restriction. That is, 
it holds for any object with a rotational symmetry around the z axis (excluding 
0 Æ 180°). This can be seen as follows. The z axis is a principal axis, because 
if œ points along 2, then L must also point along 2, by symmetry. There are 
therefore (at least) two principal axes in the x-y plane. Label one of these as ôo 
and proceed as above. 

Let’s now do some quick examples. We’ll state the principal axes for the 
objects listed below (relative to the origin). Your task is to show that they 
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are correct. Usually, a quick symmetry argument shows that 


{Q? +27) -fx — fzx 
I= -fy f@t+x) -fyz (9.27) 
-fa —-fyz  fJœ@+y) 


is diagonal. In all of these examples (see Fig. 9.17), the origin for the principal 
axes is understood to be the origin of the given coordinate system (which is not 
necessarily the CM). In describing the axes, they therefore all pass through the 
origin, in addition to having the other properties stated. 


Example 1: Point mass at the origin. 
principal axes: any axes. 


Example 2: Point mass at the point (xọ, yo, Z0). 
principal axes: axis through point, any axes perpendicular to this. 


Example 3: Rectangle centered at the origin, as shown. 
principal axes: the x, y, and z axes. 


Example 4: Cylinder with axis as z axis. 
principal axes: z axis, any axes in x-y plane. 


Example 5: Square with one corner at origin, as shown. 
principal axes: z axis, axis through CM, axis perpendicular to this. 


9.4 Two basic types of problems 


The previous three sections introduced a variety of abstract concepts. We will 
now finally look at some actual physical systems. The concept of principal axes 
gives us the ability to solve many kinds of problems. Two kinds, however, come 
up again and again. There are variations on these, of course, but they may be 
generally stated as follows. 


e Strike a rigid object with an impulsive (that is, quick) blow. What is the motion of the 
object immediately after the blow? 

e An object rotates around a fixed axis. A given torque is applied. What is the frequency 
of the rotation? Or conversely, given the frequency, what is the required torque? 


We’ll work through an example for each of these problems. In both cases, the 
solution involves a few standard steps, so well write them out explicitly. 


9.4 Two basic types of problems 


9.4.1 Motion after an impulsive blow 


Problem: Consider the rigid object in Fig. 9.18. Three masses are connected 
by three massless rods, in the shape of an isosceles right triangle with hypotenuse 
length 4a. The mass at the right angle is 2m, and the other two masses are m. 
Label them A, B, C, as shown. Assume that the object is floating freely in outer 
space. Mass B is struck with a quick blow, directed into the page. Let the imparted 
impulse have magnitude f F dt = P. (See Section 8.6 for a discussion of impulse 
and angular impulse.) What are the velocities of the three masses immediately 
after the blow? 


Solution: Our strategy will be to find the angular momentum of the system 
(relative to the CM) using the angular impulse, and then calculate the principal 
moments and find the angular velocity vector (which will give the velocities 
relative to the CM), and then finally add on the CM motion. 

The altitude from the right angle to the hypotenuse has length 2a, and the CM 
is easily seen to be located at its midpoint (see Fig. 9.19). Picking the CM as our 
origin, and letting the plane of the paper be the x-y plane, the positions of the 
three masses are ry = (—2a, —a, 0), rg = (2a, —a, 0), and rc = (0,a,0). There 
are now five standard steps that we must perform. 


e Find L: The positive z axis is directed out of the page, so the impulse vector is 
P= f F dt = (0,0, —P). Therefore, the angular momentum of the system (relative to 


the CM) is 
L= [ra= [rex Pd=rgx f Far 


= (2a, —a, 0) x (0,0, —P) = aP(1,2,0), (9.28) 


as shown in Fig. 9.19. We have used the fact that rg is essentially constant during the 
blow (because the blow is assumed to happen very quickly) in taking rg outside the 
integral. 

e Calculate the principal moments: The principal axes are the x, y, and z axes, because 
the symmetry of the triangle makes I diagonal in this basis, as you can quickly check. 
The moments (relative to the CM) are 


Ty ma? + ma? + (2m)a? = Ama’, 
Ly = m(2a)? + m(2a)? + (2m)0? = 8ma’, (9.29) 
L=hk+h= 12ma?. 
We have used the perpendicular-axis theorem to obtain 77, although it won’t be needed 
to solve the problem. 


e Find @: We now have two expressions for the angular momentum of the system. One 
expression is in terms of the given impulse, Eq. (9.28). The other is in terms of the 


389 
C 2m 
A B 
m 4a m OF 
Fig. 9.18 
y L 
2m4C o 
Vv 
yy 
YN. . 
ft Na 
m 2a 2a m 
Fig. 9.19 


390 


Angular momentum, Part II (General L) 


moments and the angular velocity components, Eq. (9.24). Equating these gives 


(ox, lywy, 1-02) = aP(1,2,0) 


=> (4m oy, 8ma wy, 12ma?w-) = aP(1,2,0) 
P 
= (ay, @y,@:) = — (1,1,0), (9.30) 
: 4ma 


as shown in Fig. 9.19. 

Calculate the velocities relative to the CM: Right after the blow, the object rotates 
around the CM with the angular velocity found in Eq. (9.30). The velocities relative to 
the CM are then u; = œ x r;. Thus, 


P 
u4 = ø x r4 = F—(1,1,0) x (=2a, —a, 0) = (0,0, P/4m), 


P 
ug = @ x rg = 7 (1, 1,0) x (2a, —a,0) = (0,0, -3P /4m), (9.31) 


P 
uc = w x rc = — (1,1,0) x (0,a,0) = (0,0, P/4m). 
4ma 
As a check, it makes sense that ug is three times as large as uy and uc, because B is 


three times as far from the axis of rotation as A and C are, as you can verify by doing a 
little geometry in Fig. 9.19. 


e Add on the velocity of the CM: The impulse (that is, the change in linear momentum) 


supplied to the whole system is P = (0, 0, —P). The total mass of the system is M = 4m. 
Therefore, the velocity of the CM is 


P 
Vom = zz = © 0, —P/4m). (9.32) 


The total velocities of the masses are therefore 
v4 = u4 + Vom = (0,0,0), 
vg = ug + Vom = (0,0, —P/m), (9.33) 


vc =uc + Vom = (0,0,0). 


REMARKS: 


1. We see that masses A and C are instantaneously at rest immediately after the blow, and 
mass B acquires all of the imparted impulse. In retrospect, this is clear. Basically, it is 
possible for both A and C to remain at rest while B moves a tiny bit, so this is what 
happens. If B moves into the page by a small distance €, then A and C won’t know that B 
has moved, because their distances to B will change (assuming hypothetically that they 
don’t move) by a distance of order only e°. If we changed the problem and added a mass 
D at, say, the midpoint of the hypotenuse, then it would not be possible for A, C, and D 
to remain at rest while B moved a tiny bit. So there would have to be some other motion 
in addition to B’s. This setup is the topic of Exercise 9.38. 

2. As time goes on, the system undergoes a rather complicated motion. What happens 
is that the CM moves with constant velocity while the masses rotate around it in 


9.4 Two basic types of problems 


a messy manner. Since there are no torques acting on the system (after the initial 
blow), we know that L forever remains constant. It turns out that œ moves around 
L while the masses rotate around this changing w. These matters are the subject of 
Section 9.6, although in that discussion we restrict ourselves to symmetric tops, that 
is, ones with two equal moments. But these issues aside, it’s good to know that 
we can, without too much difficulty, determine what’s going on immediately after 
the blow. 

3. The object in this problem was assumed to be floating freely in space. If we instead have 
an object that is pivoted at a given fixed point, then we should use this pivot as our origin. 
There is then no need to perform the last step of adding on the velocity of the origin (which 
was the CM, above), because this velocity is now zero. Equivalently, just consider the 
pivot to be an infinite mass, which is therefore the location of the (motionless) CM. & 


9.4.2 Frequency of motion due to a torque 


Problem: Consider a stick of length £, mass m, and uniform mass density. The 
stick is pivoted at its top end and swings around the vertical axis. Assume that 
conditions have been set up so that the stick always makes an angle 0 with the 
vertical, as shown in Fig. 9.20. What is the frequency, w, of this motion? 


Solution: Our strategy will be to find the principal moments and then the 
angular momentum of the system (in terms of w), and then find the rate of 
change of L, and then calculate the torque and equate it with dL/dt. We will 
choose the pivot to be the origin.® Again, there are five standard steps that we 
must perform. 


e Calculate the principal moments: The principal axes are the axis along the stick, 
along with any two orthogonal axes perpendicular to the stick. So let the x and y axes 
be as shown in Fig. 9.21. The positive z axis then points out of the page. The moments 
(relative to the pivot) are i, = mg? /3, Iy = 0, andl; = ml? /3 (which won’t be needed). 

e Find L: The angular velocity vector points vertically (however, see the third remark 
following this solution), so in the basis of the principal axes, the angular velocity vector 
is w = (w sin, w cos 0,0), where w is yet to be determined. The angular momentum 
of the system (relative to the pivot) is therefore 


L = (ka, oy, wz) = ((1/3)mé?o sin 6, 0, 0). (9.34) 


Find dL/dt: The vector L in Eq. (9.34) points up to the right, along the x axis (at 
the instant shown in Fig. 9.21), with magnitude L = (1/3)mé2wsin@. As the stick 
rotates around the vertical axis, L traces out the surface of a cone. That is, the tip of L 
traces out a horizontal circle. The radius of this circle is the horizontal component of 
L, which is L cos 0. The speed of the tip (which is the magnitude of dL/dt) is therefore 


8 This is a better choice than the CM because this way we won’t have to worry about any messy 


forces acting at the pivot when computing the torque. The task of Exercise 9.41 is to work through 
the more complicated solution which has the CM as the origin. 


Fig. 9.21 
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(L cos 0)æ, because L rotates around the vertical axis with the same frequency as the 
stick. So dL/dt has magnitude 


dL 1 
T = (Lcos 6)w = zmo? sin 6 cos 0, (9.35) 


and it points into the page. 


Remark: With more complicated objects where J, # 0, L won’t point nicely along a 
principal axis, so the length of its horizontal component (the radius of the circle that L 
traces out) won’t immediately be obvious. In this case, you can either explicitly calculate 
the horizontal component (see the spinning-top example in Section 9.7.5), or you can just do 
things the formal way by finding the rate of change of L via the expression dL/dt = w x L, 
which holds for all the same reasons that v = dr/dt = x r holds. In the present problem, 
we obtain 


dL/dt = (w sin 8, w cos 0,0) x ((1/3)me?.o sin 6,0, 0) 
= (0,0, -—(1/3)me?a” sin 0 cos @), (9.36) 


in agreement with Eq. (9.35). And the direction is correct, because the negative z axis points 
into the page. Note that we calculated this cross product in the principal-axis basis. Although 
these axes are changing in time, they present a perfectly good set of basis vectors at any 
instant. & 


Calculate the torque: The torque (relative to the pivot) is due to gravity, which 
effectively acts on the CM of the stick. So t = r x F has magnitude 


t=rF sind = (€/2)(mg) sind, (9.37) 


and it points into the page. 
Equate t with dL/dt: The vectors dL/dt and t both point into the page, which is 
good, because they had better point in the same direction. Equating their magnitudes 


mlw sindcos@ mg sind 3g 
= = = ; 9.38 
3 2 =V 2€cosd Pan 


gives 


REMARKS: 


1. This frequency is slightly larger than the frequency that would arise if we instead had a 
mass on the end of a massless stick of length £. From Problem 9.12, the frequency in that 
case is ./g/€cos@. So, in some sense, a uniform stick of length £ behaves like a mass on 
the end of a massless stick of length 20/3, as far as these rotations are concerned. 

2. As 0 — 1/2, the frequency goes to 00, which makes sense. And as 6 — 0, it approaches 
3/28, which isn’t so obvious. 

3. As explained in Problem 9.1, the instantaneous w is not uniquely defined in some situa- 
tions. At the instant shown in Fig. 9.20, the stick is moving directly into the page. What if 
someone else wants to think of the stick as (instantaneously) rotating around the w’ axis 
perpendicular to the stick (the x axis, in the above notation), instead of the vertical axis, 
as shown in Fig. 9.22. What is the angular speed w'? 

Well, if w is the angular speed of the stick around the vertical axis, then we may view 
the tip of the stick as instantaneously moving in a circle of radius £sin@ around the 
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vertical axis œ. So œ(£ sin @) is the speed of the tip of the stick. But we may also view 
the tip of the stick as instantaneously moving in a circle of radius £ around w’, as shown. 
The speed of the tip is still œ(£ sin @), so the angular speed around this axis is given by 
o'l = w(Lsinð). Hence w = wsind, which is simply the x component of œ that we 
found above, right before Eq. (9.34). The moment of inertia around o’ is m£? /3, so the 
angular momentum has magnitude (m£? /3)(% sin 0), in agreement with Eq. (9.34). And 
the direction is along the x axis, as it should be. 

Note that although @ is not uniquely defined at any instant, L = f (r x p) dm certainly 
is.” Choosing @ to point vertically, as we did in the above solution, is in some sense the 
natural choice, because this œ doesn’t change with time. & 
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Consider a rigid body instantaneously rotating around an axis w. This œ may 
change as time goes on, but all we care about for now is what it is at a given 
instant. The angular momentum is given by Eq. (9.8) as L = Iw, where I is the 
inertia tensor, calculated with respect to a given origin and a given set of axes 
(and is written in the same basis, of course). 

As usual, things are much nicer if we use the principal axes (relative to the 
chosen origin) as the basis vectors of our coordinate system. Since these axes are 
fixed with respect to the rotating object, they will rotate with respect to the fixed 
reference frame. In this basis, L takes the nice form, 


L = (hoi, haz, 403), (9.39) 


where w1, w2, and w3 are the components of @ along the principal axes. In other 
words, if you take the vector L in space and project it onto the instantaneous 
principal axes, then you get the components in Eq. (9.39). 

On one hand, writing L in terms of the rotating principal axes allows us to 
write it in the nice form of Eq. (9.39). But on the other hand, writing L in this 
way makes it nontrivial to determine how it changes in time, because the principal 
axes themselves are changing. However, it turns out that the benefits outweigh 
the detriments, so we will invariably use the principal axes as our basis vectors. 

The goal of this section is to find an expression for dL/dt, and to then equate 
this with the torque. The result will be Euler’s equations in Eq. (9.45). 


Derivation of Euler's equations 

If we write L in terms of the body frame, which we’ll choose to be described 
by the principal axes painted on the body, then L can change (relative to the lab 
frame) due to two effects. It can change because its coordinates in the body frame 
change, and it can also change because of the rotation of the body frame. To be 
precise, let Lo be the vector L at a given instant. At this instant, imagine painting 
the vector Lo onto the body frame, so that Lo then rotates with the body. The rate 


? The nonuniqueness of w arises from the fact that Z, = 0 here. If all the moments are nonzero, then 
(Lx, Ly, Lz) = (Ixwx, lywy, lzwz) uniquely determines øw, given L. 
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of change of L with respect to the lab frame may be written in the (identically 
true) way, 
dL  d(L- Lo) g dLo 
d — dt dt ` 


(9.40) 


The second term here is simply the rate of change of a body-fixed vector, which 
we know is œ x Lo, which equals œ x L at this instant. The first term is the rate 
of change of L with respect to the body frame, which we’ll denote by ôL/ôt. 
This is what someone standing fixed on the body measures. So we end up with 


— +6 * L, (9.41) 


This is actually a general statement, true for any vector in any rotating frame 
(we'll derive it in another more mathematical way in Chapter 10). There was 
nothing particular about L that we used in the above derivation. Also, there was 
no need to restrict ourselves to principal axes. In words, what we’ve shown is 
that the total change equals the change relative to the rotating frame, plus the 
change of the rotating frame relative to the fixed frame. This is just the usual way 
of adding velocities when one frame moves with respect to another. 

Let us now make use of our choice of the principal axes as the body axes. 
This will put Eq. (9.41) in a usable form. Using Eq. (9.39), we can rewrite 
Eq. (9.41) as 


T = © hor, hon hos) + (w1, 02,03) x 101, l2%2, 1303). (9.42) 
The ôL/ôt term does indeed equal (d/dt)(11@1, 22, 1,w3), because someone 
in the body frame measures the components of L with respect to the principal 
axes to be (11@1, 2, 1303). And 5L/6t is by definition the rate at which these 
components change. 

Equation (9.42) equates two vectors. As is true for any vector, these (equal) 
vectors have an existence that is independent of the coordinate system we choose 
to describe them with (Eq. (9.41) makes no reference to a coordinate system). 
But since we’ve chosen an explicit frame on the right-hand side of Eq. (9.42), 
we should choose the same frame for the left-hand side. We can then equate the 
components on the left with the components on the right. Projecting dL/dt onto 
the instantaneous principal axes, we have 


ALY (AL) (AL) \ ho boho) easy 
dt ai dt s dt 3 = 11,4202, 1303 : 


+ (@1, 02,03) X (howi, hw, 1303). 


Remark: The left-hand side looks nastier than it really is. The reason we’ve written it in 
this cumbersome way is the following (this is a remark that has to be read very slowly). We 
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could have written the left-hand side as (d/dt)(L1, L2, L3), but this might cause confusion as to 
whether the L; refer to the components with respect to the rotating axes, or the components with 
respect to the fixed set of axes that coincide with the rotating principal axes at this instant. That 
is, do we project L onto the principal axes to obtain components, and then take the derivative 
of these components? Or do we take the derivative of L and then project onto the principal axes 
to obtain components? The latter is what we mean in Eq. (9.43).!° The way we’ve written the 
left-hand side of Eq. (9.43), it’s clear that we’re taking the derivative first. We are, after all, 
simply projecting Eq. (9.41) onto the principal axes. & 


The time derivatives on the right-hand side of Eq. (9.43) are d(,a1)/dt = 
I\@1, etc., because the /’s are constant. Performing the cross product and equating 
the corresponding components on each side yields the three equations, 


dL . 
—)}) = ho + B —h)a3a2, 
dt Ji 


dL : 
(=) = ha +  —h)a\03, (9.44) 
dt Ja 


dL . 
ele Boz + (h — l)a]. 
t J3 


We will now invoke the results of Section 8.4.3 to say that if we have chosen the 
origin of our rotating frame to be either a fixed point or the CM (as we always 
do), then we can equate dL/dt with the torque, t. We therefore have 


q = ho + (B — h)wa, 
n = ha: + (l — 3)ww3, (9.45) 


3 = bhos + (h —N)a2o. 


These are Euler $ equations. You need to remember only one of them, because 
the other two can be obtained by cyclic permutation of the indices. 


REMARKS: 


1. We repeat that the left- and right-hand sides of Eqs. (9.45) are components that are mea- 
sured with respect to the instantaneous principal axes. Let’s say we do a problem, for 
example, where t3 has a constant nonzero value, and tı and t2 are always zero (as in the 
example in Section 9.4.2). This doesn’t mean that t is a constant vector. On the contrary, 
t always points along the X3 vector in the rotating frame, but this vector is changing in 
the fixed frame (unless X3 points along œ). 

2. The two types of terms on the right-hand sides of Eqs. (9.44) are the two types of changes 
that L can undergo. L can change because its components with respect to the rotating 
frame change, and L can also change because the body is rotating around w. 


10 The former is ôL/ôt, by definition. The two interpretations certainly give different results. For 
example, if instead of L we consider a vector fixed in the body (such as the Lo above), then 
the first interpretation gives a zero result, whereas the second interpretation gives a nonzero 
result. Considering what we mean by, say, the vector (w1,@2, 3), I think that the more logical 
interpretation of (d/dt)(L,,L2,L3) is the first one, so it should definitely be avoided. 
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3. Section 9.6.1 on the free symmetric top (viewed from the body frame) provides a good 
example of the use of Euler’s equations. Another interesting application is the famed 
“Tennis racket theorem” (Problem 9.14). 

4. It should be noted that you never have to use Euler’s equations. You can simply start 
from scratch and use Eq. (9.41) each time you solve a problem. The point is that 
we’ve done the calculation of dL/dt once and for all, so you can just invoke the result 
in Eqs. (9.45). & 


9.6 Free symmetric top 


The free symmetric top is the classic example of an application of Euler’s 
equations. Consider an object that has two of its principal moments equal, with 
the CM as the origin. Assume that the object is in outer space, far from any 
external forces. We will choose the object to have cylindrical symmetry around 
some axis (see Fig. 9.23), although this is not necessary; a square cross section, 
for example, would yield two equal moments. The principal axes are then the 
symmetry axis and any two orthogonal axes in the cross-section plane through 
the CM. Let the symmetry axis be chosen as the X3 axis. Then the moments are 
l = h = Í; and J3. 

We’ll first look at things from the point of view of someone standing at rest 
on the body, and then we’ll look at things from the point of view of someone 
standing at rest in an inertial frame. The mathematics involved here isn’t so bad, 
but as with most types of top problems, it’s hard to get an intuitive feel for what 
all the various vectors are doing. And the intuition is even more difficult in the 
following body-frame analysis because of the noninertial frame of reference. But 
let’s see what we find. 


9.6.1 View from the body frame 


Plugging J = h = Z into Euler’s equations in Eq. (9.45), and using the fact that 
all the q; are zero (since there are no torques, because the top is “free’”’), we have 


0 = Ja, + U3 — I)o, 
0 = Iaz + (I — h)wiws, (9.46) 
0 = hos. 


The last equation says that w3 is constant. If we then define 


h= 
2=( =) e (9.47) 


the first two equations become 


© + lwa: = 0, and a — Qa; = 0. (9.48) 


Taking the derivative of the first of these, and then using the second to eliminate 
2, gives 


6, + Zo =0, (9.49) 
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and likewise for w2. This is a good old simple-harmonic-oscillator equation. 
We can therefore write œ; (t) as, say, a cosine. And then Eq. (9.48) yields a sine 
for w2(t). So we have 


w(t) = Acos(Qt+ $), a2(t) = Asin(Qr+ ¢). (9.50) 


We see that w(t) and w(t) are the components of a circle in the body frame. 

Therefore, the vector traces out a cone around $; (see Fig. 9.24), with frequency 

Q, as viewed by someone standing on the body. This frequency Q in Eq. (9.47) 

depends on the value of w3 and on the geometry of the object (through /3 and 7). 

But the radius, A, of the w cone is determined by the initial values of w; and %2. 
The angular momentum is 


X2 


view from 
body frame, 


L = (howi, ha, 03) = (14 cos(Qt + ġ), IA sin(Qt + ¢), Bos), (9.5) “ Q>0 (>I) 
so L also traces out a cone around $; with frequency Q, as viewed by someone Fig. 9.24 
standing on the body. This is shown in Fig. 9.24 for the case Q > 0 (that is, 
13 > T). In this case, J; > J implies L3/L2 > w3/%2, so the L vector lies above 
the æ vector (that is, between œ and X3). An object with J; > 7, such as a coin, 
is called an oblate top. X3 
Figure 9.25 shows the case where Q < 0 (that is, 4; < 7). In this case, 13 < I 
implies L3/L2 < w3/w2, so the L vector lies below the @ vector, as shown. 
And since Q is negative, the w and L vectors precess around $; in the opposite _----- 
direction (clockwise, as viewed from above). An object with J; < 7, such as a 
carrot, is called a prolate top. 


view from 
body frame, 
xı Q <0 (<I) 
Example (The earth): Lets consider the earth to be our top. Then w3 ~ 
2 /(1 day). 11 The bulge at the equator (caused by the spinning of the earth) makes /3 Fig. 9.25 
slightly larger than /, and it turns out that (/3 — J)/J ~ 1/320. Therefore, Eq. (9.47) 
gives Q ~% (1/320) 27/(1 day). So the œ vector should precess around in its cone 
once every 320 days, as viewed by someone on the earth. The true value is more like 
430 days. The difference has to do with various things, including the nonrigidity of 
the earth, but at least we got an answer in the right ballpark. This precession of œ is 
known as the “Chandler wobble.” 
In practice, how can we determine the direction of œ? Simply take an extended- 
time photograph exposure at night. The stars will form arcs of circles. At the center 
of all these circles is a point that doesn’t move. This is the direction of œ. Fortunately, 
Q is much smaller than w, so the w vector doesn’t change much during an exposure 
time of, say, an hour. So the center of the circles is essentially well defined. 
How big is the w cone, for the earth? Equivalently, what is the value of A in 
Eq. (9.50)? Observation has shown that the œ vector pierces the earth at a point on 


1L This isn’t quite correct, since the earth rotates 366 times for every 365 days, due to the motion 
around the sun, but it’s close enough for the purposes here. 
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the order of 10 m from the north pole, although this distance fluctuates over time. 12 


Hence, A/w3 ~ (10m)/Rg. The half-angle of the w cone is therefore on the order of 
only 1074 degrees. So if you use an extended-time photograph exposure one night to 
see which point in the sky stands still, and then if you do the same thing 200 nights 
later, you probably won’t be able to tell that they’re really two different points. 


9.6.2 View from a fixed frame 


Let’s now see what our symmetric top looks like from a fixed frame. Euler’s 
equations won’t help much here, because they deal with the components of œ 
in the body frame. But fortunately we can solve for the motion from scratch. In 
terms of the (changing) principal axes, $1, X2, 3, we have 

w = (1X) + @2%2) + 03%3, 

. i . (9.52) 

L = /1(@,X; + @2X2) + w383. 
Eliminating the (@;X, + w282) term from these equations gives (in terms of the 
Q defined in Eq. (9.47)) 


Ta 
L=I(@+ 2%) = ø=7L- 9i, (9.53) 


where L = |L], and L is the unit vector in the L direction. The linear relationship 
among L, w, and x3 implies that these three vectors lie in a plane. But L remains 
fixed, because there are no torques on the system. Therefore, œ and $; precess (as 
we’ll see below) around L, with the three vectors always coplanar. See Fig. 9.26 
for the case Q > 0, that is, 4; > J (an oblate top), and Fig. 9.27 for the case 
Q < 0, that is, 4 < I (a prolate top). 

What is the frequency of this precession, as viewed from the fixed frame? The 
rate of change of X3 is œ x X3, because X; is fixed in the body frame, so its change 
comes only from rotation around œ. Therefore, Eq. (9.53) gives 

dx3 Ls ` 7 La y 

Ae = (Ft- os) x X; = (Ft) x X3. (9.54) 
But this is simply the expression for the rate of change of a vector rotating 
around the fixed vector & = (L/I )L. The frequency of this rotation is |@| = L/I. 
Therefore, X3 precesses around the fixed vector L with frequency 


o (9.55) 
w = — A 
T° 


12 This distance could theoretically be much larger or much smaller than 10 m. It happens to be of 
this order due to the nature of the driving force. The present consensus for this force is pressure 
changes at the bottom of the ocean and in the atmosphere; see Gross (2000). Without a driving 
force, the amplitude would head to zero, due to the nonrigidity of the earth. 
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in the fixed frame. And therefore @ does also, because it is coplanar with 
$; and L. 


REMARKS: 


1. We just found that œ precesses around L with frequency L/I. What, then, is wrong with 
the following reasoning: “Just as the rate of change of X3 equals w x X3, the rate of change 
of œ should equal w x œw, which is zero. Therefore, w should remain constant.” The error 
is that the vector w is not fixed in the body frame. A vector A must be fixed in the body 
frame in order for its rate of change to be given by w x A. 

2. We found in Eqs. (9.51) and (9.47) that a person standing on the rotating body sees L (and 
@) precess with frequency Q = w3(/3 — J)/I around x3. But we found in Eq. (9.55) that 
a person standing in the fixed frame sees $; (and œ) precess with frequency L/J around 
L. Are these two facts compatible? Should we have obtained the same frequency from 
either point of view? (Answers: yes, no). 

These two frequencies are indeed consistent, as can be seen by the following reasoning. 
Consider the plane (call it S) containing the three vectors L, œ, and $3. We know from 
Eq. (9.51) that S rotates with frequency QX3 with respect to the body. Therefore, the 
body rotates with frequency —QX3 with respect to S. And from Eq. (9.55), S rotates with 
frequency (L/I )L with respect to the fixed frame. Therefore, the total angular velocity of 
the body with respect to the fixed frame is (using the frame S as an intermediate step) 


Be. is 
Mtotal = fap — QX3. (9.56) 


But from Eq. (9.53), this is simply @, as it should be. So the two frequencies in Eqs. (9.47) 
and (9.55) are indeed consistent. 

For the earth, /3 and / are nearly the same, so Q = @3(43 — /)/J and L/I are quite 
different. L/I is roughly equal to L/B, which is essentially equal to w3. On the other hand, 


its cone at roughly the rate at which the earth spins. But it’s not exactly the same rate, and 
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x3 
Q is roughly equal to (1/300)w3. Basically, an external observer sees œ precess around {> 


this difference is what causes the earth-based observer to see œ precess with a nonzero Q. 

3. The fixed-frame precession of $; around L should not be confused with the 
“precession of the equinoxes” effect. See Problem 10.15 for a discussion of 
the latter. & 
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Consider now a heavy symmetrical top, that is, one that spins on a table, under Fig: 9.28 


the influence of gravity (see Fig. 9.28). Assume that the tip of the top is fixed on 
the table by a pivot. We’ll solve for the motion of the top in two different ways 
below in Sections 9.7.3 and 9.7.4. The first uses t = dL/dt, and the second uses 
the Lagrangian method. 


9.7.1 Euler angles 


For both of these methods, it is convenient to use the Euler angles, 0, ġ, Y, 
which are shown in Fig. 9.29 and defined as follows. 


e 0: Let x3 be the symmetry axis of the top. Define @ to be the angle that x3 makes with 
the vertical Z axis of the fixed frame. Fig. 9.29 


XxX 


fixed point 
in body 
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e ¢: Draw the plane orthogonal to x3. Let X; be the intersection of this plane with the 
horizontal x-y plane. Define ¢ to be the angle that x; makes with the X axis in the fixed 
frame. Note that x; is not necessarily fixed in the object. 

e y: Let X2 be orthogonal to 3 and $4, as shown. As with $4, X2 is not necessarily fixed 
in the object. Let frame S be the frame whose axes are $1, X2, and X3. Define w to be 
the angle of rotation of the body around the X3 axis in frame S. So WX is the angular 
velocity of the body with respect to S. And from the figure, we also see that the angular 
velocity of frame S with respect to the fixed frame is #2 + 6%. 


The angular velocity of the body with respect to the fixed frame is equal to the 
angular velocity of the body with respect to frame S, plus the angular velocity of 
frame S with respect to the fixed frame. From above, we therefore have 


w = Wks + ($2 + 6%). (9.57) 


It is often more convenient to rewrite œ entirely in terms of the orthogonal x1, 
$2, X3 basis vectors. Since z = cos 0X3 + sin 0X2, Eq. (9.57) gives 


w = (o) + bcos 0)%&3 + ġ sin 0&2 + OX. (9.58) 


This form of æ is generally more useful, because $1, X2, 3 are principal axes of 
the body. (We are assuming that we are working with a symmetrical top, with 
h = h = 1. This means that any axes in the X)-X2 plane are principal axes.) 
Although x; and Xp are not fixed in the object, they are still good principal axes 
at any instant. 


9.7.2 Digression on the components of œ 


The above expressions for œ might look a little scary, but there is a very helpful 
diagram we can draw (see Fig. 9.30) that makes it easier to see what’s going on. 
Let’s talk a bit about this before tackling the original problem of the spinning top. 
The diagram is rather dense (you might even say it looks scarier than the above œ), 
so we’ll go through it slowly. In the following discussion, we’ll simplify things 
by setting 6 = 0. All the interesting features of œ remain. The 6%; component of 
@ in Eqs. (9.57) and (9.58) arises simply from the easily visualizable rising and 
falling of the top. We will therefore concentrate on the more complicated issues, 
namely the components of w in the plane of x3, z, and $2. 

With 6 = 0, Fig. 9.30 shows the vector w in the %3-Z-% plane (the way we’ve 
drawn it, X; points into the page, in contrast with Fig. 9.29). We’ll refer to this 
figure many times in the problems for this chapter. There are numerous comments 
to be made about it, so we’ll just list them out. The following discussion deals 
with the kinematics of œw, that is, the meaning of the various components and how 
they relate to each other. The discussion of the dynamics of œ, that is, why the 
components take on the values they do, given a certain physical system, is the 
subject of Section 9.7.3 onward. 
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1. If someone asks you to “decompose” @ into pieces along z and x3, what would you 
do? Would you draw the lines perpendicular to these axes to obtain the lengths shown 
(which we’ll label as wz and w3), or would you draw the lines parallel to these axes to 
obtain the lengths shown (which we’II label as Q and w’)? There is no “correct” answer 
to this question. The four quantities, œz, 3, 2, œ simply represent different things. 
We will interpret each of these below, along with w (the projection of w along $2). 
It turns out that Q and w’ are the frequencies that your eye can see the easiest, while 
œw and w3 are what you want to use when doing calculations involving the angular 
momentum. But as far as I can see, œz is not of much use. 

2. Note that it is true that 


@ = o'z + NÊ, (9.59) 
but it is not true that œw = w3X3 + œwz2. Another true statement is 
@ = 03X3 + 2X2. (9.60) 


3. In terms of the Euler angles, we see by comparing Eqs. (9.59) and (9.57), with 6 = 0, 
that 


wo =, and Q=¢. (9.61) 
And we also have, by comparing Eqs. (9.60) and (9.58), with 6 = 0, 


wz = ý +ġcosð = a + Qos, 
(9.62) 
o2=¢@sind = Qsind. 
These are also clear from Fig. 9.30. There is therefore technically no need to introduce 
the new w2, 3, Q, œ definitions in Fig. 9.30, because the Euler angles are quite 
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13 


sufficient. But we will be referring back to this figure many times, and it is a little 
easier to work with these omegas than the various combinations of the Euler angles. 


. Q is the easiest of the frequencies to visualize. It is the frequency of precession of 


the top around the vertical z axis. 1? In other words, the symmetry axis X3 traces out a 
cone (assuming 6 = 0) around the Z axis with frequency Q. The reason for this is the 
following. The vector is the vector that gives the speed of any point (at position r) 
fixed in the top as œ x r. Therefore, since the vector X3 is fixed in the top, we can write 


dX3 ` ra Se a R š 
ie = 0 X X3 = (w x3 + QZ) x &3 = (QZ) x X3. (9.63) 


But this is precisely the expression for the rate of change of a vector rotating around the 
z axis with frequency Q. (This is exactly the same type of proof as the one leading to 
Eq. (9.54).) Note that the precession frequency around the Ê axis is not wz. It certainly 
can’t be wz, because we can imagine grabbing the symmetry axis and holding it in 


place, so that œ points along X3. This scenario has a nonzero wz, but no precession. 


Remark: In the derivation of Eq. (9.63), we basically just stripped off the part of œ that 
points along the X3 axis, because a rotation around $; contributes nothing to the motion 
of x3. Note, however, that there is in fact an infinite number of ways to strip off a piece 
along X3. For example, we can also break w up as, say, @ = w3X3 + w282. We then obtain 
dx3/dt = (@2X2) x X3, which means that $; is instantaneously rotating around X2 with 
frequency w2. Although this is true, it isn’t as useful as the result in Eq. (9.63), because the 
X2 axis changes with time (it precesses around Zz). The point here is that the instantaneous 
angular velocity vector around which the symmetry axis rotates is not well defined (Problem 
9.1 discusses this issue).'* But the Z axis is the only one of these angular velocity vectors 
that is fixed. When we look at the top (or more precisely, the symmetry axis), we therefore 
see it precessing around the z axis. & 


. @ is also easy to visualize. Imagine that you are at rest in the frame that rotates around 


the z axis with frequency Q. Then you see the symmetry axis of the top remain perfectly 
still, and the only motion you see is the top spinning around this axis with frequency 
w’. (This is true because œ = w/X3 + Q2, and the rotation of your frame causes you 
not to see the Qz part.) If you paint a dot somewhere on the top, then the dot traces 
out a fixed tilted circle, and the dot returns to, say, its maximum height at frequency 
w’. A person in the lab frame sees this dot undergo a rather complicated motion but 
must observe the same frequency at which the dot returns to its highest point. So œ” is 
something quite physical in the lab frame also. 


. @3 is what you use to obtain the component of L along $3, because L3 = 13.03. w3 


is a little harder to visualize than Q and a’, but it is the frequency with which the top 


Although we’re using the same letter, this Q doesn’t have anything to do with the Q defined in 
Eq. (9.47), except for the fact that they both represent the frequency of something precessing 
around an axis. 

The instantaneous angular velocity of the whole body is well defined, of course. There is a definite 
line of points in the body that are instantaneously at rest. But if you look at the symmetry axis by 
itself, then there is an ambiguity (see Footnote 9). In short, because only one point on the axis (the 
bottom end), instead of a whole line, is instantaneously at rest, the instantaneous angular velocity 
vector can point in any direction. 
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instantaneously rotates, as seen by someone at rest in the frame that rotates around the 
instantaneous X2 axis with frequency w. (This is true because w = w282 + w383, and 
the rotation of the frame causes the person not to see the w 7X part.) This rotation is 
harder to vizualize in the lab frame, because the X2 axis changes with time. 

There is one physical scenario in which w3 is the easily observed frequency. Imagine 
that the top is precessing around the z axis at constant 0 (we’ll find in Section 9.7.5 that 
this is in fact a possible motion for the top), and imagine that the top has a frictionless 
rod protruding along its symmetry axis. If you grab the rod and stop the precession 
motion, so that the top is now spinning around its stationary symmetry axis, then this 
spinning has frequency w3. This is true because when you grab the rod, your torque 
has no component along the $; axis (because the rod lies along this axis, and because 
it is frictionless). Therefore, L3} doesn’t change, and so neither does w3. 

7. œ is similar to w3, of course. w2 is what you use to obtain the component of L along $7, 
because L2 = %2. It is the frequency with which the top instantaneously rotates, as 
seen by someone at rest in the frame that rotates around the instantaneous X3 axis with 
frequency w3. (This is true because œ = w282 + 3X3, and the rotation of the frame 
causes the person not to see the 33 part.) Again, this rotation is harder to vizualize in 
the lab frame, because the x3 axis changes with time. Note that by “instantaneous 3 
axis,” we mean the fixed axis in space that coincides with the symmetry axis at a given 
instant. The symmetry axis will therefore move away from this fixed axis, consistent 
with the fact that the person in the above-mentioned rotating frame sees the top rotate 
around the X axis. 

The physical scenario that produces w2, analogous to the scenario that produced w3 
above, is the following. Imagine a frictionless rod glued to the top at its tip, perpen- 
dicular to the symmetry axis, so that they form a “T.” As this rod is spinning around 
(ignore the fact that it has to keep passing through the table), grab it at the instant it 
points along $2. The top will then rotate with frequency w2 around the fixed rod. This 
is true for reasons analogous to the ones in the w3 case above. 

8. wz is not very useful, as far as I can see. The most important thing to note about wz 
is that it is not the frequency of precession around the z axis, even though it is the 
projection of œ onto z. The frequency of the precession is Q, as we found above in 
Eq. (9.63). A true, but somewhat useless, fact about œz is that if someone is at rest 
in the frame that rotates around the z axis with frequency wz, then she sees all points 
in the top instantaneously rotating around the horizontal x axis with frequency wy, 
where «x is the projection of w onto the x axis. (This is true because w = w,X + @zZ, 


and the rotation of the frame causes her not to see the wzZ part.) 


9.7.3 Torque method 


Let’s now finally solve for the motion of a heavy top. This first method involving 
torque is straightforward, although a bit tedious. We include it here to (1) show 
that the problem can be done without resorting to a Lagrangian, and to (2) get 
some practice using T = dL/dt. We’ll make use of the form of œ given in 
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Eq. (9.58), because there it is broken up into the principal-axis components. For 
convenience, define 8 = y + ¢cos8, so that 


w = îs + ġ sin OX + OX. (9.64) 


Note that we’ve returned to the most general motion, where 6 is not necessarily 
zero. For our origin, we’ ll choose the tip of the top, which is assumed to be fixed 
on the table.!> Let the principal moments relative to this origin be J) = h = J, 
and /3. The angular momentum of the top is then 


L = BË; + 1¢ sin 9X + 10%). (9.65) 


We must now calculate dL/dt. What makes this nontrivial is the fact that the x1, 
X2, and X3 unit vectors change with time (they change with 6 and @). But let’s 
forge ahead and take the derivative of Eq. (9.65). Using the product rule (which 
works fine with the product of a scalar and a vector), we have 


dL d(¢ sin 0 dé 
I (ġ sin Jeng 0. 


dp . 
-bÉ ist 


I 
di di de 
ip a en ee 9.66 
a a t ọsin a ae (9: ) 
Using a little geometry, you can show that 
dx ; ; 
= = —0X) + $ sin 6X), 
dx s ; 
— = 6% — ġ cos0ĝi, (9.67) 
dt 
dX i api ; F 
T = —¢ sin 0x3 + cos OX?. 


As an exercise, you should verify these by making use of Fig. 9.29. In the 
first equation, for example, show that a change in 6 causes X3 to move a cer- 
tain distance in the X2 direction; and show that a change in @ causes X3 to move 
a certain distance in the $; direction. Plugging the derivative expressions from 
Eq. (9.67) into Eq. (9.66) gives, after some algebra, 


dL in. a - ks 
= Bis + (18 sin 6 + 2166 cos — In6) 


+ (1 — Iġ? sin 6 cos8 + bÊġ sin 03). (9.68) 


Let’s now look at the torque on the top. This arises from gravity pulling down 
on the CM. So from Fig. 9.29, t points in the x; direction and has magnitude 


15 We could use the CM as our origin, but then we would have to include the complicated forces 
acting at the pivot point, which is difficult. But see Problem 9.19 for the case where the tip is free 
to slide on a frictionless table. 
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Mg£ sin @, where £ is the distance from the pivot to the CM. Using Eq. (9.68), 
the third component of t = dL/dt quickly gives 


B=0. (9.69) 
Therefore, B is a constant, which we’ll call w3 (an obvious label, in view of 
Eq. (9.64)). The other two components of t = dL/dt then give 
Id sin + 6(21¢ cos — baz) = 0, 
(Mgt +167 cos — 1303) sind = IË. (9.70) 


We’ ll wait to fiddle with these equations until we have derived them again using 
the Lagrangian method. 


9.7.4 Lagrangian method 


Equation (9.15) gives the kinetic energy of the top as T = œ - L/2. Using 
Eqs. (9.64) and (9.65), we have (writing yy + ġ cos 8 instead of the shorthand f)!° 


bere nod Da lre oiia 
T = 50: L= ZBC) + ġ cos0)? + 519? sino + 6”). (9.71) 


The potential energy is 
V = Mg£cos9, (9.72) 


where £ is the distance from the pivot to the CM. The Lagrangian is £ = T — V 
(we’ll use “£” here to avoid confusion with the angular momentum “L”), and so 


the equation of motion obtained from varying w is 
CK y LG + G0080) =0 (9.73) 
dtaw aw dt a le l 


Therefore, yy + ¢ cos 8 is a constant. Call it w3. The equations of motion obtained 
from varying ¢ and 8 are then (making use of w + ¢cos@ = w3) 


dak ƏL d di 

dap a6 zi (i303 cos + Ibsin?#) = 0, 

dak ƏL i : : 

a = IÖ = (Mgé + Id? cos@ — a3) sind. (9.74) 


Taking the derivative in the first equation, we see that these equations are identical 
to those in Eq. (9.70). 


'6 Tt was ok to use £ in the previous subsection. We introduced it only because it was quicker to write. 
But we can’t use it here, because it depends on the other coordinates, and the Lagrangian method 
requires the use of independent coordinates. The variational proof back in Chapter 6 assumed this 
independence. 
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Note that there are two conserved quantities, arising from the facts that dL /dw 
and 0£/d@ equal zero. The conserved quantities are the angular momenta in 
the $; and z directions, respectively. This is true because from Eq. (9.65), the 
former is L3 = hw and the latter is L- = L3 cos6 + L2 sin 0 = (33) cos + 
(Iġ sin 6) sin 0. These angular momenta are conserved because the torque points 
in the X; direction, so there is no torque in the plane spanned by $; and 2. 


9.7.5 Spinning top with 6 = 0 


A special case of Eqs. (9.70) occurs when 6 = 0. In this case, the first of 
Eqs. (9.70) says that ġ is constant. The CM of the top therefore undergoes uni- 
form circular motion in a horizontal plane. Let Q = ġ be the frequency of this 
motion (this is the same notation as in Eq. (9.61)). Then the second of Eqs. (9.70) 
becomes 


IQ? cos@ — boQ + Mgl = 0. (9.75) 


This quadratic equation can be solved to yield two possible precessional frequen- 
cies, Q, for the top. And yes, there are indeed two of them, provided that 3 is 
greater than a certain minimum value. 

The previous pages in this “Heavy symmetric top” section have been a bit 
abstract, so let’s now take a breather and rederive Eq. (9.75) from scratch. That 
is, we'll assume 6 = 0 from the start of the solution, and then solve things by 
finding L and using t = dL/dt, in the spirit of Section 9.4.2. In practice, this 
strategy of starting from scratch is invariably the best route to take, as you’ Il see 
in the problems and exercises for this chapter. The procedures in Sections 9.7.3 
and 9.7.4 are good to know, but the technique in the following example provides 
a much more intuitive way of looking at things. This example is the classic “top” 
problem. We’ll warm up by solving it in an approximate way. Then we’ll do it 
for real. 


Example (The top): Asymmetric top of mass M has its CM a distance £ from its 
pivot. The moments of inertia relative to the pivot are J; = h = IJ, and J3. The top 
spins around its symmetry axis with frequency 3 (in the language of Section 9.7.2), 
and initial conditions have been set up so that the CM precesses in a circle around 
the vertical axis. The symmetry axis makes a constant angle 6 with the vertical (see 
Fig. 9.31). 


(a) Assuming that the angular momentum due to w3 is much larger than any other 
angular momentum in the problem, find an approximate expression for the 
frequency, Q, of precession. 

(b) Now solve the problem exactly. That is, find Q by considering all of the angular 


momentum. 
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Solution: 

(a) The angular momentum (relative to the pivot) due to the spinning of the top 
has magnitude L3 = 133, and it is directed along x3. Let’s label this angular 
momentum vector as L3 = L383. As the top precesses, L3 traces out a cone 
around the vertical axis. So the tip of L3 moves in a circle of radius L3 sin 8. 
The frequency of this circular motion is the frequency of precession, Q. So 
dL3/dt, which is the velocity of the tip, has magnitude 


Q (L3 sin 8) = Qoz sind, (9.76) 


and it is directed into the page. 

The torque (relative to the pivot) is due to gravity acting on the CM, so it has 
magnitude Mg£ sin @, and it is directed into the page. Therefore, tT = dL/dt 
gives 


Mgt 


aa (9.77) 


This is independent of 6, and it is inversely proportional to w3. 

(b) The error in the above analysis is that we omitted the angular momentum aris- 
ing from the X2 (defined in Section 9.7.1) component of the angular velocity 
due to the precession of the top around the z axis. This component has magni- 
tude Qsin@.!’ Therefore, the angular momentum due to the angular velocity 


component in the X direction has magnitude 
L = IQ sinð. (9.78) 


Let’s label this part of the angular momentum as L? = L282. The total L = 
Ly + L3 is shown in Fig. 9.32. L precesses around in a cone, so only its 


horizontal component (call it Lı ) changes. From the figure, the length Z| is 
the difference in lengths of the horizontal components of L3 and L2. Therefore, 


Lı =L3sin@ — L cos 8 = 1303 sin 0 — IQ sin 0 cos 0. (9.79) 


The magnitude of the rate of change of L is!® 


dL 


ae QL, = 21303 sind — IQ sin 8 cos 6). (9.80) 


Both t (which has magnitude Mg sin 0) and dL/dt point into the page, so 


equating their magnitudes gives 


IQ? cos 6 — 032 + Mee = 0, (9.81) 


17 The angular velocity due to the precession is 922. We can break this up into components along 
the orthogonal directions $2 and $3. The Q cos@ component along X3 was already absorbed into 
the definition of w3 (see Fig. 9.30). 

'8 This result can also be obtained in a more formal way. Since L precesses with angular velocity 
QŻ, the rate of change of L is dL/dt = Qz x L. If you compute this cross product in the x1, x2, x3 
basis, you will obtain the result in Eq. (9.80). 
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in agreement with Eq. (9.75), as we wanted to show. The quadratic formula 
quickly gives the two solutions for Q, which may be written as 


Q= ho3 it he 4MI[gé cos 0 
2I cos 0 Roy 


(9.82) 


Note that if 0 = 7/2, then Eq. (9.81) is actually a linear equation, so there is 
only one solution for Q, which is the one in Eq. (9.77). The reason for this is 
that Lz points vertically, so it doesn’t change. Only L3 contributes to dL/dt, 
so the approximate solution in part (a) is in fact an exact solution. Because of 
this simplification, a top is much easier to deal with when its symmetry axis is 
horizontal. 

The two solutions in Eq. (9.82) are known as the fast-precession and slow- 
precession frequencies. For large w3, you can show that the slow-precession 
frequency is 
_ Met 


Q ~ ; 
hw 


(9.83) 


in agreement with the solution found in Eq. (9.77).!? This task, along with 
many other interesting features of this problem (including the interpretation of 
the fast-precession frequency, 02), is the subject of Problem 9.17, which you 


are encouraged to do. 


9.7.6 An “explanation” of precession 


The fact that a top can precess slowly around in a circle without simply falling 
down (as a simple pendulum would do) is rather bizarre. We showed above that 
this precession can be deduced perfectly well from t = dL/dt, but it would be 
nice if there was a more intuitive way to explain it, based at least somewhat on 
F = ma. Although a completely satisfactory intuitive explanation eludes me, I 
think the following discussion will clear some things up. This discussion will be 
qualitative (so no equations or numbers), but it should still suffice in explaining 
for the most part what’s going on with precession. 


Impulse applied to a dumbbell 

Let’s first look at a simple system consisting of a dumbbell with a massless stick 
glued perpendicular to it at its center. The dumbbell rotates around the stick which 
is held fixed; see Fig. 9.33. Let the y and z axes be defined as shown, with the x 
axis pointing out of the page. The total angular momentum points to the right, in 
the positive y direction. We’ll ignore gravity for now. Equivalently, we can have 
the dumbbell pivoted at its center. 


19 This is fairly clear. If w3 is large enough compared with Q, then we can ignore the first term in 
Eq. (9.81). That is, we can ignore the effects of L2, which is exactly what we did in the approximate 
solution in part (a). 
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At the instant the masses are in the plane of the paper, as shown (with the 
top one coming out of the page and the bottom one going in), we’ll apply equal 
and opposite small impulses to the stick, upward on the right end and downward 
on the left; see Fig. 9.34.7° Assume that the forces are applied for infinitesimal 
periods of time, but that the forces are large enough so that the impulse is nonzero. 
What happens to the masses? In particular, what happens to the plane of their 
rotation? 

Because the structure is rigid, the impulsive forces on the stick cause the two 
masses to pick up small velocity components in the +y directions, as shown 
in Fig. 9.34. If the forces are applied for an infinitesimal time ¢, then these 
velocity components are of the form v = at (with ¢ very small and a very large). 
Additionally, the masses move a distance d = at*/2 to the side. But because 
of the two powers of the infinitesimal time ¢ here, this distance is negligible. 
In other words, the masses pick up a nonzero v, but essentially no d. This is a 
general result when an object is struck with a hammer: right after the blow, it has 
a nonzero speed, but essentially zero displacement. 

A top view (with the z axis pointing out of the page) of the velocities of the two 
masses right after the blow is shown in Fig. 9.35. The dotted lines represent the 
velocity of the bottom mass which is behind (that is, below) the top mass. Now, 
if someone gives you these two velocities and doesn’t tell you what was going 
on beforehand, then you will simply say that the dumbbell is rotating around in a 
circle that lies in the vertical plane defined by the line of the new velocity vectors 
in Fig 9.35. In other words, the plane of the circular motion has been rotated 
around the vertical z axis. A top view of the situation is shown in Fig. 9.36. From 
this time onward, the masses rotate around in the new vertical plane. This means 
that the angular momentum of the dumbbell has picked up a component in the 
positive x direction (out of the page in Fig. 9.34 and downward in Fig. 9.36), 
consistent with the fact that the torque from the two applied forces points in the 
positive x direction. 

This example illustrates the bizarre fact that if you smack the rotation axis in 
one direction (vertical here), it will head off in another direction (horizontal). 


Impulse applied to a symmetric top 

Let’s now consider the more complicated situation where instead of the above 
dumbbell we have a symmetric top, for example, a flat disk with a stick pok- 
ing through its center. (We’ll still ignore gravity for the moment.) Things are 
more difficult now, because if we apply our impulsive forces to the stick, the 
plane of the disk can’t instantaneously rotate as it did above, because this would 
involve the “side” points on the disk moving a finite distance in zero time, as they 


20 We're applying equal and opposite forces just so that the CM doesn’t move. And we’re doing this 
for no reason other than simplicity. The motion of the CM is irrelevant for the point we want to 
make in this dumbbell setup. 
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move from the old plane of rotation to the new plane. So what does the motion 
look like? 

The rough answer is that our symmetric top is vaguely the same type of object 
as the above dumbbell, so the motion should look roughly the same. In other 
words, the axis of the top should end up pointing (in one way or another) a 
little bit in the x direction, as it did with the dumbbell. But the precise answer 
is that since we have a free top here, we know from the free-top discussion in 
Section 9.6.2 exactly what happens: the symmetry axis of the disk precesses 
(along with the angular velocity vector) in a thin cone around the new angular 
momentum vector, which points slightly out of the page due to the torque in the x 
direction. So although the symmetry axis of the disk doesn’t point in the definite 
direction along L as it did with the dumbbell, it points on average along L, which 
is slightly in the x direction. 

Let’s now bring gravity into the setup. It turns out that we can consider a heavy 
top’s precession to be the result of a succession of many little impulsive blows 
to a free top that is otherwise in freefall (where gravity provides no torque). The 
reasoning is as follows. 

Imagine that we hold our spinning top (not pivoted anywhere) and move it 
sideways (perpendicular to the stick) at a given constant speed; this speed will 
have to be chosen to be a particular value (see Footnote 23 below). And then we 
let go. Gravity provides a force on the CM, but zero torque around the CM, so 
the top is simply in freefall with the constant horizontal speed that we gave it. 
But let’s assume that immediately after we let go, we keep the top at a constant 
height by applying a very quick and very small upward strike to the end of the 
stick, and then waiting for a very short time while the CM rises and falls in its 
freefall motion, and then repeating the process indefinitely (let’s say we make 
100 tiny strikes each second). If we arrange for the time-averaged upward force 
to equal mg, then the CM stays at (essentially) a constant height. 

After each strike, the axis of the top undergoes its free-top precession in a very 
thin cone, so if the CM of the top weren’t moving, the end of the stick would 
end up pointing slightly to the side, along the direction of the new L.! But the 
sideways motion of the top (due to the properly chosen original speed we gave it) 
exactly brings the end of the stick back to its original location; see the top view in 
Fig. 9.37. The dot in the figure represents a fixed point in space. We can imagine 
this process taking place in two steps. In step 1, the axis changes its direction 
due to the angular impulse relative to the CM. And in step 2, the top moves 
sideways (due to the original horizontal speed we gave it) during its rising and 
falling projectile motion. As the process repeats, the CM moves around in a circle, 


21 Assume that the stick’s tiny precession cone around L somehow damps out, so that the stick 
eventually points in the definite direction along L. Since we’re eventually going to consider the 
end of the stick to be located at a pivot, this is a reasonable assumption. 
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with the end of the stick remaining (essentially) fixed.”” In other words, we have 
recreated our precessing top.”° And since we can consider the continuous upward 
force that a pivot applies to a heavy top to be a succession of quick infinitesimal 
strikes, we see that a heavy top precessing around a pivot is essentially the same 
as the above top precessing around the fixed dot in space in Fig. 9.37. 

The above reasoning is only qualitative, but it should make the precession of 
a heavy top a little more believable. Of course, since we’ve argued that a heavy 
top pivoted at a point can be considered to be a free top undergoing a succession 
of impulsive and free-top motions, we’ve just shifted the burden of proof to an 
intuitive understanding of why a free top precesses the way it does. But that 
makes my head hurt, so I’Il stop here. But at least we know that a free top should 
behave more or less like the dumbbell setup above, for which we showed (by 
actually looking at the forces) why the axis of rotation shifts to the side when 
vertical forces are applied. 


9.7.7 Nutation 


We will now solve Eq. (9.70) in a somewhat more general case, where @ is 
allowed to vary slightly. That is, we will consider a slight perturbation to the 
circular motion associated with Eq. (9.75). We will assume that w3 is large here, 
and we will assume that the original circular motion corresponds to the slow 
precession, so that ¢ is small. Under these assumptions, we will find that the top 
will bounce around slightly as it travels (roughly) in a circle. This bouncing is 
known as nutation. 

Since ġ is small compared with w3, we can (to a good approximation) ignore 
the middle terms on the left-hand sides of Eqs. (9.70) to obtain 


Iġ sint — 603 = 0, 


(9.84) 
(Mel — a3) sind = IË. 


We must somehow solve these equations for 6(t) and ¢ (t). Taking the derivative 
of the first equation and dropping the quadratic term (which is negligible for 
sufficiently small perturbations) gives 6 = (I sin 0 /I3w3) d*@/dt*. Substituting 


this expression for Ö into the second equation gives 
dq : 
+ ak - 2) =0, (9.85) 
22 We’ll need to apply a force directed along the axis, to provide the centripetal acceleration of the 
precessing CM. But this force produces no torque around the CM, so it doesn’t affect any of the 
other aspects of the problem. 
23 Tf the initial conditions aren’t set up properly, then the top will bounce up and down as it precesses 
around (this is known as nutation, described below in Section 9.7.7). In particular, if you hold 
the axis at rest and then let go, the top will initially fall straight down. So your intuition works 
perfectly fine here. But as time goes by, more complicated things start to happen, as explained in 
Remark 6 in the nutation section below. 
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where 


ha; Mgt 


On = Ea and Q, = (9.86) 


a3 

are, respectively, the frequency of nutation (as we shall soon see) and the slow- 
precession frequency given in Eq. (9.77). Shifting variables to y = @ — Qs in 
Eq. (9.85) gives us the nice harmonic-oscillator equation, } + w2y = 0. Solving 
this and then shifting back to ¢ yields 


f(t) = Qs +A coslwnt +y), (9.87) 
where A and y are determined by the initial conditions. Integrating this gives 
AY, 
P(t) = Qst + (£) sin(@nt + y), (9.88) 
n 


plus an irrelevant constant. 
Now let’s solve for 8 (t). Plugging our ¢ (t) into the first of Eqs. (9.84) gives 


E Ising , . . 

O(t) = — 7 Awn SIN(Mpt + y) = —A sin 0 sin(@pt + y), (9.89) 
303 

where we have used the definition of wn in Eq. (9.86). Since 8 (t) doesn’t change 

much, we can set sind œ sin 0o, where 6p is, say, the initial value of 8 (t). Any 

errors here will be second-order effects in small quantities. Integration then gives 


A 

Ot) =B+ (= sin on) cos(@nt + y), (9.90) 
Wn 

where B is a constant of integration. Equations (9.88) and (9.90) show that both 

¢ (neglecting the uniform Qst part) and @ oscillate with frequency œn, and with 

amplitudes inversely proportional to wn. Note that Eq. (9.86) says that œn grows 

with w3. 


Example (Sideways kick): Assume that uniform circular precession is initially 
taking place with 9 = 09 and $ = Qs. You then give the top a quick kick along the 
direction of motion, so that ġ suddenly becomes Qs + AQ (AQ may be positive or 
negative). Find ¢ (£) and 0 (t). 


Solution: This is an exercise in initial conditions. We are given the initial values 
of @, 6, and 6 (namely Qs + AQ, 0, and 4p, respectively), and our goal is to solve 
for the unknowns A, B, and y in Eqs. (9.87), (9.89), and (9.90). 6 is initially zero, so 
Eq. (9.89) gives y = 0 (or x, but this leads to the same answer). And ¢ is initially 
Qs + AQ, so Eq. (9.87) gives A = AQ. Finally, 6 is initially 09, so Eq. (9.90) gives 
B= 09 — (AQ/a@p) sin 69. Putting this all together, we have 
AQY , 
P(t) = Qst + | — | sin apt, 
a (9.91) 
AQ. l 
O(t) = 0o — | — sin 6o | (1 — cos wnt). 
w 


n 
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And for future reference (for the problems in this chapter), we’ll also list the 
derivatives, 


$À = Qs + AQ cos wnt, 
; (9.92) 
O(t) = —AQ sin 69 sin wnt. 


REMARKS: 


1. Remember that all of this analysis holds only if 6 and ¢ are small compared with %3, 
and if 6 is always near 6. 

2. For the initial setup we have chosen (that is, for 6 = 0), Eq. (9.91) shows that 0 
always stays on one side of 09. If AQ > 0, then 0 (£) < 8o for all ż (that is, the top is 
always at a higher position, since 6 is measured from the vertical). If AQ < 0, then 
A(t) > 0o for all ¢ (that is, the top is always at a lower position). 

3. Consider the point that is a distance £ from the origin and that travels in a horizontal 
circle with angular coordinates given by (¢,9)avg = (Qst, 4 — (AQ/a@n) sin 60). 
This is the “average” position of the CM, in the sense that Eq. (9.91) gives the angular 
coordinates of the CM relative to this point as 


AQ 
($, Orel = ( 
w]; 


) (sin @nź, sin ĝo COS @nt). (9.93) 

n 
The sin ĝo in the second coordinate here implies that the amplitude of the 0 oscillation 
is sin 0 times the amplitude of the ¢ oscillation. This is precisely the factor needed to 
make the CM travel in a circle, as viewed by someone riding along with coordinates 
($, @)avg, because a change in 0 causes a displacement of £ d8, whereas a change in 
ġ causes a displacement of £ sin 6o dẹ. 

4. Figure 9.38 shows plots of 6(f) vs. sin@o@(t) for various values of AQ. The 
dots at the start of each of the nine plots all represent the same starting point at 
0 = Oo. The plots are stacked on top of each other for comparison only; the ver- 
tical spacing between them is meaningless. We have chosen the horizontal axis 
to be sin @9@(t) instead of (t), and we have chosen the vertical axis to have 0 
increasing downward, so that these plots are exactly the paths you would see the 
CM trace out in space. We have picked arbitrary values of Qs and œn to gener- 
ate the plots (hence no numbers on the axes, since they wouldn’t mean much), but 
no matter what values are picked, the shapes of the plots are still the same (for 


example, the AQ = +Q, paths always have cusps). The oscillations have different 
frequencies and amplitudes, but both axes get scaled by the same amount (as you can 
check). 


5. From the figure, we see that the motions associated with AQ and — AQ look the same 
except for being shifted vertically (relative to the starting dots, which represent the 
same point), and also horizontally by half a cycle. This can be seen from Eq. (9.91); 
changing AQ to — AQ has the effect of shifting the constant term in 6(f), and also 
shifting the time by half a cycle, because AQ sin apt = (—AQ) sin(wnt + 7), and 
likewise for cosine. 

6. In the AQ = —Q,y case, the CM starts at rest. This corresponds to holding the axis 
of the top at rest and then dropping it. From Fig. 9.38 we see that initially the CM 
simply falls straight down, as your intuition suggests. (It might not be clear from the 
figure, but Problem 9.25 shows that the curve is indeed vertical at the high points in 
the motion.) But then the strangeness of angular momentum takes over, and the top 
ends up bouncing and precessing instead of continuing to fall as a point mass would. 

Qualitatively, we can understand this bouncing and precessing as follows. Let the 
axis and the angular momentum initially point to the right as you hold the axis, as 
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sin8yd(t) —> 


shown in Fig. 9.39. After you let go, a number of things happen: (1) Because of the 
downward gravitational force, the top starts to fall. The axis therefore now points 
slightly downward, which means that the angular momentum picks up a downward 
component. Relative to the CM, the force from the pivot must therefore provide a 
downward torque. By the right-hand rule, this force must point into the page. (2) 
From F = ma, this inward force causes the top to accelerate into the page. This 
makes the axis point slightly into the page, so the angular momentum picks up a 
component in that direction. By the right-hand rule, there must then be an upward 
force from the pivot to provide the necessary torque for this. (3) This upward force 
slows down the downward motion and in fact eventually becomes greater than mg 
and causes the CM to rise back up. This means that the angular momentum increases 
its vertical component, so by the right-hand rule there must be a force from the pivot 
that points out of the page to provide the necessary torque for this. (4) This outward 
force then slows down the motion into the page and eventually stops it, right when 
the top returns to its original height. The top is momentarily at rest, and then the 
process repeats itself. Of course, this qualitative reasoning doesn’t show that all the 
details work out correctly, but at least it makes the motion a little more believable. 


. Ifyou start by holding the axis of the top at rest, as in the previous remark, the AQ = 0 


case corresponds to giving the axis the proper initial push into the page (instead of 
just dropping it, as above) so that the resulting change in angular momentum into the 
page requires an upward force of mg from the pivot. The CM then stays at the same 
height and simply moves in a horizontal circle without bouncing. & 


9.8 Problems 


9.8 Problems 


Section 9.1: Preliminaries concerning rotations 


9.1. 


9.2. 


9.3. 


Many different @’s x 


Consider a particle at the point (a,0,0), with velocity (0, v,0). At this 
instant, the particle may be considered to be rotating around many differ- 
ent w vectors passing through the origin. There isn’t just one “correct” œ. 
Find all the possible w’s (give their directions and magnitudes). 


Fixed points on a sphere «« 


Consider a transformation of a rigid sphere into itself. Show that two 
points on the sphere end up where they started. 


Rolling cone +» 


Acone rolls without slipping on a table. The half-angle at the vertex is a, 
and the axis has length h (see Fig. 9.40). Let the speed of the center of 
the base, point P in the figure, be v. What is the angular velocity of the 
cone with respect to the lab frame at the instant shown? There are many 
ways to do this problem, so you are encouraged to take a look at the three 
given solutions, even after solving it. 


Section 9.2: The inertia tensor 


9.4. 


Parallel-axis theorem 


Let (X, Y, Z) be the position of an object’s CM, and let (x’, y’,z’) be the 
position relative to the CM. Prove the parallel-axis theorem, Eq. (9.19), 
by setting x = X + x', y = Y + y', and z = Z +2’ in Eq. (9.8). 


Section 9.3: Principal axes 


93; 


9.6. 


A nice cylinder x 


What must the ratio of height to radius of a cylinder be so that every axis 
is a principal axis (with the CM as the origin)? 


Rotating square x 


Here’s an exercise in geometry. Theorem 9.5 says that if the moments of 
inertia around two principal axes are equal, then any axis in the plane of 
these axes is a principal axis. This means that the object will happily rotate 
around any axis in this plane, that is, no torque is needed. Demonstrate this 
explicitly for four equal masses in the shape of a square, with the center 
as the origin, which obviously has two moments equal. Assume that the 
masses are connected by strings to the axis, as shown in Fig. 9.41, and 
that they all rotate with the same w around the axis, so that they remain in 
the shape of a square. Your task is to show that the tensions in the strings 
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9.7. 


9.8. 


are such that there is no net torque acting on the axis, relative to the center 
of the square. 


Existence of principal axes for a pancake « 


Given a pancake object in the x-y plane, show that there exist principal 
axes by considering what happens to the integral f xy when the coordinate 
axes are rotated about the origin by an angle of 7/2. 


Symmetries and principal axes for a pancake xx 


A rotation of the axes in the x-y plane through an angle @ transforms the 
coordinates according to (you can accept this) 


x! cosôð sin@\ /x 
C) 7 é sin cos >) C) : mam 


Use this to show that if a pancake object in the x-y plane has a symmetry 
under a rotation through 6 # x, then f xy = 0 for any choice of axes, 
which implies that all axes (through the origin) in the plane are principal 
axes. 


Section 9.4: Two basic types of problems 


9.9. 


9.10. 


9.12. 


Striking a rectangle x 


A flat uniform rectangle with sides of length a and b sits in space, not 
rotating. You strike the corners at the ends of one diagonal, with equal 
and opposite forces (see Fig. 9.42). Show that the resulting initial œ 
points along the other diagonal. 


Rotating stick x« 


A stick of mass m and length £ spins with frequency w around an axis, 
as shown in Fig. 9.43. The stick makes an angle @ with the axis and is 
kept in its motion by two strings that are perpendicular to the axis. What 
is the tension in the strings? (Ignore gravity.) 


. Stick under a ring «« 


A stick of mass m and length £ is arranged to have its CM motionless 
while its top end slides in a circle on a frictionless ring, as shown in 
Fig. 9.44. The stick makes an angle 0 with the vertical. What is the 
frequency of this motion? 


Circular pendulum xx 


Consider a pendulum made of a massless rod of length £ with a point 
mass m on the end. Assume conditions have been set up so that the mass 
moves in a horizontal circle. Let 0 be the constant angle the rod makes 
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with the vertical. Find the frequency, &2, of this circular motion in three 
different ways. 


(a) Use F = ma. This method works only if you have a point mass. 
With an extended object, you have to use one of the following 
methods involving torque. 

(b) Use t = dL /dt with the pendulum pivot as the origin. 

(c) Use t = dL /dt with the mass as the origin. 


9.13. Rolling in a cone xx 


A fixed cone stands on its tip, with its axis in the vertical direction. 
The half-angle at the vertex is 0. A small ring of radius r rolls without 
slipping on the inside surface. Assume that conditions have been set up 
so that (1) the point of contact between the ring and the cone moves in 
a circle at height h above the tip, and (2) the plane of the ring is at all 
times perpendicular to the line joining the point of contact and the tip 
of the cone (see Fig. 9.45). What is the frequency, Q, of this circular 
motion? Work in the approximation where r is much smaller than the 
radius of the circular motion, / tan 0. 


Section 9.5: Euler 5 equations 


9.14. Tennis racket theorem +*+ Fig. 9.45 
If you try to spin a tennis racket (or a book, etc.) around any of its 
three principal axes, you will find that different things happen with the 
different axes. Assuming that the principal moments (relative to the 
CM) are labeled according to J7 > h > h (see Fig. 9.46), you will 
find that the racket will spin nicely around the x; and X3 axes, but it 
will wobble in a rather messy manner around the $z axis. Verify this 
claim experimentally with a book (preferably lightweight, and wrapped 
with a rubber band), or a tennis racket, if you happen to study with one 
on hand. 

Now verify this claim mathematically. The main point here is that 
you can’t start the motion off with w pointing exactly along a principal 
axis. Therefore, what you want to show is that the motion around the 
X, and X3 axes is stable (that is, small errors in the initial conditions 
remain small), whereas the motion around the X2 axis is unstable (that 
is, small errors in the initial conditions get larger and larger, until the 
motion eventually doesn’t resemble a rotation around the x2 axis). 
Your task is to use Euler’s equations to prove these statements about 
stability. (Exercise 9.33 gives another derivation of this result.) 


Fig. 9.46 


24 Ifyou try for a long enough time, you’ll probably be able to get the initial œ pointing close enough 
to X2 so that the book will remain rotating (almost) around X9 for the entire time of its flight. There 
is, however, undoubtedly a better use for your time, as well as for the book... 
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9.17. 


> frictionless 
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9.16. 


9.18. 


9.19. 


Free-top angles + 


In Section 9.6.2, we showed that for a free symmetric top, the angular 
momentum L, the angular velocity @, and the symmetry axis $; all lie 
in a plane. Let «œ be the angle between x3 and L, and let £ be the angle 
between x3 and @ (see Fig. 9.47). Find the relationship between a and 
f in terms of the principal moments, J and /3. 


Staying above xx 


A top with J = nh, where n is a numerical factor, is initially spin- 
ning around its x3 axis with angular speed w3. You apply a strike at 
the bottom point, directed into the page as shown in Fig. 9.48 (imag- 
ine that you hit a little peg protruding from the bottom). What is the 
largest value of n for which the total œ vector never dips below the 
horizontal axis in the subsequent motion, no matter how hard your 
strike is? 


Section 9.7: Heavy symmetric top 


The top +» 


This problem deals with the spinning top example in Section 9.7.5. It 
uses the result for Q in Eq. (9.82). 


(a) What is the minimum value of w3 for which circular precession 
is possible? 

(b) Find approximate expressions for Q+ when @3 is very large. 
The phrase “very large,” however, is rather meaningless. What 
mathematical statement should replace it? 


Many tops +» 


N identical disks and massless sticks are arranged as shown in Fig. 9.49. 
Each disk is glued to the stick on its left and attached by a pivot to the 
stick on its right. The leftmost stick is attached by a pivot to a pole. 
You wish to set up circular precession with the sticks always forming a 
straight horizontal line. What should the relative angular speeds of the 
disks be so that this is possible? 


Heavy top on a slippery table x» 


Solve the problem of a heavy symmetric top spinning on a fric- 
tionless table (see Fig. 9.50). You may do this by simply stating 
what modifications are needed in the derivation in Section 9.7.3 (or 
Section 9.7.4). 


9.20. 


9.21. 


922, 


9.23. 


9.24. 


9.8 Problems 


Fixed highest point «« 


Consider a top made of a uniform disk of radius R, connected to the 
origin by a massless stick (which is glued perpendicular to the disk) of 
length £. Paint a dot on the top at its highest point, and label this as 
point P (see Fig. 9.51). You wish to set up uniform circular precession, 
with the stick making a constant angle 0 with the vertical (6 can be 
chosen to be any angle between zero and 7), and with P always being 
the highest point on the top. What is the frequency of precession, Q? 
What relation between R and £ must be satisfied for this motion to be 
possible? 


Basketball ona rim xx» 


A basketball rolls without slipping around a basketball rim in such a way 
that the contact points trace out a great circle on the ball, and the CM 
moves around in a horizontal circle with frequency 2. The radii of the 
ball and rim are r and R, respectively, and the ball’s radius to the contact 
point makes an angle 0 with the horizontal (see Fig. 9.52). Assume that 
the ball’s moment of inertia around its center is Z = (2/ 3)mr2. Find Q. 


Rolling lollipop +*+ 

Consider a lollipop made of a solid sphere of mass m and radius r that is 
radially pierced by a massless stick. The free end of the stick is pivoted 
on the ground (see Fig. 9.53). The sphere rolls on the ground without 
slipping, with its center moving in a circle of radius R with frequency Q. 
What is the normal force between the ground and the sphere? 


Rolling coin ++ 


Initial conditions have been set up so that a coin of radius r rolls around 
in a circle, as shown in Fig. 9.54. The contact point on the ground traces 
out a circle of radius R, and the coin makes a constant angle 0 with 
the horizontal. The coin rolls without slipping (assume that the friction 
with the ground is as large as needed). What is the frequency, Q, of 
the circular motion of the contact point on the ground? Show that such 
motion exists only if R > (5/6)rcosé. 


Wobbling coin x+» 


If you spin a coin around a vertical diameter on a table, it will slowly 
lose energy and begin a wobbling motion. The angle between the coin 
and the table will gradually decrease, and eventually it will come to rest. 
Assume that this process is slow, and consider the motion when the coin 
makes an angle 0 with the table (see Fig. 9.55). You may assume that 
the CM is essentially motionless. Let R be the radius of the coin, and let 
Q be the frequency at which the contact point on the table traces out its 
circle. Assume that the coin rolls without slipping. 


419 


Fig. 9.51 


Fig. 9.52 


Fig. 9.53 


420 Angular momentum, Part II (General L) 


(a) Show that the angular velocity of the coin is @ = Qsin@ Xo, 
where X2 points upward along the coin, directly away from the 


contact point. 
| & 
Q=2 : 9.95 
Rsind a 


(b) Show that 

(c) Show that Abe (or Tom, Franklin, George, John, Dwight, Susan, 
or Sacagawea) appears to rotate, when viewed from above, with 
frequency 


g 
Rsinð ` 


2(1 — cos 0) (9.96) 


9.25. Nutation cusps «« 


(a) Using the notation and initial conditions in the example in 
Section 9.7.7, prove that kinks occur in nutation if and only if 
AQ = +Q,. A kink is where the plot of 0 (£) vs. @(f) has a 
discontinuity in its slope. 

(b) Prove that these kinks are in fact cusps. A cusp is a kink where 
the plot reverses direction in the ¢-0 plane. 


9.26. Nutation circles x» 


(a) Using the notation and initial conditions in the example in 
Section 9.7.7, and assuming that œw >> AQ > Qs, find (approx- 
imately) the direction of the angular momentum right after the 
sideways kick takes place. 

(b) Use Eq. (9.91) to show that the CM then travels (approximately) in 
acircle around L. And show that this “circular” motion is just what 
you would expect from the free-top reasoning in Section 9.6.2, in 
particular, Eq. (9.55). 


Additional problems 


9.27. Rolling without slipping « 


The standard way that a ball rolls without slipping on a flat surface is 
for the contact points on the ball to trace out a vertical great circle on 
the ball. Are there any other ways that a ball can roll without slipping? 


9.28. Rolling straight? +x 


In some situations, such as the rolling-coin setup in Problem 9.23, the 
velocity of the CM ofa rolling object changes direction as time goes by. 
Consider a uniform sphere that rolls on the ground without slipping (pos- 
sibly in the nonstandard way described in the solution to Problem 9.27). 


9.29. 


9.30. 


9.9 Exercises 


Is it possible for the CM’s velocity to change direction? Justify your 
answer rigorously. 


Ball on paper +*+» 


A uniform ball rolls without slipping on a table (possibly in the non- 
standard way described in the solution to Problem 9.27). It rolls onto a 
piece of paper, which you then slide around in an arbitrary (horizontal) 
manner. You may even give the paper abrupt, jerky motions, so that the 
ball slips with respect to it. After you allow the ball to come off the 
paper, it will eventually resume rolling without slipping on the table. 
Show that the final velocity equals the initial velocity. 


Ball on a turntable xxx» 


A uniform ball rolls without slipping on a turntable (possibly in the 
nonstandard way described in the solution to Problem 9.27). As viewed 
from the inertial lab frame, show that the ball moves in a circle (not 
necessarily centered at the center of the turntable) with a frequency 
equal to 2/7 times the frequency of the turntable. 


9.9 Exercises 


Section 9.1: Preliminaries concerning rotations 


9.31. 


Rolling wheel xx 


A wheel with spokes rolls without slipping on the ground. A stationary 
camera takes a picture of it as it rolls by, from the side. Due to the nonzero 
exposure time of the camera, the spokes generally appear blurred. At 
what locations in the picture do the spokes not appear blurred? Hint: A 
common incorrect answer is that there is only one point. 


Section 9.2: The inertia tensor 


9.32. 


Inertia tensor x 


Calculate the r x (@ x r) double cross product in Eq. (9.7) by using the 
vector identity, 


A x (B x C) = B(A- C) — C(A - B). (9.97) 


Section 9.3: Principal axes 


9.33. 


Tennis racket theorem «« 


Problem 9.14 gives the statement of the “tennis racket theorem,” and the 
solution there involves Euler’s equations. Demonstrate the theorem here 
by writing down the conservation of Z? and conservation of £ statements 
and then using them in the following way. Produce an equation that says 
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9.34. 


9.35. 


9.36. 


that if œ and w3 (or w; and w2) start small, then they must remain 
small. And produce the analogous equation that says that if œw; and w3 
start small, then they need not remain small. (It’s another matter to show 
that they actually won t remain small. But let’s not worry about that here. 
Anything that can happen generally does happen in physics.) 


Moments for a cube xx 


In the spirit of Appendix E, calculate the principal moments for a solid 
cube of mass m and side length £, with the coordinate axes parallel to 
the edges of the cube, and the origin at a corner. 


Tilted moments +» 


(a) Consider a planar object in the x-y plane. If the x and y axes are 
principal axes, use the rotation matrix in Eq. (9.94) to show that 
the moment of inertia around the x’ axis, which makes an angle 
6 with the x axis, is Iv = Iy cos?8 + J, sin76. 

Consider a general three-dimensional object whose principal axes 
are the x, y, and z axes. Consider another axis that points along 
the unit vector (œ, 8, y). Show that the moment of inertia around 
this axis is œ? ly + B71, + y7J,. Hint: The cross product, discussed 
in Appendix B, provides a nice method of calculating the distance 
from a point to a line. 


(b 


wn 


Quadrupole x+ 
Consider an arbitrarily shaped body of mass m whose CM is at the 
origin. Using the law of cosines, the gravitational potential of a mass M 
at position R is 
GM d 
VR) = f — 
JR? +r? — 2Rr cos B 

where the integration runs over the volume of the body, and £ is the 


angle that the position vector r of an arbitrary point in the body makes 
with the vector R. 


(9.98) 


(a) Assuming that all points in the body satisfy r « R, show that an 
approximate expression for the potential is 


GM GM 
VR) © = - Sax | 7Beos’B—1)dm, (9.99) 
and then show that this can be written as 
GMm GM 


V(R) x — 


R T zp fl +h +h — 31r), (9.100) 


where /,, />, and J; are the moments around any three orthogonal 
axes (which we’ll take to be principal axes in part (b)), and Zp is 
the moment around the axis along the R vector. 


9.9 Exercises 


(b) Consider now a planet with rotational symmetry around $3, such 
as the earth which bulges at the equator due to the spinning. Using 
the result of Exercise 9.35, show that the potential in Eq. (9.100) 
can be written as 


R zp E D(1 — 3 cos?0), (9.101) 


where J = J, = h, and 8 is the angle that R makes with x3. 


REMARK: The second term here is known as the quadrupole term. In elec- 
trostatics, a dipole consists of equal and opposite charges separated by some 
distance d. At a given point far away, the forces from these two charges par- 
tially cancel. But they don’t exactly cancel, because the electrostatic force 
(which behaves like the gravitational force, with a 1 /r? law) depends on 
the distance and direction to the charges, and the two charges are located at 
different points. If two dipoles are oriented in opposite directions and then 
placed side by side, a distance d apart (so that there are charges of q and —q 
alternating around the corners of a square) then the forces from the dipoles 
nearly cancel. But again, the cancellation isn’t exact, because the dipoles are 
located in different places. This distribution of charge is called a quadrupole, 
and it is similar to the situation with a spinning (and bulging) planet, because 
such a planet consists of a spherical ball (which gives rise to the first term in 
Eq. (9.101)), plus a region of “negative” mass superimposed on the ball at 
the poles and a region of positive mass superimposed on the ball at the equa- 
tor. Looking at the above square of charges from far out along the diagonal 
containing the two negative charges is similar to looking at the earth from 
far out along its rotation axis. & 


Section 9.4: Two basic types of problems 


9.37. 


9.38. 


Sphere and points « 


A uniform sphere of mass m and radius R rotates around the vertical axis 
with angular speed w. Two particles of mass m/2 are brought close to the 
sphere at diametrically opposite points, at an angle 0 from the vertical, 
as shown in Fig. 9.56. The masses, which are initially essentially at rest, 
abruptly stick to the sphere. What angle does the resulting œ make with 
the vertical? (If you want, you can check your answer by showing that 
the @ that makes this angle maximum is sin~! ./7/9 ~ 61.9°.) 


Striking a triangle +x 

Consider the rigid object in Fig. 9.57. Four masses lie at the points shown 
on a rigid isosceles right triangle with hypotenuse length 4a. The mass 
at the right angle is 3m, and the other three masses are m. Label them 
A, B, C, D, as shown. Assume that the object is floating freely in outer 
space. Mass C is struck with a quick blow, directed into the page. Let 
the impulse have magnitude f F dt = P. What are the velocities of all 
the masses immediately after the blow? 
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Striking another triangle xx 


Consider the rigid object in Fig. 9.58. A uniform stick of mass M lies 
along the base of an isosceles triangle, and a mass m lies at the oppo- 
site vertex. The base has length b, and the height is A. Assume that the 
object is floating freely in outer space. The right end of the stick is struck 
with a quick blow, directed into the page. Let the impulse have magni- 
tude f F dt = P. What is the velocity of mass m immediately after the 
blow? 


Sticking sticks x+» 


Two identical uniform sticks spin around their stationary centers with 
equal angular speeds, as shown in Fig. 9.59. The bottom stick is slowly 
raised until its top end collides with the center of the top stick. The sticks 
stick together to form a rigid “T.” Assume that the collision takes place 
when the top stick lies in the plane of the paper. Immediately after the 
collision, one point (in addition to the CM) on the T will instantaneously 
be at rest. Where is this point? 


Circling stick again +x 


Solve the problem in Section 9.4.2 again, but now use the CM as the 
origin. 


Pivot and string ++ 


A stick of mass m and length £ spins with frequency w around an axis, 
as shown in Fig. 9.60. The stick makes an angle @ with the axis. One 
end is pivoted on the axis, and the other end is connected to the axis 
by a string that is perpendicular to the axis. What is the tension in the 
string, and what is the force that the pivot applies to the stick? (Ignore 
gravity.) 


Rotating sheet +x 


A uniform flat rectangular sheet of mass m and side lengths a and b 
rotates with angular speed w around a diagonal. What torque is required? 
Given a fixed area A, what should the rectangle look like if you want 
the required torque to be as large as possible? What is the upper bound 
on the torque? 


Rotating axle x+ 


Two wheels of mass m and moment of inertia J are connected by a 
massless axle of length £, as shown in Fig. 9.61. The system rests on a 
frictionless surface, and the wheels rotate with frequency w around the 
axle. Additionally, the whole system rotates with frequency Q around 
the vertical axis through the center of the axle. What is the largest value 
of Q for which both wheels stay on the ground? 


9.45. 


9.9 Exercises 


Stick on a ring ++ 


(a) A stick of mass m and length 2r is arranged to make a constant 
angle 0 with the horizontal, with its bottom end sliding in a circle 
on a frictionless ring of radius r, as shown in Fig. 9.62. What is 
the frequency of this motion? It turns out that there is a minimum 
@ for which this motion is possible; what is it? 

(b) Ifthe radius of the ring is now R, what is the largest value of r/R 
for which this motion is possible for @ —> 0?7° 


Section 9.6: Free symmetric top 
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Slightly wobbling « 


A coin of mass m and radius R is initially spinning around the axis 
perpendicular to its plane, with angular speed w3. It is supported by a 
pivot at its center. You apply an infinitesimal downward strike at a point 
on the rim, as shown in Fig. 9.63, giving the coin an infinitesimal angular 
velocity component w, in the plane of the coin. When the plane of the 
coin returns to its original plane for the first time, what (approximately) 
is the orientation of the coin? 


Original orientation +x 


A coin of mass m and radius R is initially spinning around the axis per- 
pendicular to its plane, with angular speed w3. It is supported by a pivot 
at its center. You apply a (nonzero) downward strike at a point on the 
rim, as shown in Fig. 9.63, giving the coin an angular velocity com- 
ponent w1 in the plane of the coin. Consider the nth time the plane of 
the coin returns to its original plane. What is the minimum value of n 
for which it is possible for the coin to have exactly the same orienta- 
tion as when it started? What should œw, be in terms of w3 to achieve 
this? 


Seeing tails «x 


A coin (floating in outer space) of mass m and radius R is initially spin- 
ning around the axis perpendicular to its plane, with angular speed w3. 
You view the coin directly from above, and you apply a downward 
strike at a point on the rim, as shown in Fig. 9.63. What is the minimum 
impulse, f F dt, you must apply in order to be able to barely see the 
underside of the coin at some later time in its wobbling motion? Assum- 
ing that you apply this minimum impulse, how far will the center of the 
coin have moved by the time you are able to see the underside? 


25 You can also play around with both parts of this problem for the setup where the stick swings 


around below the ring, with its top end running along the ring. 
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Fig. 9.64 


Fig. 9.65 


Fig. 9.66 
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Flipping a coin +» 

Imagine flipping an initially horizontal heads-up coin. Ifthe initial angu- 
lar velocity is horizontal, then the coin will rotate around this horizontal 
diameter the entire time in the air. The fraction of the flight time that 
the coin spends heads-up is therefore 1/2. In practice, however, it is 
impossible to make the initial œ be exactly horizontal, so assume that 
the initial components are w; and w3, with w3 < w1. Show that in 
this limit, the fraction of the flight time that the coin spends heads-up is 
1/2 + (4@3)/(1@7 ).7° 

Dipping low +» 

A top with J = 34; floats in outer space and initially spins around its 
x3 axis with angular speed w3. You apply a strike at the bottom point, 
directed into the page, as shown in Fig. 9.64, producing an angular 
velocity component, w,, directed to the right. What should w; be in 
terms of w3 in order to have the total œ vector dip as far below the 
horizontal as possible in the subsequent motion? 


Section 9.7: Heavy symmetric top 


Rolling lollipop « 

Consider a lollipop made of a solid sphere of mass m and radius r that 
is radially pierced by a massless horizontal stick. The free end of the 
stick is pivoted on a pole (see Fig. 9.65), and the sphere rolls on the 
ground without slipping, with its center moving in a circle of radius R 
with frequency Q. What is the normal force between the ground and the 
sphere? 


Horizontal œw «« 


A top (with mass m, moments J and J3, and distance £ from the pivot 
to CM) undergoes uniform precession, with its axis making a constant 
angle 0 with the negative z axis, as shown in Fig. 9.66. If conditions are 
set up so that the top’s œ is always horizontal, what is the frequency of 
precession? Can such motion exist if the top is up above the horizontal, 
making an angle 0 with the positive z axis? 


26 Tf the coin starts truly horizontal (or is at least not biased to tilt in any particular direction on 


average, which is still a condition that seems difficult to guarantee), and if you catch the coin in 


your hand to reduce random table-bouncing effects, then the result of this exercise implies that 


the coin is biased to come up heads. This effect is analyzed in detail in a paper by Persi Diaconis, 


Susan Holmes, and Richard Montgomery (to be published), who estimate that the probability of 


obtaining heads for a normally flipped coin is about 0.51. 


9.53. 


9.54. 


9.55. 


9.56. 


9.9 Exercises 


Sliding lollipop +x» 

Consider a lollipop made of a solid sphere of mass m and radius r that is 
radially pierced by a massless stick. The free end of the stick is pivoted 
on the ground, which is frictionless (see Fig. 9.67). The sphere slides 
along the ground, with the same point on the sphere always touching 
the ground. The center moves in a circle of radius R with frequency 
Q. Show that the normal force between the ground and the sphere is 
N =mg+mrQ?, which is independent of R. Solve this by: 


(a) Using an F = ma argument.”’ 
(b) Using the more complicated t = dL/dt argument. 


Rolling wheel and axle xxx 


A massless axle has one end attached to a wheel (a uniform disk of mass 
mand radius r), with the other end pivoted on the ground (see Fig. 9.68). 
The wheel rolls on the ground without slipping, with the axle inclined 
at an angle 0. The point of contact on the ground traces out a circle with 
frequency Q. 


(a) Show that w points horizontally to the right (at the instant shown), 
with magnitude w = Q/tané. 
(b) Show that the normal force between the ground and the wheel is 


1 3 
N = mg cos’6 + mrQ? G cos @ sin?@ + 5 cos’) . (9.102) 


Ball under a cone x+» 


A hollow ball (with J = (2/3)mR?) rolls without slipping on the inside 
surface of a fixed cone, whose tip points upward, as shown in Fig. 9.69. 
The angle at the vertex of the cone is 60°. Initial conditions have been 
set up so that the contact point on the cone traces out a horizontal circle 
of radius £ at frequency Q, while the contact point on the ball traces out 
a circle of radius R/2. Assume that the coefficient of friction between 
the ball and the cone is sufficiently large to prevent slipping. What is the 
frequency of precession, Q? What does it reduce to in the limits £ >> R 
and £ — (/3/2)R (the ball has to fit inside the cone, of course). What 
relation between £ and R must be satisfied if the setup is to work with a 
solid ball with J = (2/5)mR?? 


Ballin a cone «*««« 


A ball (with J = (2/5)MR?) rolls without slipping on the inside surface 
of a fixed cone, whose tip points downward. The half-angle at the vertex 
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27 This method happens to work here, due to the unusually nice nature of the sphere’s motion. For 
more general motion (for example, in Problem 9.22, where the sphere is spinning), you must use 
t = dL/dt. 
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of the cone is 0. Initial conditions have been set up so that the contact 
point on the cone traces out a horizontal circle of radius £ >> R, at 
frequency Q, while the contact point on the ball traces out a circle of 
radius r (not necessarily equal to R, as would be the case for a great 
circle). Assume that the coefficient of friction between the ball and the 
cone is sufficiently large to prevent slipping. What is the frequency of 
precession, Q? It turns out that Q can be made infinite if r/R takes 
on a particular value; what is this value? Work in the approximation 
where R < £. 


Nutation loops ++ 


In Fig. 9.38, the loops don’t quite intersect each other in the AQ = 4Q, 
case, but they very much do in the AQ = 102, case (a given loop there 
actually intersects the two on either side). Show that the value of k for 
which adjacent loops in the AQ = kQs case barely touch each other (as 
shown in Fig. 9.70) is k © 4.6033. You will have to solve something 
numerically. Hint: The curve is vertical at the relevant points. 


Solutions 


Many different w’s 
We want to find all the vectors œ with the property that œ x aX = vy. Since w is 
orthogonal to this cross product, œ must lie in the x-z plane. We claim that if œ makes 


an angle 6 with the x axis and has magnitude v/(a sin 0), then it satisfies w x aX = vy. 
Indeed, 


æ x aX = |@||aX| sin Oy = vy. (9.103) 


Alternatively, note that such an w can be written as 


v . v v 
@ = —— (cos0,0, sin 0) = (—,, 0, =). (9.104) 
asin@ atan@ a 


Only the z component here is relevant in the cross product with aX, so we have w x aX = 
(v/a)? x aX = vy. The answer to the problem is therefore that œ must take the form 
given in Eq. (9.104). 

It makes sense that the magnitude of w is v/(a sin 0), because if we drop a perpen- 
dicular from the particle to the line of œw, we see that the particle may be considered 
to be instantaneously traveling in a circle of radius r = asin@ around w, at speed 
v. And so we have v = wr, as we should. Whether the particle actually does travel 
in this circle is irrelevant. The past and future motion doesn’t matter in finding the 
instantaneous w. All we need to know is the velocity at the given instant. 

A few possible w’s are drawn in Fig. 9.71. Technically, it is possible to have m < 
0 < 27, but then the v/(a sin 8) coefficient in Eq. (9.104) is negative, which means 
that w really points upward in the x-z plane (physically, œ must point upward if the 
particle’s velocity is to be in the positive y direction). So we'll assume 0 < 6 < m. 
And since the v/a value of w, in Eq. (9.104) is independent of 0, all of the possible 
@’s look like those in Fig. 9.72. 

For 0 = 2/2, we have w = v/a, which makes sense. If 0 is very small, then w 
is very large, because w « 1/sin@. This also makes sense, because the particle is 
(instantaneously) traveling around in a very small circle at the given speed v. 
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Remark: The point of this problem is that the particle may be in the process of having 
its position vector trace out a cone around one of many possible axes, or perhaps it may 
be undergoing a more complicated motion. If we are handed only the given information 
on position and velocity, then it is impossible to determine which of these motions is 
happening. And it is likewise impossible to uniquely determine w. This is true for a 
collection of points that lie on at most one line through the origin. If the points, along 
with the origin, span more than a 1-D line, then @ is in fact uniquely determined (see 
Footnote 9). & 


9.2. Fixed points on a sphere 


FIRST SOLUTION: For the purposes of Theorem 9.1, we need only show that two points 
end up where they started for an infinitesimal transformation. But since it’s possible 
to prove this for a general transformation, we’ll consider the general case here. 
Consider the point A that ends up farthest away from where it started.” Label the 
ending point B. Draw the great circle, C4z, through A and B. Draw the great circle, C4, 
that is perpendicular to C4g at A; and draw the great circle, Cg, that is perpendicular 
to C4g at B. We claim that the transformation must take C4 to Cg. This is true for the than A 
following reason. The image of C4 is certainly a great circle through B. And this great B 
circle must be perpendicular to C4g, because otherwise there would exist another point 
that ended up farther away from its starting point than A did (see Fig. 9.73). Sincethere . 
is only one great circle through B that is perpendicular to C4g, the image of C4 must impossible 
in fact be Cz. image of C4 
Now consider the two points, Pı and P2, where C4 and Cg intersect (any two great 
circles intersect in two points). Look at Pı. The distances P4 and PB are equal, 
because C4g makes equal angles (namely 90°) with C4 and Cg. Therefore, the point A 
P; is not moved by the transformation. This is true because P4 ends up on Cg (because 
Cz is the image of C4, on which P started), and if it ended up at a point other than P1, Fig. 9.73 
then its final distance from B would be different from its initial distance from A. This 
would contradict the fact that distances are preserved on the rigid sphere. Likewise for 
P . We have therefore found our two desired points. 
Note that for a noninfinitesimal transformation, every point on the sphere may move 
at some time during the transformation. But what we’ve just shown is that two points 
end up back where they started, even if they’ve moved in the interim. 


moves farther 


Cp 


Ca 


SECOND SOLUTION: [In the spirit of the above solution, we can give a simpler solution, 
but one that is valid only in the case of infinitesimal transformations. 

Pick any point A that moves during the transformation. Draw the great circle that 
passes through A and is perpendicular to the direction of A’s motion. (This direction is 
well defined, because we are considering an infinitesimal transformation, so A doesn’t 
have time to change direction.) All points on this great circle must move (if they move 
at all) perpendicular to the great circle, because otherwise their distances from A would 
change. But they cannot all move in the same direction, because then the center of 
the great circle, and hence the sphere, would move (but it is assumed to be fixed). 
Therefore, at least one point on the great circle moves in the direction opposite to the 
direction in which A moves. Therefore, by continuity, some point (and hence also its 
diametrically opposite point) on the great circle must remain fixed. 


9.3. Rolling cone 
At the risk of overdoing it, we’ll present three solutions. The second and third solutions 
are the type that might make your head hurt, so you may want to reread them after 
looking at the discussion of the angular velocity vector in Section 9.7.2. 


28 If there is more than one such point, pick any one of them. Considering that the result of this 
problem is that the net motion of the sphere is a rotation around an axis, there will in fact be a 
whole great circle of points that move farthest. 


430 Angular momentum, Part II (General L) 


FIRST SOLUTION: Without doing any calculations, we know that œw points along the 
line of contact of the cone with the table, because these are the points on the cone that 
are instantaneously at rest. And we know that as time goes by, œ rotates around in the 
horizontal plane with angular speed v/(h cos œ), because point P travels at speed v in 
a circle of radius h cos œ around the z axis. 

The magnitude of œ can be found as follows. At a given instant, P may be considered 
to be rotating in a circle of radius d = A sina around ø (see Fig. 9.74). Since P moves 
with speed v, the angular speed of this rotation is œ = v/d. Therefore, 

v 


w = — ; (9.105) 
hsina 


SECOND SOLUTION: We can use Theorem 9.3 with the following frames. S| is fixed in 
the cone; $3 is the lab frame; and S2 is the frame that (instantaneously) rotates around 
the tilted w23 axis shown in Fig. 9.75, at the speed such that the axis of the cone 
remains fixed in S2. The tip of w23 traces out a circle as it precesses around the z axis, 
so after the cone moves a little, we will need to use a new S2 frame. But at any moment, 
S2 instantaneously rotates around the axis perpendicular to the axis of the cone. 

In the language of Theorem 9.3, @1 and @ 3 point in the directions shown. We 
must find their magnitudes and then add the vectors to determine the angular velocity 
of Sı with respect to $3. First, we have 


v 
lo23|= 7> (9.106) 

1 
because point P moves (instantaneously) with speed v in a circle of radius A around 


@2,3. We now claim that 


| | v v 
012| =-= ; 
: r htana 


(9.107) 


where r is the radius of the base of the cone. This is true because someone fixed in 
S sees the endpoint of the radius (the one drawn in Fig. 9.75) move “backward” at 
speed v, because it is stationary with respect to the table. Hence, the cone must be 
spinning with frequency v/r in S2. 

The addition of @),2 and @23 is shown in Fig. 9.76. The result has magnitude 
v/(h sina), and it points horizontally because | 3|/|@1,2| = tana. 


THIRD SOLUTION: We can use Theorem 9.3 with the following frames. S| is fixed in 
the cone; and $3 is the lab frame (as in the second solution). But now let Sz be the 
frame that rotates around the (negative) z axis, at the speed such that the axis of the 
cone remains fixed in it. Note that we can keep using this same S2 frame as time goes 
by, unlike the S2 frame in the second solution. 

@ 2 and @23 point in the directions shown in Fig. 9.77. As above, we must find 
their magnitudes and then add the vectors to determine the angular velocity of Sı with 
respect to $3. First, we have 


v 


|o2,3| (9.108) 


hcosa’ 


because point P moves with speed v in a circle of radius A cos œ around w23. 

It’s a little tricker to find |@1,2|. From the point of view of someone spinning 
around with S2, the table rotates backward with frequency |w23| = v/(h cos œ), from 
Fig. 9.77 Eq. (9.108). Consider the circle of contact points on the table where the base of the 
cone touches it. This circle has a radius h/ cosa, so someone spinning around with 
S2 sees the circle move backward with speed |@2,3|(h/ cosa) = v/ cos?@ around the 
vertical. Since there is no slipping, the contact point on the cone must also move with 
this speed in S2 around the axis of the cone (which is fixed in S2). And since the radius 
of the base is r, this means that the cone rotates with angular speed v/(r cos*@) with 
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9.10 Solutions 


respect to S2. Therefore, using r = A tana, we have 


VU 


Vv 
loil = — (9.109) 


rcosza  hsing cosa ` 
The addition of w),2 and @23 is shown in Fig. 9.78. The result has magnitude 
v/(hsina), and it points horizontally because |@23|/|@1,2| = sina. 


REMARK: The difference between the œ; 2 vectors in the second and third solutions 
comes down to the following fact. Let Q be the contact point on the base of the cone. 
Consider the ratio of the distance from Q to @ 3 to the distance from P to w73. This 
ratio is 1 in the second solution, but 1/cos*q in the third solution. This implies that 
the “backward” speed of Q relative to P, as measured in frame Sy, is a factor 1/ cos?a 
larger in the third solution. And since Q is the same distance r away from 1,2 in both 
cases, we see that @ 1,2 is a factor 1/ cos2a larger in the third solution. & 


Parallel-axis theorem 


Consider one of the diagonal entries in I, say Zx = f (y? +27). In terms of the new 
variables, this equals 


he = f (C++ E+) = [z for +2 


= M(Y* +2?) fo 7), (9.110) 


as desired. We have used the fact that the cross terms vanish because, for example, 
J Yy! =Y fy’ = 0, by definition of the CM. Similarly, consider an off-diagonal entry 
in I, say Ly = — f xy. We have 


p=- ftor- fxr -fy =M- fy, oun 


where the cross terms have likewise vanished. We therefore see that all of the terms in 
I take the form of those in Eq. (9.19), as desired. 


A nice cylinder 


By symmetry, the principal axes are the symmetry axis of the cylinder, along with any 
diameters in the cross-section circle through the CM. Let the equal moments around 
the diameters be 7. Then by Theorem 9.5, if the moment around the symmetry axis 
also equals /, then every axis is a principal axis. 

Let the mass of the cylinder be M. Let its radius be R and its height be 4. Then the 
moment around the symmetry axis is MR? /2. Let D be a diameter through the CM. 
The moment around D can be calculated as follows. Slice the cylinder into horizontal 
disks of thickness dy. Let p be the mass per unit height (so p = M/h). The mass of 
each disk is then p dy, so the moment around a diameter through the disk is (p dy)R?/4 
(the usual result for a disk). Therefore, by the parallel-axis theorem, the moment of a 
disk at height y (where —h/2 < y < h/2) around D is (p dy)R?/4 + (p dy)y”. Hence, 
the moment of the entire cylinder around D is 


h/2 / pR? ) pRh pob MR Mh? 
t= “4 py) d= = i (9.112) 
-a2 \ 4 4 12 4 12 


We want this to equal MR? /2. Therefore, h = y 3R. 

As an exercise, you can show that if the origin were instead taken to be the center 
of one of the circular faces, then the answer would be h = V. 3R /2. Note that the 7 in 
Eq. (9.112) looks just like the moment of a disk around a diameter plus the moment of 
a stick around its center. This is no coincidence. The integral that yielded the Mh? /12 
term here is the same integral that we would perform for a stick. 


Fig. 9.78 
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9.6. Rotating square 


9.7. 


Label two of the masses A and B, as shown in Fig. 9.79. Let £4 be the distance along 
the axis from the CM to A’s string, and let r4 be the length of A’s string. Likewise 
for B. The force, F4, in A’s string must account for the centripetal acceleration of A. 
Hence, F4 = mr4yw*. The torque around the CM due to F4 is therefore 


t4 = mr4lsw. (9.113) 


Likewise, the torque around the CM due to B’s string is tg = mrg£ Bor, in the opposite 
direction. But the two shaded triangles in Fig. 9.79 are congruent, because they have 
the same hypotenuse and the same angle @. Therefore, ¿4 = rg and £g = r4. Hence, 
T4 = Tg, and the torques cancel. The torques from the other two masses likewise 
cancel. Note that a uniform square is made up of many sets of these squares of point 
masses, so we’ve also shown that no torque is needed for a uniform square. 


REMARK: For a general N-gon of point masses, Problem 9.8 below shows that any 
axis in the plane is a principal axis. Let’s prove this here by using the above torque 
argument. We’ll use a math trick here that involves writing a trig function (a sine) as 
the imaginary part of a complex exponential. Using Eq. (9.113), we see that the torque 
from mass A in Fig. 9.80 is t4 = ma R? sin ¢ cos d. Likewise, the torque from mass 
B is tg = mW? R? sin(o + 27 /N) cos( + 27 /N), and so on. The total torque around 
the CM is therefore 


20k 20k 
=mR o Y sin( ¢4 i 
tT = mR w > sin(¢ N )Jeos(0 N ) 


2.2N-1 

= mR w 3 iafe apk) 
2 

k=0 


2 2 
mR w Im(e”*(1 p etTi/N 4 oBTi/N 4.4 gre) ) 
2 


mR f ANRIIN _ 1 
Tm (ie? ——_— 
2 eAmi/N _ 1] 


= 0. (9.114) 


To obtain the fifth line, we summed the geometric series. And to obtain the last line, 
we used the fact that e*”’ = 1. The one exception to this result is when N = 2, because 
the denominator in the fifth line is zero (but it’s hard to have a 2-gon, anyway); this 
corresponds to the 0 4 180° restriction in Theorem 9.6. 

Note that what we’ve done here is show that }*r;£; = 0, which implies that the 
total torque is zero (which is one of the definitions of a principal axis). In terms of 
the chosen axis, this is equivalent to showing that }* xy = 0, that is, showing that the 
off-diagonal terms in the inertia tensor vanish (which is simply another definition of 


the principal axes). & 


Existence of principal axes for a pancake 


For a pancake object, the inertia tensor I takes the form in Eq. (9.8), with z = 0. 
Therefore, if we can find a set of axes for which f xy = 0, then I will be diagonal, 
and we will have found our principal axes. We can prove, using a continuity argument, 
that such a set of axes exists. 

Pick a set of axes, and write down the integral J xy = Io. If Jo = 0, then we are done. 
If Jo £0, then rotate these axes by an angle of 2/2, so that the new X is the old y, 


9.8. 


9.10 Solutions 


and the new Ẹ is the old —X (see Fig. 9.81). Write down the new integral f xy=Jz/2. 
Since the new and old coordinates are related by Xnew = Yola and Vnew = — Xold, We 
have [7/2 = — Jo. Therefore, since J xy switches sign during the (continuous) rotation 
of the axes, there must exist some intermediate angle for which the integral J xy 
is zero. 


Symmetries and principal axes for a pancake 

In view of the form of the inertia tensor in Eq. (9.8), we want to show that if a pancake 
object has a symmetry under a rotation through 6 # x, then f xy = 0 for any set of 
axes (through the origin). Take an arbitrary set of axes and rotate them through an angle 
0 # x. The new coordinates are x’ = (x cos 0 +y sin 8) andy’ = (—x sin 0 + y cos 8), 
so the new matrix entries, in terms of the old ones, are 


i= a = ly sin?0 + 2hy sin 0 cos 0 + ly cos?ð, 
l= f x? = Iy cos? — 2fy sin 8 cos@ + Lw sin?9, (9.115) 


D =— f xy = Ip sin 6 cos + Ly (cos*0 — sin?) — Ix sin 0 cos 0. 


If the object looks exactly like it did before the rotation, then Z}, = Zx, I, = Jy, and 


xx tyy 
T = Ixy. The first two of these statements are actually equivalent (as you can show), 


so we'll just use the first and third. Using Eq. (9.116), along with 1 — cos?0 = sin?8, 
these two statements give 


0 = —hy sin’@ + 2h, sin 0 cos 6 + Zy sin?0, 


0 = —hy sin @ cos @ — 2lyy sin? + Ly sin @ cos 0. (9.116) 


Multiplying the first of these by cos@ and the second by sin@, and subtracting, 
gives 


2hy sin = 0. (9.117) 


Under the assumption 0 Æ x (and 0 ¥ 0, of course), we must therefore have J, = 0. 
Our initial axes were arbitrary; hence, any set of axes (through the origin) in the plane 
is a set of principal axes. 


Remark: Ifan object is invariant under a rotation through an angle 6, then 6 must be 
of the form 6 = 27/N, for some integer N (convince yourself of this). So consider a 
regular N-gon with “radius” R and with point masses m located at the vertices. Any 
object that is invariant under a rotation through 0 = 27/N may be considered to be 
built out of regular point-mass N-gons of various sizes. Theorem 9.5 implies that all 
axes in the plane ofa regular N-gon have the same moment (the two axes in this theorem 
aren’t required to be orthogonal). Let’s demonstrate this explicitly for a regular V-gon. 
This method we’ll use here is similar to the one in the remark in Problem 9.6, except 
that now we’ll write a trig function (a cosine) as the real part of a complex exponential. 
In Fig. 9.82 the distance from mass A to the axis is r4 = R sin @. And the distance from 
B to the axis is Rg = Rsin(¢ + 27/N), and so on for the other masses. The moment 
of inertia around the axis is therefore 


I Ry in? ( g + E 
=m sin = 
N 


k=0 


2 N-1 
= = > (1 cos(26 t ZA) 


k=0 
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9.9. 


9.10. 


2 2 N-i 
_ NmR* _ mR Re( eiet4nki™)) 


2 2 k=0 
E 2 7 ws Re(e#(1 p efti/N 4 8ri/N 4.4 gee) 
NmR2 mR2 5 eN Ti/N —1 
= ans ie ae: 
= M Re (2 (Seay J) (9.118) 


where we have summed the geometric series to obtain the last line. The numerator 
in the parentheses equals e"! — 1 = 0. And if N Æ 2, the denominator is not zero. 
Therefore, if N Æ 2 (which is equivalent to the 8 Æ x restriction), we have 


_ NmR? 
$=? 


(9.119) 


which is independent of ø. This result of NmR?/2 for the common value of all the 
moments makes sense, because the perpendicular-axis theorem says that this common 
value must be half of the moment around the axis perpendicular to the plane, which is 
NmR?. & 


Striking a rectangle 

If the force is out of the page at the upper left corner and into the page at the lower 
right corner, then the torque t = r x F points upward to the right, as shown in 
Fig. 9.83, with t « (b, a). The angular momentum equals f t dt, so immediately after 
the strike, L is proportional to (b, a). But the angular momentum may also be written 
as L = (ax, lyæy), where Iy = mb? /12 and I, = ma? /12 are the principal moments. 
Therefore, we have 


l c hoy x (b.a => Wy, Oy) X ag x =H xX a b 
xx, tyy > Xs Wy i > ia p2 > @ 29), 
(9.120) 


which is the direction of the other diagonal. This result checks in the special case a = b, 
and also in the limit where one of a and b is much larger than the other. Basically, if b 
is much larger than a, then this largeness matters more in the moment of inertia around 
the x axis (which is quadratic in length) than it does in the torque in the x direction 
(which is linear in length), so there is very little rotation around the x axis. 


Rotating stick 

The angular momentum around the CM can be found as follows. Break w up into its 
components along the principal axes of the stick (which are parallel and perpendicu- 
lar to the stick). The moment of inertia around the stick is zero. Therefore, only the 
component of œ perpendicular to the stick is relevant in calculating L. This compo- 
nent is w sin @, and the associated moment of inertia is mg? /12. Hence, the angular 
momentum at any time has magnitude 


1 
L= pe esine, (9.121) 


and it points as shown in Fig. 9.84. The tip of the vector L traces out a circle in a 
horizontal plane with frequency w. The radius of this circle is the horizontal component 
of L, which is L} = L cos 8. The rate of change of L therefore has magnitude 


1 
z = øL; =wlcos8é =a (gro sin 0) cos 0, (9.122) 


and it is directed into the page at the instant shown. 


9.11. 


9.10 Solutions 


Let the tension in the strings be T. Then the torque due to the strings is t = 
2T (£/2) cos 8, directed into the page at the instant shown. Therefore, t = dL/dt gives 


1 1 
Tecos0 = o ( 7zmPosino ) cose => T = mlo? sind. (9.123) 


REMARKS: For 6 — 0, this goes to zero, which makes sense. For 9 > 7/2, it goes to 
the finite value mw? /12, which isn’t so obvious. The lever arm and |dL/dt| are both 
small in this limit, and they happen to be proportional to each other as they both head 
to zero. 

Note that if we instead had a massless stick with equal masses m/2 on the ends, 
so that the relevant moment of inertia is now mé?/4, then our answer would be T = 
(1/4)m£a? sin@. This makes sense if we write it as T = (m/2) - (€/2)sin0@ - w*, 
because each tension is responsible for simply keeping a mass m/2 moving in a circle 
of radius (€/2) sin 8 at frequency w. This simple force argument doesn’t work for the 
original stick because of internal forces in the stick. & 


Stick under a ring 

As in Problem 9.10, the angular momentum around the CM can be found by breaking 
@ up into its components along the principal axes of the stick. The reasoning is exactly 
the same, so the angular momentum at any time has magnitude 


pe pda 4, (9.124) 
12 

and it points as shown in Fig. 9.85. The change in L comes from its horizontal 
component. This has length Lcos@ and travels in a circle at frequency œw. Hence, 

|dL/dt| = wL cos 0, and it is directed into the page at the instant shown. 
The torque around the CM has magnitude mg(¢/2) sin 0, and it points into the page at 
the instant shown. (This torque arises from the vertical force from the ring. There is no 
horizontal force from the ring, because the CM does not move.) Therefore, t = dL/dt 


gives 
mgt sin @ mew sin 0 8 = 6g (9.125) 
= cos =a $ 
7 w 12 K Lcos 


REMARKS: 


1. For 0 —> 7/2, this goes to infinity, which makes sense. For © — 0, it goes to 
/6g/€, which isn’t so obvious. 

2. The motion in this problem isn’t possible if the bottom end of the stick, instead of 
the top end, slides along the ring. The magnitudes of all quantities are the same 
as in the original problem, but the torque points in the wrong direction, as you 
can check. But see Exercise 9.45 for a related setup. 

3. Since the middle of the stick is motionless, you might be tempted to treat the 
bottom half as the stick in the example in Section 9.4.2. That is, you might just 
want to plug in £/2 as the length in Eq. (9.38). This, however, does not yield the 
answer in Eq. (9.125). The error is that there are internal forces in the stick that 
provide torques. If a pivot were placed at the CM of the stick (connecting the two 
halves) in this problem, then the stick would not remain straight. 

4. If we replace the stick by a massless string with equal masses m/2 on the ends, 
then the relevant moment of inertia is m£? /4, and we obtain w = y/ 2g/(£cos@). 
This is in fact simply the œ = ,/g/[(€/2) cos 0] answer for a point-mass circular 
pendulum of length £/2 (see Problem 9.12), because the middle of the string 
is motionless, and because the flexible string can’t provide the internal torques 
mentioned in the previous paragraph. & 
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9.12. Circular pendulum 


(a) The forces on the mass are gravity and the force from the rod (the tension), which 
points along the rod (see Fig. 9.86).? Since there is no vertical acceleration, 
we have T cos@ = mg. The unbalanced horizontal force from the tension is 
therefore T sin 0 = mg tan 6. This force accounts for the centripetal acceleration, 


m(é sin Oz. Hence, 
/ & 
ca Lcosé~ (2.126) 


For 6 ~ 0, this is the same as the ./g/@ frequency for a simple pendulum. 

For 0 © 7/2, it goes to infinity, which makes sense. Note that 0 must be less 

Fig. 9.86 than 2/2 for circular motion to be possible. (This restriction doesn’t hold for a 
spinning top with extended mass.) 

(b) The only force that applies a torque relative to the pivot is the gravitational force, 
so the torque is tT = mg£ sin 0, directed into the page at the instant shown in 
Fig. 9.87. 

At this moment, the mass has a speed (£sin@)Q, directed into the page. 
Therefore, L = r x p has magnitude m£? Q sin 8 and is directed upward to the 
right, as shown. The tip of L traces out a circle of radius L cos 0, at frequency Q. 
Therefore, dL/dt has magnitude QL cos 6 and is directed into the page. Hence, 
t = dL/dt gives mgl sin @ = Q(mé?Q sin @) cos 0, which yields Eq. (9.126). 

(c) The only force that applies a torque relative to the mass is the force from the pivot, 
which has two components (see Fig. 9.88). The vertical piece is mg. Relative to 
the mass, this provides a torque of mg(£ sin @), directed into the page. There is 
also the horizontal piece, which accounts for the centripetal acceleration of the 
mass, so it equals m(£ sin )Q2. Relative to the mass, this provides a torque of 
m£ sin 6 Q? (£ cos 6), directed out of the page. 

Fig. 9.87 Relative to the mass, there is no angular momentum. Therefore, dL/dt = 0, 

and so there must be no torque. This means that the above two torques must 

cancel, which gives mg(€ sin 8) = m£ sin 0 Q?(£ cos 0). This yields Eq. (9.126). 

mlQ2sin® f mg In problems that are more complicated than this one, it is often easier to work 

E with the fixed pivot (if there is one) as the origin, instead of the CM, because 

then you don’t have to worry about the messy pivot forces contributing to the 
torque. 


9.13. Rolling in a cone 
The forces on the ring are gravity (mg), the normal force (N) from the cone, and the 
friction force (F) pointing up along the cone (or perhaps down along the cone if F 
turns out to be negative, but we’ll find that it won’t). Since there is no net force in the 
vertical direction, we have 


Fig. 9.88 N sin + F cos0 = mg. (9.127) 


The inward horizontal force accounts for the centripetal acceleration, which gives 
Ncos6 — F sin 0 = m(htan 0). (9.128) 
Solving the previous two equations for F gives 


F = mg cos6 — mQ?(htan@) sin 6, (9.129) 


2° The force from the rod must point along the rod because it is massless. If there were a tangential 
force on the mass, then Newton’s third law would say that there would also be a tangential force 
on the rod. This would then produce a nonzero torque on the rod (relative to the pivot) and hence 
an infinite angular acceleration, because there is no gravitational torque on the (massless) rod to 
counter it. 
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with upward along the cone taken to be positive (this is just a rearrangement of the 
F = ma equation along the cone). The torque on the ring (relative to its CM) is due 
only to this F, because gravity provides no torque, and N points though the center of 
the ring (by the second assumption in the problem). Therefore, the torque points out 
of the page with magnitude 


t=rF = r(mg cos 6 — mQ7h tan @ sin 6). (9.130) 


We must now find dL/dt. Since we are assuming r < Atan 0, the frequency of the 
spinning of the ring (call it œ) is much greater than the frequency of precession, Q. 
We will therefore neglect the latter in finding L. In this approximation, L (relative 
to the CM) has magnitude mr, and it points downward along the cone (for the 
direction of motion shown in Fig. 9.45). The horizontal component of L has magnitude 
L, =Lsin@, and it traces out a circle at frequency Q. Therefore, dL/dt points out of 
the page with magnitude 

dL 


alles QL, = QLsind = (mrw) sind. (9.131) 


The nonslipping condition is re = (htan@)Q,*° which gives œ = (htan6)Q/r. 
Using this in Eq. (9.131) yields 


dL 2 ; 
ae = Q mrh tan 0 sind. (9.132) 


Equating this with the torque in Eq. (9.130) gives 
1 
2= R (9.133) 
tan@ Ņ\ 2h 


Remark: Ifyou consider an object with moment of inertia Bmr? (our ring has £ = 1), 
then you can show by the above reasoning that the “2” in Eq. (9.133) is replaced by 
(1+ £). This means that if we instead have a particle sliding around a frictionless cone 
(which is equivalent to a ring with £ = 0), then the frequency is Jg/h /tan@, as you 
can check from scratch. & 


9.14. Tennis racket theorem 
ROTATION AROUND X;: If the racket is rotated (nearly) around the $; axis, then the initial 


@2 and w3 are much smaller than w;. To emphasize this, let’s relabel them as w2 > €2 
and w3 —> €3. Then Eq. (9.45) becomes (with the torque equal to zero, because gravity 
provides no torque around the CM) 
0 = a — Ae263, 
0 = é + Boies, (9.134) 
0= €3 -C W162, 
where we have defined (for convenience) 
h-I h-I h-I 
=203 , B= l 2 , C= l =. 
q h h 


Note that A, B, and C are all positive. This fact will be very important. 
Our goal here is to show that if the €’s start out small, then they remain small. 
Assuming that they are small (which is true initially), the first equation says that 


A 


(9.135) 


3° This is technically not quite correct, for the same reason that the earth spins around 366 times 
instead of 365 times in a year. But it’s valid enough in the limit of small r. 
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@| © 0, to first order in the €’s. Therefore, we may assume that @ is essentially 
constant (when the €’s are small). Taking the derivative of the second equation then 
gives 0 = ¿2 + Bw) €3. Plugging the value of €3 from the third equation into this yields 


a (Bco) e. (9.136) 


Because of the negative coefficient on the right-hand side, this equation describes 
simple harmonic motion. Therefore, €2 oscillates sinusoidally around zero. So if it 
starts small, it remains small. By the same reasoning, €3 remains small. 

We therefore see that œ ~ (w1, 0, 0) at all times, which implies that L ~ (71%1, 0,0) 
at all times. That is, L always points (nearly) along the X; direction (which is fixed 
in the racket frame). But the direction of L is fixed in the lab frame, because there is 
no torque. Therefore, the direction of X; must also be (nearly) fixed in the lab frame. 
In other words, the racket doesn’t wobble. 


ROTATION AROUND X3: This calculation goes through exactly as above, except with “1” 
and “3” interchanged. We find that if €; and e2 start small, they remain small. 


ROTATION AROUND X2: If the racket is rotated (nearly) around the X2 axis, then the 
initial w and w3 are much smaller than w2. As above, let’s emphasize this by relabeling 
them as w > €; and w3 — €3. Then as above, Eq. (9.45) becomes 


0 = éi — Aw2e3, 
0 = a) + Be\63, (9.137) 


0= 63 = Caze]. 


Our goal here is to show that if the €’°s start out small, then they do not remain small. 
Assuming that they are small (which is true initially), the second equation says that 
@2 © 0, to first order in the €’s. So we may assume that œw? is essentially constant (when 
the €’s are small). Taking the derivative of the first equation then gives 0 = ë; — Aw2¢3. 
Plugging the value of é3 from the third equation into this yields 


a= (403) eis (9.138) 


Because of the positive coefficient on the right-hand side, this equation describes 
an exponentially growing motion, instead of an oscillatory one. Therefore, €) grows 
quickly from its initial small value. So even if it starts small, it becomes large. By 
the same reasoning, €3 becomes large. Of course, once the €’s become large, then 
our assumption of @2 ~ 0 isn’t valid anymore. But once the €’s become large, we’ve 
shown what we wanted to. 

We see that w does not remain (nearly) equal to (0,@2,0), which implies that L 
does not remain (nearly) equal to (0, /2@2, 0). That is, L does not always point (nearly) 
along the X2 direction (which is fixed in the racket frame). But the direction of L is 
fixed in the lab frame, because there is no torque. Therefore, the direction of X2 must 
change in the lab frame. In other words, the racket wobbles. 


Free-top angles 
In terms of the principal axes, X1, $2, £3, we have 
@ = (wi fı + w2X2) + w3%3, and 


: : : (9.139) 
L = I (w181 + 2X2) + 3033. 


Let m1 ô; = (@1X, + w282) be the component of w orthogonal to w3. Then we have 


OL To, 
tanB=—, and tana = ——. (9.140) 
3 bax 


9.16. 


9.10 Solutions 


Therefore, 


tana I 9.141 
tng h` GT 
IfI > 4, thena > £, and we have the situation shown in Fig. 9.89. A top with this 
property is called a “prolate top.” An example is an American football or a pencil. 
If7 < J;, thena < £, and we have the situation shown in Fig. 9.90. A top with this 
property is called an “oblate top.” An example is a coin or a Frisbee. 


Staying above 
If J < J; (oblate top), then the initial x3, œ, and L vectors look like those in Fig. 9.91, 
with L between X3 and w. The œw vector traces out a cone around L as shown, so it 
always stays above the horizontal (because it starts above the horizontal). 

However, if J > 13 (prolate top), then the initial X3, œ, and L vectors look like those 
in Fig. 9.92, with œ between X3 and L. Depending on the values of 7/5 and w1 /w3 
(where w] is the component of œ orthogonal to $3), the œ cone might stay above the 
horizontal axis, or it might dip below it. We are concerned with the cutoff case where 
it just touches the axis, as shown. 

From Fig. 9.92, the half-angle of the cone is œ — £, so if the cone reaches down to 
the horizontal axis, then we have a + (œ — £) = 90°. But «œ and £ are given by 

w Li Toy 


L 
tan = —, and tana = — 


= , 9.142 
03 L3 a3 ( ) 


Therefore, the 2a — 6B = 90° condition becomes 


1 ( 124 1f 4 o 
2 tan tan = 90°. (9.143) 
ho; W3 


With n = I/h and x = w] /%3, this yields 


2tan~! (nx) — tan™! (x) = 90° 


= tan (2 tan! (nx)) = tan(90° +tan! @)) 


2nx 1 
1—n2x2 x 
=> n(n—2)x =1. (9.144) 


We see that if n < 2, there is no solution for x. But if n > 2, there is a solution. 
Therefore, the answer to the problem is n = 2. 


Remarks: Ifnis only slightly larger than 2, then the solution for x in Eq. (9.144) (and 
hence w] ) is large. This means that your strike needs to be large, in order to produce 
a large w. 

There are three interesting limits of what can happen in the large-n limit, so let’s 
list them out. For these three cases, picture a thin stick spinning around its vertical x3 
axis, and then imagine striking the bottom end with different impulses. 


o Ifw; K wand Lı < L3, then both w and L point nearly straight upwards, so 
æ traces out a very thin cone around L and is always nearly vertical. X3 is also 
always nearly vertical. 

o Ifw; < œ but Ly > L; (which is possible if n is sufficiently large), then œ 
points nearly straight upwards initially, whereas L points nearly horizontally. So 
æ traces out a very wide cone and swings down almost to the negative vertical 
direction. X3 also traces out a very wide cone. 

e If. > @3, which implies L1 >> L3, then both w and L point nearly horizon- 
tally, so œ traces out a very thin cone around L and is always nearly horizontal. 


Fig. 9.89 


St 
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$; still traces out a very wide cone, because it starts vertical (by assumption), 
independent of where w and L are. Note that your eye can’t distinguish between the 
second and third cases here, because the X3 axis is doing the same thing in both 
cases. & 


9.17. The top 


(a) In order for there to exist real solutions for Q in Eq. (9.82), the discriminant 


must be non-negative. If @ > m/2, then cos@ < 0, so the discriminant is 
automatically positive, and any value of w3 is allowed. But if@ < 2/2, then the 
lower limit on w3 is 


V/J4MIgl cos 0 
ae 
a T 
The special case of 0 = 7/2 requires a limit to be taken; it turns out that there is 


only one (noninfinite) solution, as we’ll see in part (b). Note that at the critical 
value of w3 in Eq. (9.145), Eq. (9.82) gives 


o. =02 BOY 5 MEE ae (9.146) 
+=% = 27c0s6 Vicoso ° C ` 


03 z. (9.145) 


(b) Since w3 has units, “large w3” is a meaningless description. What we really 


mean is that the fraction in the square root in Eq. (9.82) is very small compared 
with 1. That is, € = (4MIg£ cos 0)/d2o3) <_ 1. In this case, we can use 


JVl—e+1-—€/2+--- to write 


I 2MIge cos6 
ox 2 ee SERE NN, (9.147) 
2I cos Boy 


The two solutions for Q are then, to leading order in w3 (or rather, to leading 
order in €), 


VETOR) Mg£ 
x , and Qo xE, (9.148) 
Icos hax 


24+ 


These are known as the “fast” and “slow” frequencies of precession, respectively. 
Q2_ is the approximate answer we found in Eq. (9.77). It was obtained here under 
the assumption € < 1, which is equivalent to 


VJ4MIgl cos 0 . . 
w3 > a (that is, w3 >> @3). (9.149) 


3 
This, therefore, is the condition for the result in Eq. (9.77) to be a good approx- 
imation. If Z is of the same order as /3, so that they are both of order M0? 
(assuming that the top is a reasonably shaped object without any strange tails), 
and if cos@ is of order 1, then this condition can be written as @3 > /g/E, 
which is the frequency of a pendulum of length £. 


REMARKS: 


1. The Q4 solution is a fairly surprising result. Two strange features of Q4 are 
that it grows with w3, and that it is independent of g. To see what is going 
on with this precession, note that Q+ is the value of Q that makes the L1 
in Eq. (9.79) essentially equal to zero. So L points nearly along the vertical 
axis. The rate of change of L is the product of a very small horizontal radius 
(of the tiny circle the tip traces out) and a very large Q2 (assuming that w3 is 
large). The product of these equals the “medium sized” torque, Mgé sin 0. 

2. In the limit of large w3, the fast precession should look basically like the 
motion of a free top, because the L here points essentially in a fixed direc- 
tion, just as it does for a free top. And indeed, Q4 is independent of g. More 
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precisely, we can see that the value of Q+ given in Eq. (9.148) agrees with 
the free-top precession frequency, by the following reasoning. /3~3 is the 
component of L along the symmetry axis, which makes an angle 0 with the 
vertical (which is essentially the direction of L). Therefore, L has magni- 
tude L ~ 13@3/cos@, which means we can write Q} ~ L/J. This is the 
precession frequency of a free top, given in Eq. (9.55), as viewed from a 
fixed frame. 

3. We can plot the Q+ in Eq. (9.82) as functions of w3. With the definitions of 
@3 and Qo in Eqs. (9.145) and (9.146), we can rewrite Eq. (9.82) as 


a3 or 
Q= (14 1 i). (9.150) 
03 3 
Since it is easier to work with dimensionless quantities, let’s rewrite this as 
Qe 
ye =xtVx2 -1, where y4 = —, and x= a (9.151) 
Qo ®3 


A rough plot of y+ vs. x is shown in Fig. 9.93. The behavior of this graph 
for large x is found from Eq. (9.151) to be y} ~% 2x, and y_ ~ 1/(2x). You 
can show that these are equivalent to the results in Eq. (9.148). & 


9.18. Many tops Fig. 9.93 
The system is made up of N rigid bodies, each consisting of a disk and a massless stick 
glued to it on its left (see Fig. 9.94). Label these subsystems as S;, with Sı being the 
one closest to the pole. Let each disk have mass m and moment of inertia /, and let each 
stick have length £. Let the angular speeds be w;. The relevant angular momentum of 
Si is then L; = Iwi, and it points horizontally.*! Let the desired precession frequency 
be Q. Then dL;/dt has magnitude L;82 = (Jw;)Q and points into the page at the instant — 
shown in Fig. 9.49. S, 
Consider the torque t; on S;, around its CM. Let’s first look at S1. The pole provides 
an upward force of Nmg (this force is what keeps all the tops up), so it provides a torque 
of Nmgé (into the page) around the CM of S;. The downward force from the stick on 
the right provides no torque around the CM, because it acts at the CM. Likewise, the 
gravitational force on Sı provides no torque around the CM. Therefore, tı = dL, /dt Fig. 9.94 
gives Nmgl = ([@ 1), and so 


Nmge 
w = ; 
IQ 
Now look at S2. Sı provides an upward force of (N — 1)mg (this force is what keeps S2 
through Sy up), so it provides a torque of (N — 1)mgé around the CM of S2. As with 
S1, this is the only torque on S2. Therefore, t2 = dL2/dt gives (N —1)mgl = a2)Q, 


(9.152) 


and so 
(N — l)mge 
= ——_— 9.153 
w? TQ ( ) 
Similar reasoning applies to the other S;, so we arrive at 
(N + 1—i)mge 
oe 154 
i TQ (9.154) 
The œ; are therefore in the ratio 
@ a: i wy- oy =N:(N—-1):---:2:1. (9.155) 


3! We’re ignoring the angular momentum arising from the precession. This part of L points vertically 
(because the tops all point horizontally) and therefore doesn’t change. Hence, it doesn’t enter into 
t = dL/dt. 
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Note that we needed to apply t = dL/dt many times, using each CM as the origin. 
Using only the pivot point on the pole as the origin would have given us only one piece 
of information, whereas we needed N pieces. 


REMARKS: Asa double-check, we can verify that the above w’s make t = dL/dt true, 
where t and L are the total torque and angular momentum relative to the pivot on the 
pole. The CM of the entire system is (M + 1)¢/2 from the pole, so the torque due to 
gravity is 


(N + le 
e 


t = Nm, 7 (9.156) 
The total angular momentum is, using Eq. (9.154), 
L= Iwi +@2 +: +o) 
£ 
= (V+(W-1)+W-2)4+---4+241) 
£(/N(N+1 
= “(Se me a (9.157) 


Comparing this with Eq. (9.156), we see that t = LQ, that is, t = |dL/dt|. 

We can also pose this problem for the setup where all the w; are equal (call them 
w), and the goal is to find the lengths of the sticks that allow circular precession with 
the sticks always forming a straight horizontal line. We can use the same reasoning as 
above, and Eq. (9.154) takes the modified form 


(N + 1—i)mgé; 
wo = = 


9.158 
Ta : ( ) 
where £; is the length of the ith stick. Therefore, the £; are in the ratio 
£, : £2: é iiy = P elui (9.159) 
eDOCS EN N E ar Pig ` 


Again, we can verify that these ¢’s make t = dL/dt true, where t and L are the total 
torque and angular momentum relative to the pivot on the pole. As an exercise, you 
can show that the CM happens to be a distance £y from the pole. So the torque due to 
gravity is, using Eq. (9.158) to obtain £y, 


t = Nngty = Nmg(olQ/mg) = NI oQ. (9.160) 


The total angular momentum is simply L = N/w. So indeed, t = LQ = |dL/dt|. Note 
that since the sum J` 1/n diverges, it is possible to make the setup extend arbitrarily 
far from the pole. & 


Heavy top on a slippery table 

In Section 9.7.3, we looked at t and L relative to the pivot point. Such quantities are 
of no use now, because the pivot is accelerating, so it isn’t a legal choice of origin for 
applying t = dL/dt. We will therefore look at t and L relative to the CM, which is 
always a legal origin for applying t = dL /dt. 

With the CM as our origin, there are two modifications we need to make to the 
derivation in Section 9.7.3. First, the moments of inertia are now measured with 
respect to the CM, instead of the pivot point. J3 is unchanged, but the parallel-axis 
theorem gives the new J as 


=1-Me. (9.161) 


9.20. 


9.10 Solutions 


Second, the torque is modified. The only force from the frictionless floor is the normal 
force N. But N is not necessarily equal to Mg, because the CM may be accelerating 
in the vertical direction. The vertical F = ma equation is N — Mg = My, where y = 
£cos@. Taking the second derivative of y, we obtain N = Mg—M£@ sin 6 +6? cos 0). 
The torque relative to the CM therefore has magnitude 


t =NésinO = Mgt sinf — ML? sind (6 sind + Å? cos), (9.162) 


and it points in the same direction as in Section 9.7.3. Putting everything together, we 
see that Eq. (9.69) is unchanged, Eq. (9.70) has J replaced with 7’ = J — M¢?, and the 
second of Eqs. (9.70) also has an extra term and becomes 


(mee — ML (Ö sind + Å? cos 0) + I'd? cos0 — Bosĝ) sind = 16. (9.163) 


Note that if = 0, the only modification needed is the change in 7. 

If you prefer to use the Lagrangian method in Section 9.7.4, then J needs to be 
changed to 7’ as above, and the other modification comes in the kinetic energy in the 
Lagrangian. We must add on the kinetic energy of the whole object treated like a point 
mass at the CM, because so far we have included only the kinetic energy relative to our 
new origin (the CM). Since the table is frictionless, the CM can move only vertically, 
so its velocity is ý = —£6 sin @. Therefore, we have to add the term M (£0 sin 0)? /2 to 
the Lagrangian. You can show that this leads to the extra term in Eq. (9.163). 


Fixed highest point 


For the desired motion with P always the highest point, the important thing to note is 
that every point in the top moves in a fixed circle around the z axis. Therefore, œ points 
vertically at all times. Hence, if Q is the frequency of precession, we have œ = QÊ. 

Another way to see that points vertically is to view things in the frame that rotates 
with angular velocity Qz. In this frame, the top has no motion at all. It isn’t even 
spinning, because the point P is always the highest point. In the language of Fig. 9.30, 
we therefore have œw = 0, so w = Q2 + o'%3 = Q2. 

The principal moments are (with the pivot as the origin; see Fig. 9.95) 


MR? 2 MR? 
h= and J=l=h=Mé 4 x? (9.164) 


where we have used the parallel-axis theorem in finding /. The components of w along 
the principal axes are œ = Qcos@ and w = Q sin 0. Therefore (keeping things in 
terms of the general moments, J3 and Z, for now), 


L = LQ cos 6 x3 + I2 sin 0 Xp. (9.165) 


The horizontal component of L is then L1 = (B2 cos 0) sin 8 — (IQ sin 0) cos 0, so 
dL/dt has magnitude 


dL 


T = Li Q = X sind cos0 (B — I), (9.166) 


and it is directed into the page (or out of the page, if this quantity is negative). This 
must equal the torque, which has magnitude |t| = Mg£ sin 0 and is directed into the 
page. Therefore, 


Q = _ Met (9.167) 
(B — I) cos 


Fig. 9.95 
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We see that for a general symmetric top, the desired precessional motion (where the 
same “side” always points up) is possible only if the product (J3 — J) cos 8 is greater 
than zero. That is, 


6<n/2 => >i, 
6@>n/72 = &<xil. (9.168) 


For the problem at hand, J; and J are given in Eq. (9.164), so we find 


4g£ 
(R2 — 42) cos 0 ` 


Q= (9.169) 


The necessary condition for such motion to exist is therefore R > 2¢ if 6 < 1/2, or 
R < 2€if6 > 7/2. 


REMARKS: 

1. It is intuitively clear that Q should become very large as 0 —> 7/2, although it 
is by no means intuitively clear that such motion should exist at all for angles 
near 7/2. 

2. If @ > m/2 and R = 0, we simply have the circular pendulum discussed in 
Problem 9.12. And indeed, when R = 0, the result in Eq. (9.169) reduces to 
Q = ./g/€ cosa, where a = x — 0, which agrees with the result in Eq. (9.126). 

3. Q approaches a nonzero constant as 0 > 0 or 9 —> x (depending on the sign of 
13 — I), which isn’t entirely obvious. 

4. If both R and £ are scaled up by the same factor, Eq. (9.169) shows that Q 
decreases. This also follows from dimensional analysis. 

5. Assuming that 0 < 2/2 (the 0 > 2/2 case can be handled in a similar manner), 
the condition /3 > Z can be understood in the following way. If 4 = Z, then 
L « w, so L points vertically along œ. If 3 > J, then L points somewhere 
to the right of the z axis (at the instant shown in Fig. 9.95). This means that 
the tip of L is moving into the page, along with the top. This is what we need, 
because T points into the page. If, however, J3 < Z, then L points somewhere 
to the left of the z axis, so dL/dt points out of the page, and hence cannot be 
equal to t. & 


9.21. Basketball on a rim 


Assume that the precessional motion is counterclockwise when viewed from above, 

as indicated in Fig. 9.52. Let’s look at things in the frame that has the center of the 

rim as its origin and that rotates with angular velocity QÊ. In this frame, the center 

of the ball is at rest, and the rim spins clockwise with angular speed Q. If the contact 

points are to form a great circle on the ball, the ball must be spinning around the 
~ X3 (negative) X3 axis shown in Fig. 9.96. Let the frequency of this spinning be w’ (in the 
language of Fig. 9.30). Then the nonslipping condition says that w'r = QR, and so 
œ = QR/r. Therefore, the total angular velocity of the ball with respect to the lab 
frame is 


contact 
points 


@ = 22 — o'î = Q2 — (R/P)QK. (9.170) 


Let us choose the center of the ball as the origin around which we calculate t and L. 
Then every axis in the ball is a principal axis with moment of inertia 7 = (2/3)mr’. 
The angular momentum is therefore 


L = Io = 12% — 1(R/r) QR. (9.171) 


9.22. 


9.10 Solutions 


Only the x3 piece has a horizontal component which contributes to dL/dt. This 
component has length L} = /(R/r)Q sin @. Therefore, dL/dt has magnitude 


dL 


2 
7 | = 8b = 32 mR sin 6, (9.172) 


and it points out of the page. 

The torque (relative to the center of the ball) comes from the force at the contact 
point. There are two components to this force. The vertical component is mg, and the 
horizontal component is m(R — r cos 0) Q? (pointing to the left), because the center of 
the ball moves in a circle of radius (R — r cos @). The torque therefore has magnitude 


|t| = mg(r cos 0) — m(R — r cos 0) 27 (r sin 8), (9.173) 


with out of the page taken to be positive. Equating this |t| with the |dL/dt| in Eq. (9.172) 
gives 


PaE (9.174) 


~ 3Rtan 6 —rsind— 


REMARKS: 

1. Q + œ as ð — 0, which makes sense. And Q — 0 as 6 — 7/2, which also 
makes sense. 

2. Q — oo when R = (3/5)r cos 0. But this isn’t physical, because we must have 
R > rcos@ in order for the other side of the rim to be outside the ball. 

3. You can also work out the problem for the case where the contact points trace out 
a circle other than a great circle (say, one that is inclined at an angle 6 below the 
great circle). The expression for the torque in Eq. (9.173) is unchanged, but the 
value of w and the angle of the X3 axis both change, so Eq. (9.172) is modified. 
If you want to play around with this, one thing you can show (with a bit of work) 
is that if R >> r and if you want the ball to travel around the rim infinitely fast, 
then 6 must equal tan~! ((5/2) tan 8). This is larger than 9, so the contact-point 
circle actually lies below the horizontal. & 


Rolling lollipop 

We must first find the angular velocity vector œ. Assume that the precessional motion 
is clockwise when viewed from above, as indicated in Fig. 9.53. We claim that œ points 
horizontally to the right (at the instant shown in Fig. 9.97), with magnitude (R/r)Q. 
This can be seen in two ways. 

The first way is to recognize that we essentially have the same scenario as in the 
“Rolling cone” setup of Problem 9.3 (imagine the sphere to be a ball of ice cream ina 
cone whose tip is located at the left end of the stick). The sphere’s contact point with the 
ground is instantaneously at rest (the nonslipping condition), so œ must pass through 
this point. But œ must also pass through the left end of the stick, because that point is 
also at rest. Therefore, œ must be horizontal. To find the magnitude, note that the center 
of the sphere moves with speed QR. But since the center may also be considered to be 
instantaneously moving with frequency in a circle of radius r around the horizontal 
axis, we have wr = QR. Therefore, w = (R/r)Q. 

The second way is to write œ as œ = —QzZz + o’X; (in the language of Fig. 9.30), 
where a’ is the frequency of the spinning as viewed by someone rotating around the 
(negative) Z axis with frequency Q. The contact points form a circle of radius R on 
the ground. But they also form a circle of radius r cos 0 on the sphere, where 0 is the 
angle between the stick and the ground (this circle is the circle of points where the 
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above-mentioned ice cream cone touches the sphere). The nonslipping condition then 
implies that QR = w'(rcos 6). Therefore, w = QR/(r cos 8), and so 


QR 
w = -N2 + a’ = -N2 4 ( ) (cosa + sin = (R/r)QXk, (9.175) 
rcosé 


where we have used tan 0 = r/R. 

Now let’s calculate the normal force. Choose the pivot as the origin. The principal 
axes are then X3 along the stick, along with any two directions orthogonal to the stick. 
Choose Xz to be in the plane of the paper (see Fig. 9.97). Then the components of œ 
along the principal axes are 


w3 = (R/r)Qceosé, and œ= —(R/r)Qsing. (9.176) 
The principal moments are 
b = (2/5)mr?, and h = (2/5) mr? + mr? + R?), (9.177) 


where we have used the parallel-axis theorem. The angular momentum is L = 
13.033 + 1)w2X2, so its horizontal component has length L1 = Bws cos 0 — howz sin 0. 
Therefore, the magnitude of dL/dt is 


IF = QL, = (Bows cos — haw? sin @) 
= [ (Gm?) (Fsrcos.) cos 0 
(Èm? H mr? H r))( f asine) sino | 
= mR a i (17 | R) sin?) 


7 
= zinrRQ?, (9.178) 


where we have used sin 0 = r//r? + R2. The direction of dL/dt is out of the page. 

The torque (relative to the pivot) is due to the gravitational force acting at the CM, 
along with the normal force N acting at the contact point. (Any horizontal friction 
that happens to exist at the contact point produces zero torque relative to the pivot.) 
Therefore, t points out of the page with magnitude |t| = (NV — mg)R. Equating this 
with the |dL/dt| in Eq. (9.178) gives 


N=mg+ Imre. (9.179) 


This has the interesting property of being independent of R, and hence also 0. This 
independence arises because both |t| and |dL/dt| are proportional to R, a fact which 
is easier to see via the reasoning in the first remark below. 


REMARKS: 

1. There is actually a quicker way to calculate the dL/dt in Eq. (9.178). At a given 
instant, the sphere is rotating around the horizontal x axis with frequency œ = 
(R/r)Q. The moment of inertia around this axis is i, = (7/ 5)mr*, from the 
parallel-axis theorem. Therefore, the horizontal component of L has magnitude 
Ly = Iw = (7/5)mrRQ. Multiplying this by the frequency (namely (2) at which 
L swings around the z axis gives the result for |dL/dt| in Eq. (9.178). There is 
also a vertical component of L relative to the pivot, but this component doesn’t 
change, so it doesn’t come into dL/dt. (The vertical component happens to equal 
—mR? Q. This can be obtained by realizing that the sphere behaves like a point 
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mass for the present purpose, or by using the inertia tensor relative to the pivot, 
or by calculating L; = ws sin 0 + hon cos @.) 

2. The pivot must provide a downward force of N — mg = (7/5)mrQ?, to make the 
net vertical force on the lollipop equal to zero. This result is slightly larger than 
the mrQ? result for the “sliding” setup in Exercise 9.53. 

3. The sum of the horizontal forces at the pivot and the contact point must equal the 
required centripetal force of mRQ?. But it is impossible to say how this force is 
divided up, without being given more information. & 


9.23. Rolling coin 


Choose the CM as the origin. The principal axes are then X2 and X3 (as shown in 
Fig. 9.98), along with $; pointing into the paper. Assume that the precessional motion 
is counterclockwise when viewed from above, as indicated in Fig. 9.54. Let’s look 
at things in the frame that has the center of the contact-point circle on the ground as 
its origin, and that rotates with angular velocity Qz. In this frame, the CM remains 
fixed, and the coin rotates with frequency œ’ (in the language of Fig. 9.30) around 
the negative $; axis. The nonslipping condition then says that œr = QR, and so 
œ = QR/r. Therefore, the total angular velocity of the coin with respect to the lab 
frame is 


@ = Q — aX = QZ — (R/r)QR3. (9.180) 
But z = sin 0 X2 + cos 0 X3, so we can write w in terms of the principal axes as 
o= Qsin6ia — 2(* — cose) $3. (9.181) 
The principal moments are 
b = (1/)m?, and h = (1/4mr?. (9.182) 
The angular momentum is L = w282 + Bw383, so its horizontal component has 


length L; = hon cos 6 — w; sin 0, with leftward taken to be positive. Therefore, the 
magnitude of dL/dt is 


dL 
Sl cor 
= + 


= (ha cos — 1303 sin 0) 


= of (7m )(2sin0)cos0 (5m) Q(R/r cosé)) sind] 


1 
= qi sin 0 (2R — r cos 8), (9.183) 


with a positive quantity corresponding to dL/dt pointing out of the page (at the instant 
shown). 

The torque (relative to the CM) comes from the force at the contact point. There are two 
components to this force. The vertical component is mg, and the horizontal component 
is m(R—r cos 0)Q? leftward, because the CM moves in a circle of radius (R —r cos 0). 
The torque therefore has magnitude 


|t| = mg(r cos 0) — m(R — r cos ORE sin 0), (9.184) 

with out of the page taken to be positive. Equating this |t| with the |dL/dt| in Eq. (9.183) 
gives 

Z= £ (9.185) 


3R tano — jr sind i 
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The right-hand side must be positive if a solution for Q is to exist. Therefore, the 
condition for the desired motion to be possible is 


5 
R> gi eos. (9.186) 


REMARKS: 


1. 


2. 


For 0 —> 7/2, Eq. (9.185) gives Q — 0, as it should. And for 0 — 0, we obtain 
Q — œ, which also makes sense. 

For r cos > R > (5/6)r cos 0, the CM of the coin lies to the /eft of the center 
of the contact-point circle (at the instant shown). The centripetal force, m(R — 
r cos @)7, is therefore negative (which means that it is directed radially outward, 
to the right), but the motion is still possible. As R gets close to (5/6)r cos 0, the 
frequency Q goes to infinity, which means that the radially outward force also 
goes to infinity. The coefficient of friction between the coin and the ground must 
therefore be correspondingly large. 


. We can consider a more general coin, whose density depends only on the distance 


from the center, and whose /3 equals Bmr?. For example, a uniform coin has 
ß = 1/2, and a coin with all its mass on the edge has 6 = 1. By the perpendicular- 
axis theorem, J) = h = (1/ 2) Bmr?, and you can show that the above methods 
yield 


Y= E r> (PE) cose 
(1 + 6)Rtan 6 — (1 + B/2)r sind 1+ 6 f 
(9.187) 


The larger £ is, the smaller the lower bound on R is. But even if 6 —> oo (imagine 
long radial spokes attached to the coin, and eliminate the ground except for the 
ring of contact points, so it doesn’t get in the way), R still can’t be any smaller 
than (7/2) cos0. & 


9.24. Wobbling coin 


(a) Choose the CM as the origin. The principal axes are then $2 and X3 (as shown 


in Fig. 9.99), along with X; pointing into the paper. Assume that the preces- 
sional motion is counterclockwise when viewed from above, as indicated in 
Fig. 9.55. Consider the setup in the frame rotating with angular velocity QÊ. 
In this frame, the location of the contact point remains fixed, and the coin 
rotates with frequency w’ (in the language of Fig. 9.30) around the negative 
%3 axis. The radius of the circle of contact points on the table is Rcosé. 
Therefore, the nonslipping condition says that œR = Q(Rcos@), and so 
ow! = Q cos. Hence, the total angular velocity of the coin with respect to the lab 
frame is 


w = QÊ — o'f; = Q (sin 8 $2 + cos 4 $3) — (Q cos 4)%3 = Q sin PR. 
(9.188) 


In retrospect, it makes sense that w must point in the X2 direction. Both the CM 
and the instantaneous contact point on the coin are at rest, so œ must lie along 
the line containing these two points, that is, along the X2 axis. 


(b) The principal moment around the & axis is 7 = mR? /4. The angular momen- 


tum is L = Jw2X2, so its horizontal component has length L} = Lcosé = 
(Iw) cos 0. Therefore, dL/dt has magnitude 


É 


mR? . 
x7 QL, =Q 4 (Q sin 0) cos 8, (9.189) 


and it points out of the page. 
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The torque (relative to the CM) is due to the normal force at the contact 
point. This normal force is essentially equal to mg, because the CM is assumed 
to be falling very slowly. Note that there is no sideways friction force at the 
contact point, because the CM is essentially motionless. The torque therefore 
has magnitude 


|t| = mgR cos 0, (9.190) 
and it points out of the page. Equating this |t| with the |dL/dt| in Eq. (9.189) 
gives 
2=2 |£. (9.191) 
Rsin@ 
REMARKS: 


1. 2 — œ as 0 — 0. This is evident if you do the experiment; the contact 
point travels very quickly around the circle. 

2. Q — 2,/g/R as ð —> 2/2, which isn’t so obvious. In this case, L points 
nearly vertically, and it traces out a tiny cone due to a tiny torque. In 
this 6 —> z/2 limit, Q is also the frequency at which the plane of the 
coin spins around the vertical axis. If you spin a coin very fast about a 
vertical diameter, it initially undergoes a purely spinning motion with only 
one contact point. But then it gradually loses energy due to friction, until 
the spinning frequency slows down to 2,/g/R, at which point it begins to 
wobble (we’re assuming that the coin is very thin, so that it can’t balance on 
its edge). In the case where the coin is a US quarter (with R ~ 0.012 m), 
this critical frequency of 2,/g/R turns out to be Q ~ 57rad/s, which 
corresponds to about 9 Hz. 

3. The result in Eq. (9.191) is a special case of the result in Eq. (9.185) 
of Problem 9.23. The CM of the coin in Problem 9.23 is motionless 
if R=rcos@. Plugging this into Eq. (9.185) gives Q? =4g/(rsin@), 
which agrees with Eq. (9.191), because r was the coin’s radius in 
Problem 9.23. & 


(c) Consider one revolution of the contact point around the z axis. Since the radius 
of the circle on the table is R cos 0, the contact point moves a distance 27 R cos 0 
around the coin during this time. Hence, the new contact point on the coin is 
a distance 27R — 27Rcos@ away from the original contact point. The coin 
therefore appears to have rotated by a fraction (1 — cos 0) of a full turn during 
this time. The frequency with which you see it turn is therefore 


(1 — cos @)Q = 2(1 — cos 0) £ (9.192) 
V Rsind 
REMARKS: 


1. If 6 ~ x/2, then the frequency of Abe’s rotation is essentially equal to Q. 
This makes sense, because the top of Abe’s head will be, say, always near 
the top of the coin, and this point will trace out a small circle around the z 
axis, with nearly the same frequency as the contact point. 

2. As 0 — 0, Abe appears to rotate with frequency 67/*./g7R (using sin 0 ~ 0 
andcosð ~ 1—0? /2). Therefore, although the contact point moves infinitely 
quickly in this limit, we nevertheless see Abe rotating very slowly, as you 
can experimentally verify. 

3. All of the results for frequencies in this problem must be some multiple of 
JVg/R, by dimensional analysis. But whether the multiplication factor is 
zero, infinite, or something in between, is not at all obvious. 

4. An incorrect answer for the frequency of Abe’s turning (when viewed 
from above) is that it equals the vertical component of œ, which is 
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@z = wsin@ = (Q sin 0) sin 8 = 2(sin@)*/?,./g/R. This does not equal the 
result in Eq. (9.192). (It agrees at 6 = 7/2 but is off by a factor of 2 for 
0 — 0.) This answer is incorrect because there is simply no reason why the 
vertical component of œw should equal the frequency of revolution of, say, 
Abe’s nose, around the vertical axis. For example, at moments when œ passes 
through the nose, then the nose isn’t moving at all, so it certainly cannot be 
described as moving around the vertical axis with frequency w, = wsin@. 
The result in Eq. (9.192) is a sort of average measure of the frequency 
of rotation. Even though any given point on the coin is not undergoing 
uniform circular motion, your eye sees the coin essentially rotating uniformly 
as a whole. & 


9.25. Nutation cusps 


(a) Because both ġ and Ê are continuous functions of time, we must have 


@ = 6 = 0 at a kink. Otherwise, either d0 /dọġ = 6/¢ or dø/d0 = $/6 
would be well defined at the kink. Let the kink occur at t = fo. Then the second 
of Eqs. (9.92) gives sin(@nfo) = 0. Therefore, cos(@nto) = +1, and the first of 
Eqs. (9.92) gives 


AQ = +O, (9.193) 


as desired. Note that if cos(@nto) = 1, then AQ = —Qsg, so Eq. (9.91) says that 
the kink occurs at the smallest value of 8, that is, at the highest point of the top’s 
motion. And if cos(wnto) = —1, then AQ = Qs, so Eq. (9.91) again says that 
the kink occurs at the highest point of the top’s motion. These results agree with 
the AQ = +Q; plots in Fig. 9.38. 


(b) To show that these kinks are cusps, we will show that the slope of the 0 


vs. ¢ plot is infinite on either side of the kink. That is, we will show that 
d8 /dġ = 0/¢ = +. Consider the case where cos(@nto) = 1 and AQ = —Q, 
(the cos(wnto) = —1 case proceeds similarly). With AQ = —Q4g, Eq. (9.92) 
gives 


ê £ sin 6o sin @nt (9.194) 
$ 1 — cos nat 
Let ¢ = to + €. Using cos(@pfo) = 1 and sin(wnto) = 0, and expanding 
Eq. (9.194) to lowest order in €, we find 


6 _ sin Oo (wne) E 2 sin ĝo 
ġ wee? /2 Wn€ 


(9.195) 


For infinitesimal e€, this switches from —oo to +00 as € passes through zero. 
The point here is that although both 6 and @ go to zero, @ goes quadratically, 
whereas 6 goes only linearly. 


9.26. Nutation circles 


(a) In the limit œ >> Qs, the original L points essentially along the x3 direction, 


with magnitude /3«3. So it makes an angle of essentially 4) with the vertical z 
axis. Now consider the quick kick. If ¢ (which is the angular speed around the 
vertical z axis) suddenly increases by AQ, then this corresponds to a sudden 
increase in angular speed around the x2 axis of sin fọ AQ (see Fig. 9.100). The 
kick therefore produces an angular momentum component (relative to the pivot) 
of J sin 6) AQ in the x2 direction. So from Fig. 9.100, the angle that the new L 
makes with the z axis is (using AQ < @3) 


Isin% AQ in AQ 
Pate aha, (9.196) 
w3 On 
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where we have used the definition of wn from Eq. (9.86). We see that the effect 
of the kick is to make L quickly change its 0 value (by only a small amount, 
because we are assuming w3 >> AQ). The ¢ value doesn’t immediately change, 
because immediately after the kick the x3 axis is still in exactly the same place, 
because it hasn’t had time to move. 

(b) After the kick is finished, L will trace out a cone around the z axis at approx- 
imately the same rate as before, namely Qs. (None of the relevant quantities 
in t = dL/dt have changed much from the original circular-precession case, 
so the precession frequency is basically the same.) So the new L has its (¢, 0) 
coordinates given by 


(0,00), = (2 iy = ne (9.197) 
On 

We’re assuming that Qs is very small, so if you want, you can ignore the Qs term 

and just consider L to be fixed, at least on the time scale of the nutations (see 

the remark below). But this term will cancel anyway in producing the following 

equation. Looking at Eq. (9.91), we see that the coordinates of the CM relative 


to L are 


(£), O(t¢ = AL jg t AV a 0 t 9.198 
(00.0) = a fad a Oo asa (9.198) 


The sin 69 factor in 0 (t) is exactly what is needed for the CM to trace out a 
circle around L, because a change in ¢ corresponds to a CM spatial change of 
£A¢@ sin 69, whereas a change in @ corresponds to a CM spatial change of €AQ. 

Now let’s see how this relates to a free top. For œn >> Qs, the time scale 
on which L changes due to the gravitational torque (namely 1/2.) is very long 
compared with the time scale of the nutation (namely 1/œn). Therefore, since 
L is essentially motionless on the time scale of 1/w,, the effects of gravity 
are negligible on this time scale. Hence, the system should behave like a free 
top. So let’s verify that the results here do indeed agree with the results from 
Section 9.6.2. 

Equation (9.55) in Section 9.6.2 says that the frequency of the precession of 
$; around L for a free top is L/J. But the frequency of the precession of X3 
around L in the present problem is œn, so this had better be equal to L/J. And 
indeed, L is essentially equal to 1303, so œn = 1303/7 = L/I. Therefore, for 
short enough time scales (short enough so that L doesn’t move much), a nutating 
top with w3 >> AQ > Q, looks very much like a free top. 


REMARK: We need the AQ > Q, condition so that the nutation motion looks 
roughly like circles (that is, like the top plot in Fig. 9.38, and not the others). 
This requirement can be seen by the following reasoning. The time for one 
period of the nutation motion is 27/@,. From Eq. (9.91), @(¢) increases by 
Ad = 2m 9Qs/@n during this time. And also from Eq. (9.91), the diameter d 
of the “circle” along the ¢ axis is roughly d = 2AQ/wy. The motion looks 
approximately like a circle if d >> Ag, that is, if AQ >> Qs.  & 


9.27. Rolling without slipping 


Yes, there are indeed other ways. It isn’t necessary for the contact points to form a 
vertical great circle. They can form a smaller tilted circle, as shown in Fig. 9.101. In 
this case the angular velocity vector points up to the left. There is indeed no slipping 
here, and the ball rolls straight. You can convince yourself of this in the following way. 
Imagine the ball hovering and spinning in place with its center at rest. At all times, 
the point at the bottom of the ball is moving directly out of the page; the collection of 
these points is the tilted circle in Fig. 9.101. Let the speed of the instantaneous bottom 
point be v, and now imagine taking a horizontal sheet of paper and sliding it out of the 
page with constant speed v, just beneath the ball, barely touching it. Since this is the 
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same speed as the bottom point on the ball (which is the contact point, because it’s at 
the bottom), there is no slipping between the ball and the paper. Finally, if we go to 
the reference frame of the paper, we have the desired motion of a ball rolling without 
slipping on a flat surface. 

If 0 is the angle that œ makes with the horizontal, then the radius of the con- 
tact circle is Rcos@. The linear speed of the ball is therefore w(Rcos@). If you 
want, you can think of this as (wcos@)R. That is, only the horizontal component 
of leads to translational motion of the ball. The vertical component simply yields 
a spinning motion around the vertical diameter. If 6 approaches 90°, then the con- 
tact circle is very small (see Fig. 9.102), so the speed of the ball is very small (for a 
given w). 

In the real world, the contact area between the ball and the surface isn’t an idealized 
single point, so if the contact circle isn’t a vertical great circle, there is inevitably some 
slipping at the points near the bottom of the ball. The smaller the contact circle, the 
more the slipping, because the slipping arises from the twisting motion around the 
vertical diameter, which comes from the vertical component of w. If you take an actual 
ball and roll it with a tilted œw, you will find that the rolling motion is noisier the more 
that @ is tilted. What you hear is the slipping due to the extended contact region. But 
in an ideal world (or perhaps in a bowling alley, which is a reasonable approximation 
thereof), a tilted œ wouldn’t be any noisier. 


Rolling straight? 
Intuitively, it is fairly clear that the sphere cannot change direction, but it is a little 
tricky to prove. Qualitatively, we can reason as follows. Assume that there is a nonzero 
friction force at the contact point. (The normal force is irrelevant here, because it 
doesn’t provide a torque about the center, because the contact point is always directly 
below the center. This is what is special about a sphere.) Then the sphere accelerates 
in the direction of this force. However, you can show with the right-hand rule that this 
force produces a torque that causes the angular momentum to change in a way that 
corresponds to the sphere accelerating in the direction opposite to the friction force 
(assuming that the sphere is rolling without slipping). There is thus a contradiction, 
unless the friction force is equal to zero. 

Let’s now be rigorous. Let the angular velocity of the sphere be œ, which may 
point diagonally if the contact points don’t form a vertical great circle on the sphere. 
The nonslipping condition says that the sphere’s CM velocity equals 


v=o x (RÈ). (9.199) 
The sphere’s angular momentum is 
L=Io. (9.200) 
The friction force (if it exists) from the ground at the contact point changes both the 
momentum and the angular momentum. F = dp/dt gives 


Fey. (9.201) 


And t = dL /dt (relative to the center) gives 


dL 
(Rô) x F= —, (9.202) 


because the force is applied at position —Rz relative to the center. We will now show 
that the preceding four equations imply that œ = 0. From Eq. (9.199), this then implies 
that v = 0, as desired. 

Equations (9.199) and (9.201) give F = mm x (R2). Plugging this F, along with the 
L from Eq. (9.200), into Eq. (9.202) then gives 


(—R2) x (mò x (R2)) = Io. (9.203) 


9.10 Solutions 


The vector œ must lie in the horizontal plane, because it equals the cross product 
of two vectors, one of which is the vertical z. This implies that (as you can verify) 
2 x (@ x Z) = &. Therefore, Eq. (9.203) gives 


2 
-mR = z"R o, (9.204) 


and so @ = 0, as we wanted to show. 


9.29. Ball on paper 
Our strategy will be to produce, and then equate, two different expressions for the total 
change in the angular momentum of the ball, relative to its center. The first comes 
from the effects of the friction force on the ball. The second comes from looking at the 
general form of the initial and final motion. 
To produce our first expression for AL, note that the normal force provides no 
torque, so we can ignore it. The friction force, F, from the paper changes both p and 
L according to 


Ap = f Fas 


(9.205) 
AL= fra = fr» xFdt = Rd) x f Far 
Both of these integrals run over the entire slipping time, which may include time on 
the table after the ball leaves the paper. In the second line above, we have used the fact 
that the friction force always acts at the same location, namely (—Rz), relative to the 
center of the ball. The two above equations yield 


AL = (—Rĉ) x Ap. (9.206) 


To produce our second equation for AL, let’s examine how L (the horizontal com- 
ponent of L)°? is related to p when the ball is rolling without slipping, which is the 
case at both the start and finish. When the ball is not slipping, we have the situation 
show in Fig. 9.103 (assuming that the ball is rolling to the right). The magnitudes of p 
and L] are given by 


p = m, 
(9.207) 


2 2 2 2 2 
Ly =lo, = z”R oO, = gente) = Fale = 5 fP: 


where we have used the nonslipping condition, v = Rw . (The actual J = (2/5)mR? 
value for a solid sphere won’t be important for the final result.) We can now combine 
the directions of L} and p in Fig. 9.103 with the above L, = (2/5)Rp scalar relation 
to write 


A 
Lı = ZÊ x p, (9.208) 


where z points out of the page. Since this relation is true at both the start and the finish, 
it must also be true for the differences in L] and p. That is, 


> 
AL, = [Rix Ap. (9.209) 


32 The vertical component of L (which is nonzero only if has a vertical component, which is the 
case if the contact points don’t form a vertical great circle on the ball) is constant, because the 
torque from friction provides only a horizontal torque. We may therefore ignore it, because we 
are concerned only with AL. 
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But AL, = AL because the vertical component of L doesn’t change, so Eqs. (9.206) 
and (9.209) give 


2 
(—Rz) x Ap = he x Ap => 0=2z~x Ap. (9.210) 


There are three ways this cross product can be zero: 


e Ap is parallel to z. But it isn’t, because Ap lies in the horizontal plane. 

e z= 0. Not true. 

e Ap = 0. This is the only possibility, so it must be true. Therefore, Av = 0, as we 
wanted to show. 


Note that the line of the final motion may very well be shifted sideways from the line 
of the initial motion. But what we’ve shown is that the final line is parallel to the initial 
line, and the speed is the same. 


REMARKS: 

1. As stated in the problem, it’s fine if you move the paper in a jerky motion, so that 
the ball slips around on it. We assumed nothing about the nature of the friction 
force in the above reasoning. And we used the nonslipping condition only at the 
initial and final times. The intermediate motion is arbitrary. 

2. As a special case, if you start a ball at rest on a piece of paper, then no matter 
how you choose to (horizontally) slide the paper out from underneath the ball, 
the ball will be at rest in the end. The final position will most likely be different 
from the initial position, but the ball will be at rest wherever it ends up. 

3. You are encouraged to experimentally verify that these crazy claims are true. 
Make sure that the paper doesn’t wrinkle, because a wrinkle would allow a force 
to be applied at a point other than the contact point. And balls that don’t squish 
are much better, of course, because the contact region better resembles a single 
point. & 

Ball on a turntable 

Let the angular velocity of the turntable be (22, and let the angular velocity of the ball 
(with respect to the lab frame) be œw, which may point diagonally if the contact points 
don’t form a vertical great circle on the ball. If the ball is at position r (with respect to 
the lab frame), then its CM velocity (with respect to the lab frame) may be broken up 
into the velocity of the turntable (at position r) plus the ball’s velocity with respect to 
the turntable. The nonslipping condition says that the latter is œ x (az), where a is the 
radius of the ball.*> The ball’s velocity with respect to the lab frame is thus 


v = (QZ) x r+@ x (az). (9.211) 


The important point to realize in this problem is that the friction force from the turntable 
is responsible for changing both the ball’s linear momentum and its angular momentum. 
In particular, F = dp/dt gives 


dv 
F=m—. (9.212) 
dt 
And the angular momentum of the ball is L = Jw, so t = dL/dt (relative to the 
center of the ball) gives 


dw 
dt ° 


because the force is applied at position —az relative to the center. 


(-—az) x F =I (9.213) 


33 The velocity with respect to the turntable is actually œ x (az), where wy is the angular velocity in 


the turntable frame. But w; differs from @ simply by the angular velocity of the turntable, which 


is Qz. This implies that w, x (aĉ) = w x (aĉ). 


9.10 Solutions 


We will now use the previous three equations to demonstrate that the ball undergoes 
circular motion. Our goal will be to produce an equation of the form, 


av _ os (9.214) 
— =Q xv, . 

dt 
because this describes circular motion with frequency Q’ (to be determined).** We 


will eliminate F first, and then œ. Plugging the expression for F from Eq. (9.212) into 
Eq. (9.213) gives 


F dv dw dw am\, dv 
(—az) x |m I ( )a x : (9.215) 
dt dt dt I dt 
Taking the derivative of Eq. (9.211) gives 
dv ~ adr do és 
= QZ x 1 x (aZ) 
dt dt dt 
Q& xv (F) waa eee (9.216) 
> I dt : i 


Since we know that the vector dv/dt lies in the horizontal plane, it is easy to work 
out the cross product in the second term here (or just use the identity (A x B) x C = 
(A - C)B — (B - C)A) to obtain 


dy R ma? \ dv dv Q r: 
= Q2 x v = 2x vV. (9.217) 
dt I dt dt 1 + (ma? /1) 
For a uniform sphere, J = (2/ 5)ma*, so we obtain 
dv 2 e 
— =(-Q)zxv. (9.218) 
dt 7 


Therefore, in view of Eq. (9.214), we see that the ball undergoes circular motion, with 
a frequency equal to 2/7 times the frequency of the turntable. Note that this result for 
the frequency doesn’t depend on initial conditions. For an extension to this problem, 
see Weltner (1987) and references therein. 


REMARKS: 
1. Integrating Eq. (9.218) from the initial time to some later time gives 


oe (52) 2% =m. (9.219) 


This may be written (as you can verify) in the more suggestive (if not more 
frightening) form, 


2 7 
= (22x (r—(ro+ =—@xv0))). 9.220 
v € )ax(s ro+ 35x w))) (9.220) 
This equation describes circular motion, with the center located at the point, 


ia 
Ye = ro + 39 @ x vo), (9.221) 


34 Equation (9.214) describes circular motion because it implies that v has constant magnitude (since 
the change in v is perpendicular to v), and so the direction of v changes at the constant angular rate 
Q’. The only curve with these properties is a circle. If you want to be more mathematical, you can 
integrate Eq. (9.214) to obtain v = Q/% x r + C, which can be written as dr/dt = Q’? x (r — rc), 
which implies that d(r — re)/dt = Q'% x (r — re), which means that the length of r — re doesn’t 
change (because its change is perpendicular to itself). And since the changes are also constrained 


to be perpendicular to Z, we must have a horizontal circle. 
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and with radius, 


7vo 
22° 


7 
R= |r = rel = zg Ê x vol = (9.222) 


2 


2. There are a few special cases to consider: 
e If vo = 0 (that is, if the spinning motion of the ball exactly cancels the 


rotational motion of the turntable, so that the CM of the ball is at rest in the 
lab frame), then R = 0 and the ball remains in the same place, as it should. 
If the ball is initially not spinning, and just moving along with the turntable, 
then vo = Qro. The radius of the circle is therefore R = (7/2)ro, and its 
center is located at, from Eq. (9.221), 

r, = rgd - (2r) = at (9.223) 
The point on the circle diametrically opposite to the initial point is therefore 
a distance re +R = (5/2)r9+(7/2)ro = 6ro from the center of the turntable. 
If we want the center of the circle to be the center of the turntable, then Eq. 
(9.221) says that we need (7/29)z x vo = —Yro. This implies that vo has 
magnitude vo = (2/7)Qro and points tangentially in the same direction as 
the turntable moves. That is, the ball moves at 2/7 times the velocity of the 
point on the turntable right beneath it. 


3. The fact that the frequency (2/7)Q is a rational multiple of Q means that the 
ball will eventually return to the same point on the turntable. In the lab frame, 
the ball traces out two circles in the time it takes the turntable to undergo seven 
revolutions. From the point of view of someone on the turntable, the ball “spirals” 
around five times before returning to the original position. 

4. If we look at a ball with moment of inertia J = ma? (so a uniform sphere has 
B = 2/5), then Eq. (9.217) shows that the “2/7” in Eq. (9.218) gets replaced 
by “6/(1 + B).” If a ball has most of its mass concentrated at its center (so that 
B — 0), then the frequency of the circular motion goes to 0, and the radius goes 
to co (as long as vo #0). & 


Chapter 10 
Accelerating frames of reference 


Newton’s laws hold only in inertial frames of reference. However, there are 
many noninertial (that is, accelerating) frames that we might reasonably want to 
consider, such as elevators, merry-go-rounds, and so on. Is there any possible 
way to modify Newton’s laws so that they hold in noninertial frames, or do we 
have to give up entirely on F = ma? It turns out that we can in fact hold on to our 
good friend F = ma, provided that we introduce some new “fictitious” forces. 
These are forces that a person in the accelerating frame thinks exist. If she applies 
F = ma while including these new forces, then she will get the correct answer 
for the acceleration a, as measured with respect to her frame. 

To be quantitative about all this, we’ll have to spend some time figuring out 
how the coordinates (and their derivatives) in an accelerating frame relate to those 
in an inertial frame. But before diving into that, let’s look at a simple example 
that demonstrates the basic idea of fictitious forces. 


Example (A train): Imagine that you are standing on a train that is accelerating 
to the right with acceleration a. If you are to remain at the same spot on the train, 
then there must be a friction force between the floor and your feet with magnitude 
Fç = ma, pointing to the right. Someone standing in the inertial frame of the ground 
simply interprets the situation as, “The friction force Ff, which equals ma, causes 
your acceleration a.” 

How do you interpret the situation in the frame of the train? Assume that there 
are no windows, so that all you see is the inside of the train. As we will show below 
in Eq. (10.11), you will feel a fictitious translation force, Ftrans = —ma, pointing 
to the left. You therefore interpret the situation as, “In my frame (the frame of the 
train), the friction force Ff = ma pointing to my right exactly cancels the mysterious 
Frans = —ma force pointing to my left, resulting in zero acceleration (in my frame).” 

Of course, if the floor of the train is frictionless so that there is no force at your 
feet, then you will say that the net force on you is Ftrans = —ma, pointing to the 
left. You will therefore accelerate with acceleration a to the left, with respect to your 
frame (the train). In other words, you will remain motionless (or move with constant 
velocity, if you happened to be moving initially) with respect to the inertial frame of 
the ground, which is all quite obvious to someone standing on the ground. 

In the case where the friction force at your feet is nonzero, but not large enough to 
balance out the whole Ftrans = —ma force, you will end up being accelerated toward 
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the back of the train (in the frame of the train), with an acceleration that is less than 
a. This undesired motion will continue until you make the required adjustments with 
your feet or hands to balance out all of the Ftrans force. 


Let’s now derive the fictitious forces in their full generality. The main task 
here is to relate the coordinates in an accelerating frame to those in an inertial 
frame, so this endeavor will require a bit of math. 


10.1 Relating the coordinates 


Consider an inertial coordinate system with axes X,, y,, and Z,, and let there be 
another (possibly accelerating) coordinate system with axes x, y, and z. These 
latter axes are allowed to change in an arbitrary manner with respect to the inertial 
frame. That is, the origin may undergo acceleration, and the axes may rotate (this 
is the most general possible motion, as we saw in Section 9.1). These axes may 
be considered to be functions of the inertial axes. 

Let Or and O be the origins of the two coordinate systems. Let the vector from 
Oj; to O be R, let the vector from O; to a given particle be r;, and let the vector 
from O to the particle be r. See Fig. 10.1 for the 2-D case. Then 


r=R+r. (10.1) 


These vectors have an existence that is independent of any specific coordinate 
system, but let’s write them in terms of some definite coordinates. We may write 


R=XX,4+ Y¥,4+ ZH, 
ri = xk +yy +214, (10.2) 
r=xx+yy +22. 


For reasons that will become clear, we have chosen to write R and r; in terms of the 
inertial-frame coordinates, and r in terms of the accelerating-frame coordinates. 
If desired, Eq. (10.1) may be written as 


xX] + yt + cava = (XX + YŞı + Z2) + (xx + yy + zi) é (10.3) 


Our goal is to take the second time derivative of Eq. (10.1) and to then inter- 
pret the result in an F = ma form. The second derivative of r; is the acceleration 
of the particle with respect to the inertial system, so Newton’s second law tells us 
that F = mr. The second derivative of R is the acceleration of the origin of the 
moving system. The second derivative of r is the tricky part. In view of its form 
in Eq. (10.2), changes in r can come about in two ways. First, the coordinates 
(x,y,z) ofr, which are measured with respect to the moving axes, may change. 
And second, the axes $, y, z themselves may change. The point is that r is not 
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just the ordered triplet (x,y,z). It is the whole expression, r = xx + yy + ZZ. 
So even if (x,y,z) are fixed, meaning that r doesn’t change with respect to the 
moving system, r can still change with respect to the inertial system if the x, y, z 
axes are moving. Let’s be quantitative about this. 


Calculation of d?r/ dt? 
We should clarify our goal here. We would like to obtain d*r/dt? in terms of 
the coordinates in the accelerating frame, because we want to be able to work 
entirely in terms of these coordinates so that a person in the accelerating frame 
can write down an F = ma equation in terms of her coordinates only, without 
having to consider the underlying inertial frame at all. In terms of the inertial 
frame, d?r /dt? is simply d? (r, — R) /dt*, but this isn’t very enlightening by itself. 
The following exercise in taking derivatives works for a general vector A = 
A,X + 4f + 4-2 in the moving frame; it isn’t necessary that it be a position 
vector. So we’ ll work with a general A and then set A = r when we’re done. To 
take d/dt of A = AyX + Ayy + 4-2, we can use the product rule to obtain 


dA (dA,, dAy, dA dì d§ 
=( “£4 —9 22) + (4 u4 


_ A, 
di Yt” a Ok 


ae 10.4 
dt dt dt ae ge] Ca) 


Yes, the product rule works with vectors too. We’re doing nothing more than 
expanding (A, + dAx)( + dx) — A,X, etc., to first order. Note that although we 
have expressed the vector A in terms of the coordinate axes of the moving frame, 
the total derivative dA/dt is measured with respect to the inertial frame. As 
mentioned above, the total rate of change of A comes from two effects, namely 
the two groups of terms in Eq. (10.4). The first group gives the rate of change 
of A, as measured with respect to the moving frame. We’ll denote this quantity 
by dA/6t. 

The second group arises because the coordinate axes are moving. In what 
manner are they moving? We have already extracted the motion of the origin of 
the accelerating system by introducing the vector R, so the only thing left is a 
rotation about some axis œ through this origin (see Theorem 9.1). The axis œ 
may change with time, but at any instant a unique axis of rotation describes the 
system. The fact that the axis may change will be relevant in finding the second 
derivative of r, but not in finding the first derivative. 

We saw in Theorem 9.2 that a vector B that has fixed length (the coordinate 
axes here do indeed have fixed length) and that rotates with angular velocity 
@ = wô changes at a rate dB/dt = w x B. In particular, dx/dt = w x $, etc. So 
in Eq. (10.4), the A,(dx/dt) term equals A,(@ x X) = @ x (A,X). Likewise for 
the y and z terms. Combining them gives @ x (Ax + A,y + 4-2), which is just 
@ x A. Therefore, Eq. (10.4) becomes 


=—+@xA. (10.5) 
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This agrees with the result obtained in Section 9.5, Eq. (9.41). We’ve basically 
given the same proof here, but with a little more mathematical rigor. 

We still have to take one more time derivative. Using the product rule, the 
time derivative of Eq. (10.5) is 


A d(sA > dw se iat dA (10.6) 
= x s 
dt? dt \ ôt dt = dt ` 


Applying Eq. (10.5) to the first term on the right (with 8A /ôt in place of A), and 
plugging Eq. (10.5) into the third term, gives 


A 8? A x ôA P dw Alg ôA KUKA 
= x x @ X =— @ xX 
dt? 8t? i ôt dt ôt 


-T e (10.7) 
= w x (w X @ x — + — x . 
DA dt 
At this point we will set A = r, which gives 
dr dw 
Ae OEE AN ae E (10.8) 


where r, v = ôr/ôt, and a = 61/6? are the position, velocity, and acceleration 
of the particle, as measured with respect to the accelerating frame. In other words, 
if the accelerating frame is enclosed in a windowless box, and if a person inside 
the box paints a coordinate grid on the floor, then she can (assuming that she has 
a clock at her disposal) measure r, v, and a without caring at all about what is 
going on in the outside world. 

You might be concerned that the r, v, and a on the right-hand side of 
Eq. (10.8) are measured by someone in the rotating frame, whereas the d?r /dt? on 
the left-hand side is measured by someone in the inertial frame. But this apparent 
discrepancy is a nonissue, because Eq. (10.8) is a statement about vectors, and 
these vectors have an existence that is independent of the coordinate system that 
is used to describe them. For example, the vector a points in a certain direction 
(say, toward the distant star Sirius) and has a certain magnitude, independent of 
whoever chooses to describe it with whatever coordinates. 


10.2 The fictitious forces 


From Eq. (10.1) we have 


= . 10. 
dt? dt? dt? oo) 


We can equate this expression for d*r/dt? with the expression in Eq. (10.8), 
and then multiply through by the mass m of the particle. Recognizing that 
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m(d?r,/dt*) is the force F acting on the particle (F may be gravity, a normal 
force, friction, tension, etc.), we can solve for ma to obtain 


i aR ( y-2 dw 
a= =a x xr)— x V— m— XT 
m m diz M@ (O) M@ v m dt 
= F + Frranslation + F centrifugal + F coriolis + Fazimuthal, (10.10) 


where the fictitious forces are defined as 


@R 
Frans = mE > 
Foent = —M@ xX (@ x r), 
cent ( ) (10.11) 
Foor = —2mo@ x V, 
dw 
F,, = —m— xr 
az dt 


We have taken the liberty of calling these quantities “forces,” because the left- 
hand side of Eq. (10.10) is ma, where a is measured by someone in the accelerating 
frame. This person should therefore be able to interpret the right-hand side as 
some effective force. In other words, if a person in the accelerating frame wishes 
to calculate ma, she simply needs to take the true force F, and then add on all 
the other terms on the right-hand side of Eq. (10.10), which she will then quite 
reasonably interpret as forces (in her frame). She will consider Eq. (10.10) to be 
an F = ma statement in the form, 


ma = Ý ` Face, (10.12) 


where Face represents all the forces, real or fictitious, in the accelerating frame. 
All we’ve really done is transfer some terms to the other side of an equation and 
reinterpret the result. The last three terms on the right-hand side of Eq. (10.8) are 
nothing more than various pieces of the acceleration. But if we transfer them to 
the left-hand side and solve for a, we can then interpret them (after multiplying 
by m) as forces in Eq. (10.10). 

Ofcourse, the extra terms in Eq. (10.10) are not actual forces. The constituents 
of F are the only real forces in the problem (and they are the same in any frame). 
All we are saying is that if our friend in the moving frame assumes that the extra 
terms are real forces, and if she adds them to F, then she will get the correct 
answer for ma, as measured in her frame. 

For example, consider a box (far away from other objects, in outer space) 
that accelerates at a rate of g = 10 m/s” in some direction. A person in the box 
feels a fictitious force of Frans = mg down into the floor. For all she knows, 
she is in a box on the surface of the earth. If she performs various experiments 
under this assumption, the results will always be what she expects. The surprising 
fact that no local experiment can distinguish between the fictitious force in the 
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accelerating box and the real gravitational force on the earth is what led Einstein 
to his Equivalence Principle and his theory of General Relativity (discussed in 
Chapter 14). These fictitious forces are more meaningful than you might think. 


As Einstein explored elevators, 

And studied the spinning ice-skaters, 
He eyed as suspicious, 

The forces, fictitious, 

Of gravity’s great imitators. 


Note that in addition to the sum ofall the real forces, F, there are two different 
types of quantities on the right-hand side of Eq. (10.10). The quantities r and v 
are measured by someone inside the accelerating frame; they depend on what 
the particle is doing. But d*R/dt* and are properties of the frame. In general, 
the person inside needs to be given their values, although it is possible for her to 
figure out what they are in certain special cases (see Problem 10.9). 

Let’s now look at each of the fictitious forces in detail. The translation and 
centrifugal forces are fairly easy to understand. The Coriolis force is a little more 
difficult. And the azimuthal force can be easy or difficult, depending on exactly 
how @ is changing. 


10.2.1 Translation force: —md?R/dt? 


This is the most intuitive of the fictitious forces. We’ve already discussed this 
force in the train example in the introduction to this chapter. If R is the position 
of the train, then Frans = —md2R / dt? is the fictitious translation force you feel 
in the accelerating frame. 


10.2.2 Centrifugal force: —m@ x (@ x r) 


This force goes hand-in-hand with the mv*/r = mra centripetal acceleration 
as viewed by someone in an inertial frame. 


Example (Standing on a carousel): Consider a person standing motionless 
with respect to a carousel, a distance r from the center. Let the carousel rotate in the 
x-y plane with angular velocity w = wz (see Fig. 10.2). What is the centrifugal force 
felt by the person? 


Solution: œ x r has magnitude wr and points in the tangential direction, in the 
direction of motion. Therefore, mw x (w x r) has magnitude mrw? and points radially 
inward. Hence, the centrifugal force —mw x (w x r) has magnitude mro? and points 
radially outward. 


Remark: Ifthe person is not moving with respect to the carousel, and if œ is constant, 
then the centrifugal force is the only nonzero fictitious force in Eq. (10.10). Since the 
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person is not accelerating in her rotating frame, the net force (as measured in her frame) 
must be zero. The forces in her frame are (1) gravity pulling downward, (2) the normal 
force pushing upward (which cancels gravity), (3) the friction force pushing inward at her 


feet, and (4) the centrifugal force pulling outward. We conclude that the last two of these 


must cancel. So the friction force must point inward with magnitude mrw’. 


Someone standing on the ground observes only the first three of these forces, so the 
net force is not zero. And indeed, there is a centripetal acceleration, v? /r = rw”, due to 
the friction force. To sum up: in the inertial frame, the friction force exists to provide a 
centripetal acceleration. In the rotating frame, the friction force exists to balance out the 
mysterious new centrifugal force, in order to yield zero acceleration. & 


Example (Effective gravity force, MGeff): Consider a person standing motion- 
less on the earth, at a polar angle 6; the way we’ve defined it, 0 equals 7/2 minus 
the latitude angle. In the rotating frame of the earth, the person feels a centrifugal 
force (directed away from the axis) in addition to the gravitational force, mg; see 
Fig. 10.3, although this figure is somewhat misleading, as explained in the first 
remark below. Note that we’re using g to denote the acceleration due solely to 
the gravitational force. This isn’t the “g-value” that the person measures, as we’ll 
shortly see. 

The sum of the gravitational and centrifugal forces (that is, what the person thinks 
is gravity) doesn’t point radially, unless the person is at the equator or at a pole. 
Let us denote the sum by mgerr. To calculate mgeff, we must calculate Foent = 
—m®@® x (w x r). The w x r part has magnitude Rw sin 0, where R is the radius of 
the earth, and it is directed tangentially along the latitude circle of radius R sin 0. So 
—m® x (w Xx r) points outward from the axis, with magnitude mRo” sin 0, which 
is just what we expect for something traveling at frequency w in a circle of radius 
R sin 0. Therefore, the effective gravitational force, 


Moff = m(g — w xX (@ X r)), (10.13) 


points slightly in the southerly direction (for someone in the northern hemisphere), 
as shown in Fig. 10.4. The magnitude of the correction term, mRo&? sin @, is small 
compared with g. Using w © 7.3 - 1075 s7! (that is, one revolution per day, which 
is 27 radians per 86400 seconds) and R © 6.4 - 10° m, we find Rw” ~ 0.03 m/s. 
Therefore, the correction to g is about 0.3% at the equator (but zero at the poles). 
However, this result does not imply that the value of geff is 0.3% smaller at the 
equator than at the poles, because the value of g itself (which we’ve defined to 
be the acceleration due to the gravitational force only) varies over the surface of 
the earth, due to its slightly nonspherical shape. (It also varies on a local scale due to 
density fluctuations and altitude.) When the centrifugal effect is combined with the 
nonspherical effect, the result is that geff is about 0.5% smaller at the equator; see 
Tona (1978). 


REMARKS: 


1. We found above that the sum of the gravitational and centrifugal forces doesn’t point 
radially. Interestingly, there are two reasons for this. The first, as we just saw, is 
that under the (seemingly obvious) assumption that the earth’s gravitational force is 
radial, the centrifugal force causes the sum not to be radial. The second reason is that 
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the earth’s gravitational force actually isn t radial; the bulge at the equator causes the 
gravitational force to point slightly away from the center of the earth (except at the 
equator or at a pole). This is believable, because the extra mass at the nearby part 
of the bulge tends to shift the gravitational force in that direction. An exaggerated 
picture of the situation is shown in Fig. 10.5. The result of the nonradial g is that 
the effective gravitational force points even further from the radial direction than 
expected. It turns out that the effect from the nonradial g on the direction of geff is 
roughly equal to the effect from the centrifugal force. 

So there is a feedback effect: The spinning of the earth causes geff to point away 
from the radial, which causes the earth to bulge (because the surface of the earth 
is, on average, perpendicular to geff), which then causes geff to point a little further 
from the radial, which causes the earth to bulge a little more, and so on. Problem 
10.12 works out some of the details of this, but see Mohazzabi and James (2000) for 
further discussion. 

2. In the construction of buildings, and in similar matters, it is ger, and not g, that 
determines the “upward” direction in which the building should point. The exact 
directions of the earth’s gravitational force and center of the earth are irrelevant. 
A plumb bob hanging from the top of a skyscraper touches exactly at the base. Both 
the bob and the building point in a direction slightly different from both the radial 
and g, but no one cares. 

3. If you look in a table and find the value for the acceleration due to gravity in Boston, 
remember that the number is the geff value and not the g value (which describes 
only the gravitational force, in our terminology). The way we’ve defined it, the 
g value is the acceleration with which things would fall if the earth kept its same 
shape but somehow stopped spinning. The exact value of g is therefore generally 
irrelevant. & 


10.2.3 Coriolis force: —2mm@ x v 


While the centrifugal force is a very intuitive concept (we’ve all gone around 
a corner in a car), the same thing cannot be said about the Coriolis force. This 
force requires a nonzero velocity v relative to the accelerating frame, and peo- 
ple normally don’t move with an appreciable v with respect to their car while 
rounding a corner. To get a feel for this force, let’s look at two special cases. 


Case 1 (Moving radially on a carousel): A carousel rotates counterclockwise 
with constant angular speed w. Consider someone walking radially inward on the 
carousel (imagine a radial line painted on the carousel; the person walks along this 
line), at speed v with respect to the carousel, at radius r. The angular velocity vector 
® points out of the page, where we’ve signified the “out” direction in Fig. 10.6 by a 
little circle with a dot inside. 


Remark: This remark might be a little picky, but I'll say it anyway. The direction of 
rotation is sometimes denoted by a curved arrow pointing tangentially along the circum- 
ference of the carousel. But this is technically not correct, because it would imply that 
the carousel is rotating in the rotating frame, which it isn’t; it’s just sitting there. And it’s 
understood that Fig. 10.6 is in fact drawn in the rotating frame, and not the lab frame, 
because it includes a fictitious force, which has nothing to do with the lab frame. (If you 
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wanted to draw things in the lab frame, then you wouldn’t draw any fictitious forces, and 
the velocity v would have a tangential component, at least in this setup.) & 


The Coriolis force, —2mm@ x v, points tangentially in the direction of the motion of 
the carousel, that is, to the person’s right in our scenario. It has magnitude 


Foor = 2mov. (10.14) 


The person will have to counter this force with a tangential friction force of 2mwv 
(pointing to his left) at his feet, so that he continues to walk on the same radial line. 
Note that there is also the centrifugal force, which is countered by a radial friction 
force at the person’s feet. But this effect won’t be important here. 

Why does this Coriolis force exist? It exists so that the resultant friction force 
changes the angular momentum of the person (measured with respect to the lab frame) 
in the proper way, according to t = dL/dt. To see this, take d/dt of L = mr2a, 


where w is the person’s angular speed with respect to the lab frame, which is also the 


carousel’s angular speed. Using dr/dt = —v, we have 
dL 
g = 2rov+ mr? (dw/dt). (10.15) 


But dw/dt = 0, because the person remains on one radial line, and we are assuming 
that the carousel is arranged to keep a constant w. Equation (10.15) then gives dL /dt = 
—2mrwv. So the L (with respect to the lab frame) of the person changes at a rate 
—(2mav)r. This is simply the radius times the tangential friction force applied by the 
carousel. In other words, it is the torque applied to the person. 


REMARK: What if the person doesn’t apply a tangential friction force at his feet? Then the 
Coriolis force of 2mwv produces a tangential acceleration of 2wv in the rotating frame, and 
hence also in the lab frame (initially, before the direction of the motion in the rotating frame 
has a chance to change), because the frames are related by a constant w. This acceleration 
exists essentially to keep the person’s angular momentum (with respect to the lab frame) 
constant. (It is constant in this scenario, because there are no tangential forces in the lab 
frame.) To see that this tangential acceleration is consistent with conservation of angular 
momentum, set dL/dt = 0 in Eq. (10.15) to obtain 2wv = r(dw/dt) (this is the person’s 
œ here, which is changing). The right-hand side of this is by definition the tangential 
acceleration. Therefore, saying that L is conserved is the same as saying that 2wv is the 
tangential acceleration (for this situation where the inward radial speed is v). &æ 


Case 2 (Moving tangentially on a carousel): Now consider someone walking 
tangentially on a carousel in the direction of the carousel’s motion, with speed v 
(relative to the carousel) at constant radius r (see Fig. 10.7). The Coriolis force 
—2m@ x v points radially outward with magnitude 2mwv. Assume that the person 
applies the friction force necessary to continue moving at radius r. 

There is a simple way to see why this outward force of 2mmv exists. Let V = wr 
be the speed of a point on the carousel at radius r, as viewed by an outside observer. 
If the person moves tangentially (in the same direction as the spinning) with speed v 
relative to the carousel, then his speed as viewed by the outside observer is V + v. 
The outside observer therefore sees the person walking in a circle of radius r at speed 
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V + v. The acceleration of the person with respect to the ground frame is therefore 
(VV +v)? /r. This acceleration must be caused by an inward-pointing friction force at 
the person’s feet, so 


mV + v)? mV? 2mVv m? 
Ffriction = 7 =o rt ot (10.16) 


This friction force is the same in any frame. How, then, does our person on the carousel 
interpret the three pieces of the inward-pointing friction force in Eq. (10.16)? The 
first term balances the outward centrifugal force due to the rotation of the frame, 
which he always feels. The third term is the inward force his feet must apply if he 
is to walk in a circle of radius r at speed v, which is exactly what he is doing in the 
rotating frame. The middle term is the additional inward friction force he must apply 
to balance the outward Coriolis force of 2mœv (using V = wr). Said in an equivalent 
way, the person on the carousel will write down an F = ma equation of the form 
(taking radially inward to be positive), 


m = 


v? m(V + v)? mV? 2mVv 
r r r r 


=> ma= Friction + Feent + Feor- (10.17) 


We see that the net force he feels does indeed equal his ma, where a is measured with 
respect to the rotating frame. Physically, the difference between the interpretations 
of Eqs. (10.16) and (10.17) is the existence of fictitious forces in the rotating frame. 
Mathematically, the difference is simply the rearrangement of terms. 


For cases in between the two special cases above, things aren’t so clear, but 
that’s the way it goes. Note that no matter what direction you move on a carousel, 
the Coriolis force always points in the same perpendicular direction relative to 
your motion. Whether it’s to your right or to your left depends on the direction 
of the rotation. But given œ, you’re stuck with the same relative direction of the 
force. 


On a merry-go-round in the night, 
Coriolis was shaken with fright. 

Despite how he walked, 

Twas like he was stalked 

By some fiend always pushing him right. 


Let’s do some more examples... 


Example (Dropped ball): A ball is dropped from height h, at a polar angle 0 
(measured down from the north pole). How far to the east is the ball deflected, by the 
time it hits the ground? 


Solution: The angle between and v is x — 0, so the Coriolis force —2mw x v 
is directed eastward with magnitude 2mwv sin 0, where v = gt is the speed at time 
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t (t runs from zero to the usual /2h/g ).! Note that the ball is deflected to the east, 
independent of which hemisphere it is in. The eastward acceleration at time ¢ is there- 
fore 2wgt sin 0. Integrating this to obtain the eastward speed (with an initial eastward 
speed of zero) gives Veast = ogt? sin. Integrating again to obtain the eastward 
deflection (with an initial eastward deflection of zero) gives deast = oogt sin 0/3. 
Plugging in t = y2h/g gives 


2whsinð |2h 
deast = —— | —. (10.18) 


3 g 


The frequency of the earth’s rotation is œ ~ 7.3 - 1075 s—!, so if we pick 0 = 1/2 
and h = 100m, for example, then we have deast © 2 cm. 


Remark: We can also solve this problem by working in an inertial frame; see Stirling 
(1983). Figure 10.8 shows the setup where a ball is dropped from a tower of height h 
located at the equator (the view is from the south pole). The earth is rotating in the inertial 
frame, so the initial sideways speed of the ball, (R+/)q, is larger than the sideways speed 
of the base of the tower, Rw. This is the basic cause of the eastward deflection. 

However, after the ball has moved to the right, the gravitational force on it picks up a 
component pointing to the left, and this slows down the sideways speed. If the ball has 
moved a distance x to the right, then the leftward component of gravity equals g sing ~ 
g(x/R). Now, to leading order we have x = Rat, so the sideways acceleration of the ball is 
a = —g(Rot/R) = —wet. Integrating this, and using the initial speed of (R + h)w, gives 
a rightward speed of (R + h)w — wgt?/2. Integrating again gives a rightward distance 
of (R + h)wt — wgt?/6. Subtracting off the rightward position of the base of the tower Fig. 10.8 
(namely Rot), and using t ~ ,/2h/g (neglecting higher-order effects such as the curvature 
of the earth and the variation of g with altitude), we obtain an eastward deflection of 
wh, /2h/g(1 — 1/3) = (2/3)wh,/2h/g, relative to the base of the tower. If the ball is 
dropped at a polar angle @ instead of at the equator, then the only modification is that all 
speeds are decreased by a factor of sin 0, so we obtain the result in Eq. (10.18).  & 


equator 


Example (Foucault’s pendulum): This is the classic example of a consequence 
of the Coriolis force. It unequivocally shows that the earth rotates. The basic idea 
is that due to the rotation of the earth, the plane of a swinging pendulum rotates 
slowly, with a calculable frequency. In the special case where the pendulum is at 
one of the poles, this rotation is easy to understand. Consider the north pole. An 
external observer, hovering above the north pole and watching the earth rotate, sees 
the pendulum’s plane stay fixed (with respect to the distant stars) while the earth rotates 
counterclockwise beneath it.? Therefore, to an observer on the earth, the pendulum’s 
plane rotates clockwise (viewed from above). The frequency of this rotation is of 
course just the frequency of the earth’s rotation, so the earth-based observer sees the 
pendulum’s plane make one revolution each day. 


' Technically, v = gt isn’t quite correct. Due to the Coriolis force, the ball will pick up a small 
eastward velocity component (this is the point of the problem). This component will then produce 
a second-order Coriolis force that affects the vertical speed (see Exercise 10.21). But we can ignore 
this small effect in this problem. Also, we really mean gap instead of g, but any ambiguity in this 
will have a negligible effect. 

2 Assume that the pivot of the pendulum is a frictionless bearing, so that it can’t provide any torque 
to twist the plane of the pendulum. 
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What if the pendulum is not at one of the poles? What is the frequency of the 
precession? Let the pendulum be located at a polar angle 6. We will work in the 
approximation where the velocity of the pendulum bob is horizontal (with respect to 
the earth’s surface). This is essentially true if the pendulum’s string is very long; the 
correction due to the rising and falling of the bob is negligible. The Coriolis force 
—2m® x v points in some complicated direction, but fortunately we are concerned 
only with the component that lies in the horizontal plane (that is, the plane of the 
ground). The vertical component serves only to modify the apparent force of gravity 
and is therefore negligible. Although the frequency of the pendulum does depend 
on g, the resulting modification is very small. With this in mind, let’s break œ into 
vertical and horizontal components in a coordinate system located at the pendulum. 
From Fig. 10.9, we see that 


@ = wcosh 2 + wsinð Ŷ. (10.19) 


We’ll ignore the y component, because it produces a Coriolis force in the z direction, 
since v lies in the horizontal x-y plane. So for our purposes, œ is essentially equal to 
cos @ Z. From this point on, the problem of finding the frequency of precession can 
be done in numerous ways. We’ll present two solutions. 


First solution (The slick way): The horizontal component of the Coriolis force 
has magnitude 


Fhoriz — |_2m(w cos 6%) x v| = 2m(wcos6)u, (10.20) 
and it is perpendicular to v(t). Therefore, as far as the pendulum is concerned, it 
is located at the north pole of a planet called Terra Costhetica which has rotational 
frequency of w cos 0. 3 But as we saw above, the precessional frequency of a Foucault 
pendulum located at the north pole of such a planet is simply 


OF = wcosé, (10.21) 


in the clockwise direction. So that’s our answer. 


Second solution (In the pendulum’s frame): Let’s work in the frame of the 
plane that the Foucault pendulum sweeps through. Our goal is to find the rate of 
precession of this frame. With respect to a frame fixed on the earth (with axes X, 
y, and z as above), we know that this plane rotates with frequency œp = —oz if 
we’re at the north pole (6 = 0), and with frequency wp = 0 if we’re at the equator 
(0 = 1/2). So it’s a good bet that the general answer is œf = —w cos 8 Z, and that’s 
what we’ll now show. 

Working in the frame of the plane of the pendulum is useful, because we can take 
advantage of the fact that the pendulum feels no sideways forces in this frame, because 
otherwise it would move out of the plane (which it doesn’t, by definition). The frame 
fixed on the earth rotates with frequency œ = wcos6z + wsin Oy, with respect to 
the inertial frame. Let the pendulum’s plane rotate with frequency wp = wpz with 


3 As mentioned above, the setup isn’t exactly like the one on the new planet. There is also a vertical 


component of the Coriolis force for the pendulum on the earth, but this effect is negligible. 
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respect to the earth frame. Then the angular velocity of the pendulum’s frame with 
respect to the inertial frame is 


@ + oF = (woos + wf)? + wsin by. (10.22) 


To find the horizontal component of the Coriolis force in this rotating frame, we care 
only about the z part of this frequency. The horizontal Coriolis force therefore has 
magnitude 2m(w cos 0 + wp)v. But in the frame of the pendulum, there must be zero 
horizontal force, so this must be zero. Therefore, 


WF = —@cosé. (10.23) 


This agrees with Eq. (10.21), where we just wrote down the magnitude of wp. 


10.2.4 Azimuthal force: —m(dw/dt) xr 


In this section, we will restrict ourselves to the simple and intuitive case where w 
changes only in magnitude, that is, not in direction (this more complicated case 
is the subject of Problem 10.10). The azimuthal force may then be written as 


Faz = —moo x r. (10.24) 


This force is easily understood by considering a person standing at rest with 
respect to a rotating carousel. If the carousel speeds up, then the person must 
feel a tangential friction force at his feet if he is to remain fixed on the carousel. 
This friction force equals matan, where atan = ro is the tangential acceleration 
as measured in the ground frame. But from the person’s point of view in the 
rotating frame, he is not moving, so there must be some other mysterious force 
that balances the friction. This is the azimuthal force. Quantitatively, when @ 
is orthogonal to r, we have |ô x r| = r, so the azimuthal force in Eq. (10.24) 
has magnitude mro. This is the same as the magnitude of the friction force, as 
it should be. 

What we have here is exactly the same effect that we had with the translation 
force on the accelerating train. If the floor speeds up beneath you, then you must 
apply a friction force if you don’t want to be thrown backward with respect to the 
floor. If you shut your eyes and ignore the centrifugal force, then you can’t tell if 
you are on a linearly accelerating train, or on an angularly accelerating carousel. 
The translation and azimuthal forces both arise from the acceleration of the floor. 
(Well, for that matter, the centrifugal force does too.) 

We can also view things in terms of rotational quantities, instead of the lin- 
ear atan acceleration above. If the carousel speeds up, then a torque must be 
applied to the person if he is to remain fixed on the carousel, because his angu- 
lar momentum in the fixed frame increases. Therefore, he must feel a friction 


469 


470 


Fig. 10.10 


Accelerating frames of reference 


force at his feet. Let’s show that this friction force, which produces the change in 
angular momentum of the person in the fixed frame, exactly cancels the azimuthal 
force in the rotating frame, thereby yielding zero net force in the rotating frame. 
Since L = mr*w, we have dL/dt = mr? (assuming r is fixed). And since 
dL/dt = t = rF, we see that the required friction force is F = mro. And as we 
saw above, when @ is orthogonal to r, the azimuthal force in Eq. (10.24) also 
equals mro, in the direction opposite to the carousel’s motion. So the tangen- 
tial forces in the rotating frame do indeed cancel. This was basically the same 
calculation as the one above, but with an extra factor of r in the t = dL/dt 
equation. 


Example (Spinning ice skater): We have all seen ice skaters increase their angu- 
lar speed by bringing their arms in close to their body. This can be understood in terms 
of angular momentum; a smaller moment of inertia requires a larger w in order to keep 
L constant. But let’s analyze the situation in terms of fictitious forces. We’ ll idealize 
things by giving the skater massive hands at the ends of massless arms attached to a 
massless body.* Let the hands have total mass m, and let them be drawn in radially. 

Look at things in the skater’s frame (which has an increasing w), defined by the 
vertical plane containing the hands. The crucial thing to realize here is that the skater 
always remains in the skater’s frame (a fine tautology, indeed). Therefore, the skater 
must feel zero net tangential force in her frame, because otherwise she would accel- 
erate with respect to it. Her hands are being drawn in by a muscular force that works 
against the centrifugal force, but there can be no net tangential force on the hands in 
the skater’s frame, by definition. 

What are the tangential forces in the skater’s frame? Let the hands be drawn in at 
speed v (see Fig. 10.10). Then there is a Coriolis force (in the same direction as the 
spinning) with magnitude 2mwv. There is also an azimuthal force with magnitude 
mro (in the direction opposite to the spinning, as you can check). Since the net 
tangential force is zero in the skater’s frame, we must have 


2mov = mro. (10.25) 


Does this relation make sense? Well, let’s look at things in the ground frame. The 
total angular momentum of the hands in the ground frame is constant. Therefore, 
d(mr*@) /dt = 0. Taking this derivative and using dr/dt = —v gives Eq. (10.25). 


A word of advice about using fictitious forces: Decide which frame you 
are going to work in (the lab frame or the accelerating frame), and then stick 
with it. The common mistake is to work a little in one frame and a little in the 
other without realizing it. For example, you might introduce a centrifugal force 
on someone sitting at rest on a carousel, but then also give her a centripetal 


4 This reminds me of a joke about a spherical cow. 
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acceleration. This is incorrect. In the lab frame, there is a centripetal acceleration 
(caused by the friction force) and no centrifugal force. In the rotating frame, 
there is a centrifugal force (which cancels the friction force) and no centripetal 
acceleration (because the person is sitting at rest on the carousel). In short, if you 
ever mention the words “centrifugal” or “Coriolis,” etc., then you had better be 
working in an accelerating frame. 


10.3 Tides 


The tides on the earth exist because the gravitational force from a point mass (or 
a spherical mass, in particular the moon or the sun) is not uniform; the direction 
of the force is not constant (the force lines converge to the source), and the 
magnitude is not constant (it falls off like 1/r”). On the earth, these effects cause 
the oceans to bulge around the earth, producing the observed tides. The study 
of tides is useful in part because tides are a very real phenomenon in the world, 
and in part because the following analysis gives us an excuse to use fictitious 
forces and play around with Taylor series approximations. Before considering 
the general case of tidal forces, let’s look at two special cases. 


Longitudinal tidal force 

In order to isolate the tidal effect, let’s consider a somewhat contrived setup 
instead of the earth-sun or earth-moon system (we’ll eventually get to these). 
Consider three masses in a line, as shown in Fig. 10.11. The right mass M is 
very large and exerts a gravitational force on mı and m2. But mı and m are 
sufficiently small so that they don’t exert much of a gravitational force on each 
other. Furthermore, assume m2 >> mı. Let R be the distance from M to m2, and 
let x be the distance from mı to m2. Assume x < R. 

The masses mı and mz accelerate radially inward toward M. In the acceler- 
ating reference frame of m2, what is the force on mı? To answer this, we must 
add the fictitious translation force on mı (which points to the left) to the actual 
gravitational force on m (which points to the right). So in the accelerating frame 
of mz, the net force on mı is 


GMm, . 
R- 
GMm, . GMm., 
(Rx) x x x. 


Fret = Ferav + Frrans = — max 


(10.26) 


This is equivalent to the statement that mz sees mı accelerate away with 
acceleration 


Fot GM GM 
m (R—x)? R 


= ay = ao, (10.27) 
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which is just the difference in accelerations you would intuitively expect. Using 
x X R to make suitable approximations in Eq. (10.26), we have 


GMm; GMm, GMm, 1 
Fret © 5 = = 2 l 
R4 — 2Rx R R 1 — 2x/R 
GMm, 


2GMm\x 
R3 i 


x (d +2x/R)— 1) = (10.28) 


R2 
This is, of course, simply x times the derivative of the gravitational force. It 
points to the right, so its effect is to increase the separation of the masses. 

If you are riding along on mz, and if a black box encloses mı and mz, then 
as far as you are concerned, you may as well be in a black box floating freely 
in outer space (from Einstein’s Equivalence Principle, discussed in Chapter 14). 
But if you are floating in outer space, and if you see mı accelerating away from 
you with acceleration Fnet/mı = 2GMx/R?, then you will naturally conclude 
that in your reference frame, there must be a mysterious force of 


2GMmx 


“Fida” = Fret = B 


(10.29) 
pulling mı away from you. This force is called the tidal force because it is what 
causes the tides, as we’ll see in detail below. Note that the tidal force is linear in 
the separation x and inversely proportional to the cube of the distance from the 
source. If x is negative, so that m is on the left side of mz, then the tidal force is 
negative, which means that mı accelerates leftward away from you. So the effect 
of the longitudinal tidal force is to increase the separation between the masses, 
independent of the sign of x. 

If you want to keep m; at rest with respect to you, then you will have to tie 
it down with a string, and the tension in the string will be 2GMm,x/R>.° Any 
experiment you do inside the black box will indicate that there is a force pulling 
mı away from you. The only actual force on m; is the gravitational force from M 
equal to GMm /(R — x)”, but because M gives mz nearly the same acceleration 
as mı, you perceive only the small Fida in Eq. (10.29). It is indeed small, because 
it is the actual gravitational force times the quantity 2x/R < 1. 


Transverse tidal force 

Consider now the setup shown in Fig. 10.12, with mz at the corner of a right trian- 
gle that has y < R. The masses m and mz accelerate radially inward toward M. 
In the accelerating frame of m2, what is the force on mı? As above, we must 
add the fictitious translation force on mı (which points to the left) to the actual 
gravitational force on mı (which points to the right and slightly downward). 


5 The tension in the string won’t affect the acceleration of m2, because m2 is assumed to be much larger 
than mı. So our accelerating frame is still defined to be the one that accelerates with acceleration 
a = GM/R?. This is where the mz >> mı assumption comes in. 
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In this case, however, the magnitudes of the gravitational accelerations of mı 
and m are essentially equal, because they are both a distance R from mass M, 
up to second-order effects in y/R (by the Pythagorean theorem). The direction is 
the only thing that is different, to first order in y/R. So in the accelerating frame 
of mz, the net force (which we’ll call the tidal force from now on) on m; is 


GMm\ R bien GMm, . 
Fiidal = Feray + Ferans © =o, (C080 xX -sin y) — R2 x 
GMm, . A GMmiy . 


where we have used cos0 ~% | and sinô ~ y/R. This difference is simply the 
y component of the force on mı, which is what you would expect. It points 
along the line joining the masses, and its effect is to pull them together. As in 
the longitudinal case, the transverse tidal force is linear in the separation and 
inversely proportional to the cube of the distance from the source. 


General tidal force 

We will now calculate the tidal force on a mass m located at an arbitrary point 
on a circle of radius r (for example, this circle could represent a cross section of 
the earth), due to a mass M located at the vector —R (so the vector from M to m 
is R + r); see Fig. 10.13. We will calculate the tidal force relative to the center 
of the circle. That is, we will find the net force on m in the accelerating frame 
whose origin is the center of the circle. As usual, assume r < R. And for now 
let’s ignore any orbital motion of the circle around M (although it turns out that 
this isn’t relevant to tides anyway; see the third remark below), so the circle just 
accelerates radially inward toward M. 

The gravitational force on m may be written as Foray = —GMm(R + r)/ 
IR + r|>. The cube is in the denominator because the vector in the numerator 
contains one power of the distance. As above, adding on the fictitious translation 
force due to the acceleration of the center of the circle (which is independent of 
whatever mass we put there) yields a tidal force of 


Fiar) —(R+r) -R 
GMm  |R+r} |R 


(10.31) 


This is the exact expression for the tidal force. However, it is rather useless. 
Let us therefore make some approximations in Eq. (10.31) and transform it into 
something technically incorrect (as approximations tend to be), but far more 
useful. The first thing we need to do is rewrite the |R + r| term. We have 


é This reminds me of a joke about two people lost in a hot-air balloon. 
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(using r < R and ignoring higher-order terms) 
IR+tr|)=/R+nr)-(R4n=VR24+7°42R-4r 
~ R,/1+2R- r/R? 


R-r 
| ea) (10.32) 
Therefore (again using r < R), 
Frida) R+r R 
G 3 Ds t Re 
GMm R(l+R-r/R2)3 R 
a R+r 4 R 
~ R3(143R-4/R2) R 
R+r 3R-r R 
N re (: - -z ) + g (10.33) 


Letting R=R /R, this finally simplifies to (once again using r < R) 


GMm ^a a 
RB GRR -r)-r). (10.34) 


Frida (r) © 


This is the general expression for the tidal force. We can put it in a simpler form 
if we let M lie on the positive x axis, which we can arrange for with a rotation 
of the axes. We then have R = —%, and so R- r = —x. Equation (10.34) then 
tells us that the tidal force on a mass m at position r = (x, y), due to a mass M at 
position (R, 0), equals 


GMm a a a GMm 
B (3xx — (xx + yy)) = FE (2x, —y). (10.35) 


Fida x, y) © 


This reduces properly in the longitudinal and transverse cases considered above. 
The tidal forces at various points on the circle are shown in Fig. 10.14. Since our 
setup with M on the x axis is invariant under rotations around the x axis, this 
picture of forces is also invariant. And because of the left-right symmetry, the 
picture would look the same if M were placed on the negative x axis. 

The potential energy associated with the tidal force in Eq. (10.35) is propor- 
tional to —x? + y?/2, because the negative gradient of this yields the (2x, —y) 
vector. Using x = r cos 0 and y = r sin 9, the potential can be written as 


GMmr? 
2R? 


GMmr? 
2R3 


Vadar, 0O) ~ (—2 cos?0 + sin?0) = (1 — 3 cos?8). (10.36) 
If the earth were a rigid body, then the tidal force would have no effect on it. 
But the water in the oceans is free to slosh around, so it bulges along the line 


from the earth to the moon, and also along the line from the earth to the sun. 
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And it also gets drawn in and forms a dip along the directions transverse to the 
moon and sun.’ We’ll see below that the moon’s effect is about twice the sun’s. 
As the earth rotates beneath the bulge and dip, a person on the earth sees them 
rotate in the other direction relative to the earth. From Fig. 10.14, we see that 
this produces two high tides and two low tides per day.® It’s actually not exactly 
two per day, because the moon moves around the earth. But this motion is fairly 
slow, taking about a month, so it’s a reasonable approximation to think of the 
moon as motionless. 

Note that it is not the case that the moon pushes the water away on the far side 
of the earth. It pulls on that water, too; it just does so in a weaker manner than it 
pulls on the rigid part of the earth. With tides, it’s not the force that matters, but 
rather the difference in force. Tides are a comparative effect. A child playing with 
a shovel and a pail on a beach will be able to do so only until the earth rotates 
to a point where the beach enters the region where the moon pulls on the water 
sufficiently more (or less) than it pulls on the rigid earth. 


Whether sieged by an army more grand, 
Or swamped by the tides in the sand, 
It’s a matter of course 

That the difference in force 

Is what leads to a castle’s last stand. 


REMARKS: 


1. Consider a mass on the surface of the earth. It turns out that the gravitational force from 
the sun on it is (much) larger than that from the moon, whereas the tidal force from the 
sun on it is (slightly) weaker than that from the moon. Quantitatively, the ratio of the 
gravitational forces is 


Fs GMs GMM 5.9 - 1073 m/s? 7 
n-(2 1/3 J ns (10.37) 
E,S E,M è 
And the ratio of the tidal forces is 
F GM: GM; 3.9. 10714 572 
ts F S 3 M\ _ -_ - x 0.45. (10.38) 
Fim Ris Rèm 37.10 7; 


7 There’s actually a lag effect as the earth rotates, due to various complicated things, so the bulge 
doesn’t point directly toward the moon or sun. But let’s not worry about that here. 

8 We’ve made the incorrect but fairly reasonable approximation that the moon lies in the plane of the 
earth’s equator. If you solve the problem exactly by taking into account the position of the moon 
relative to the equator, you will find that there is a piece of the tides (known as the diurnal tide) 
that has a period of one day, in addition to the piece we just found (the semidiurnal tide) with a 
period of half a day. See Horsfield (1976). If you want to derive this result yourself, a hint is that 
you’ ll need to use Eqs. (10.36) and (B.7). 
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Depending on the relative positions of the sun and moon with respect to the earth, their 
tidal effects may add (when the three bodies are collinear; this is called a “spring” tide), 
or they may partially cancel (when the sun and moon make a 90° angle in the sky; this is 
called a “neap” tide). 


. Equation (10.38) shows that the moon’s tidal effect is roughly twice the sun’s. This has 


the following interesting implication about the densities of the moon and sun. The tidal 
force from, say, the moon is proportional to 


GMM G(42r3,) em m \? 
( - = ( G ru) œ PM (= ~ pe, (10.39) 
Rem Rem E,M 


where ĝm is half of the angular size of the moon in the sky. Likewise for the sun’s tidal 
force. But it just so happens that the angular sizes of the sun and the moon are essentially 
equal, as you can see by looking at them (preferably quickly in the case of the sun, and 
through some thick haze), or by noting that total solar eclipses barely exist. Therefore, the 
combination of Eq. (10.38) and Eq. (10.39) tells us that the moon’s density is about twice 
the sun’s. It is quite remarkable that, at least roughly, you can experimentally determine 
the ratio of the densities of two celestial bodies by spending a few weeks at the beach. 


. There is one effect that we’ve glossed over, but fortunately it turns out not to be relevant. 


In the above analysis, we derived the tidal force by subtracting the fictitious translation 
force from the gravitational force. But in the actual case of the earth, the fictitious force 
is generally interpreted as a centrifugal force, because the earth is rotating around the 
earth-moon CM, and this CM is also rotating around the sun. And, of course, the earth is 
rotating around its axis. Basically, there’s a lot of rotation going on. 

Now, because the centrifugal force (from the sum of all the rotations) depends on 
position, there is a difference between the centrifugal force at the center of the earth and 
at a point on the surface. Shouldn’t this difference come into play when calculating the 
net force in the accelerating frame whose origin is the center of the earth? Well, yes ... 
and no... 

There are many ways of breaking up the earth’s motion, but for the present purposes the 
most useful way is the following. The motion of any object can be described as the sum 
of the translation of a nonrotating frame (for example, the seat on a ferris wheel), plus a 
rotation around a point in that frame (the seat on an old-fashioned ferris wheel wouldn’t 
have any such rotation, but the seat on a more modern scary kind of amusement park ride 
undoubtedly would). 

The first of these two motions accounts for the motion of the earth as a whole around 
the earth-moon CM and also for the motion of this CM around the sun.’ The only relevant 
fictitious force here is the translation force (at any instant in this motion, every atom in the 
earth has the same acceleration vector, so there is no need to think in terms of rotation), 
and this translation force is what we used in the above derivation of the tidal force.!° The 
second of the two motions accounts for the rotation of the earth on its axis. This most 
certainly does create a centrifugal force difference between the center of the earth and a 
point on the surface, but this difference doesn’t contribute to the tides; it contributes only 
to the uniform bulge of the earth at the equator. Since the force is uniform, it doesn’t cause 
water to slosh around, so it therefore doesn’t have anything to do with the tides. 


? Just imagine grabbing the earth with a very large hand and sliding it around without twisting your 
hand, so that a given point on the earth always has the same view of stars in the sky. 

10 f you insist on thinking in terms of circular motion as you slide the earth in a circle around the sun 
without twisting your hand, you can indeed view every point in the earth as moving in a circle. But 
all of these circles have the same radius; they simply have offset centers. The centrifugal forces 
are therefore all the same (in both magnitude and direction), and so there are no differences to 
worry about. 
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So the answers to the above question are: Yes, because the difference really does 
contribute to the net force. But no, because it doesn’t contribute to the tides, which is 
what we’re concerned with here. 

If you work out the actual numbers, you’ll see that the centrifugal force completely 
dominates the tidal force. This is consistent with the fact that the equatorial bulge is on the 
order of kilometers, whereas the tides are on the order of meters. But the point is that the 
centrifugal force doesn’t affect the tides because it is uniform around the earth. The tidal 
force, on the other hand, has the nonuniform shape in Fig. 10.14. The football-shaped 
bulge it creates therefore moves around relative to the earth as the earth rotates beneath it. 
To sum up: with the tidal force (which we define as the force that produces the tides), we 
are not concerned with the total difference in force between a point on the earth and the 
center, but rather only with the part of the difference that isn’t invariant under rotations 
ofthe earth & 


10.4 Problems 


Section 10.2: The fictitious forces 


10.1. Which way down? x 


You are floating high up in a balloon, at rest with respect to the earth. 
Give three quasi-reasonable definitions for which point on the ground 
is right “below” you. 


10.2. Longjumping in geff * 


Je 


If a longjumper can jump 8 meters at the north pole, how far can he 
jump at the equator? Assume that geff is 0.5% less at the equator than ö 
at the north pole (although this is only approximate). Ignore effects of K Cs 
wind resistance, temperature, and runways made of ice. 


10.3. Zeff VS. g * 
For what angle 0 (down from the north pole) is the angle between geff Gi 
and g maximum? 


10.4. Lots of circles « 


Fig. 10.15 
(a) Two circles in a plane, Cı and C2, each rotate with frequency 


æ relative to an inertia frame. The center of C4 is fixed in the 
inertial frame, and the center of C2 is fixed on C1, as shown 
in Fig. 10.15. A mass is fixed on C2. The position of the mass 2 
relative to the center of Cı is R(t). Find the fictitious force felt 
by the mass. 

(b) N circles in a plane, C;, each rotate with frequency œw relative to 
an inertia frame. The center of C4 is fixed in the inertial frame, C2 
and the center of C; is fixed on C;_, (fori = 2,...,N), as shown 
in Fig. 10.16. A mass is fixed on Cy. The position of the mass Cı 
relative to the center of C4 is R(ż). Find the fictitious force felt 
by the mass. Fig. 10.16 
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10.5. 


10.6. 


10.7. 


10.8. 


10.9. 


10.10. 


Mass on aturntable « 


A mass is at rest with respect to the lab frame, while a frictionless 
turntable rotates beneath it. The frequency of the turntable is w, and 
the mass is located at radius r. In the frame of the turntable, find the 
forces acting on the mass, and verify that F = ma. 


Released mass « 


A mass is bolted down to a frictionless turntable. The frequency of 
rotation is w, and the mass is located at radius a. The mass is then 
released. Viewed from an inertial frame, it travels in a straight line. 
In the rotating frame, what path does the mass take? Specify r(t) and 
A(t), where 0 is the angle with respect to the initial radius, as mea- 
sured in the rotating frame. Solve this by working in the inertial frame. 
(Exercise 10.25 deals with the more difficult task of working in the 
rotating frame.) 


Coriolis circles « 


A puck slides with speed v on frictionless ice. The surface is “level” 
in the sense that it is orthogonal to geff at all points. Show that the 
puck moves in a circle, as seen in the earth’s rotating frame. What is 
the radius of the circle? What is the frequency of the motion? Assume 
that the radius of the circle is small compared with the radius of the 
earth. 


t = dL/dt xx 

In Section 8.4.3, we derived the three conditions under which it is 
valid to write )> t?t = dL/dt. Rederive these conditions by work- 
ing entirely in the (possibly) accelerating frame. As in Section 8.4.3, 
assume that the frame isn’t rotating (so at most, the origin is 
accelerating). 


Determining your frame +» 


Imagine that you are on a large rotating disk, with œ perpendicular to 
the disk at its center. Assume that you know that the center of the disk is 
fixed, and that no real forces act on anything on the disk (except gravity 
pulling down, which is canceled by the normal force). Assume that œ 
changes only in magnitude, and that this rate of change is constant. Is it 
possible for you to determine w and dw/dt, and also the location of 
the center of the disk, by performing experiments in the rotating frame 
only? 


Changing w’s direction x+» 
Consider the special case where a reference frame’s œ changes only 
in direction (that is, not in magnitude). In particular, consider a cone 


10.11. 


10.12. 


10.4 Problems 


rolling on a table, which is a natural example of such a situation. Let the 
origin of the cone frame be the tip of the cone. This point remains fixed 
in the inertial frame. The instantaneous @ for a rolling cone points 
along its line of contact with the table, because these are the points that 
are instantaneously at rest. This line precesses around the origin. Let 
the frequency of the precession be Q. 

In order to isolate the azimuthal force, paint a dot on the surface of 
the cone (call it point P), and consider the moment when the dot lies 
on the instantaneous @ (see Fig. 10.17). From Eq. (10.11), we see that 
there are no contributions from the centrifugal force (because P lies 
on @), the Coriolis force (because P is not moving in the cone frame), 
or the translation force (because the tip of the cone is fixed). The only 
remaining fictitious force is the azimuthal force, and it exists due to 
the fact that œ is changing. Equivalently, it arises from the fact that P 
is accelerating up away from the table. 


(a) Find the acceleration of P. 

(b) Use Eq. (10.11) to calculate the azimuthal force on a mass m 
located at P, and show that the result is consistent with the 
acceleration you found in part (a). 


Unwinding string «««« 

A wheel with radius R is placed flat on a table. A massless string with 
one end attached to the rim of the wheel is wrapped clockwise around 
the wheel a large number of times. When the string is wrapped com- 
pletely around the wheel, a point mass m is attached to the free end 
and glued to the wheel. The wheel is then made to rotate with con- 
stant angular speed w. At some point, the glue on the mass breaks, 
and the mass and string gradually unwind (with the speed of the wheel 
kept constant at w by a motor, if necessary). Show that the length 
of the unwound string increases at the constant rate Rw, for both the 
clockwise and counterclockwise directions for w. (The latter is the 
tricky one.) 


Shape of the earth «x«« 


The earth bulges slightly at the equator, due to the centrifugal force in 
the earth’s rotating frame. The goal of this exercise is to find the shape 
of the earth, first incorrectly, and then correctly. 


(a) The common incorrect method is to assume that the gravita- 
tion force from the slightly nonspherical earth points toward the 
center, and to then calculate the equipotential surface (incor- 
porating both the gravitational and centrifugal forces). Show 
that this method leads to a surface whose height (relative to a 


Fig. 10.17 
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spherical earth of the same volume) is given by 


2 

h(0) =R (=) (3 sin?6 — 2), (10.40) 

6g 
where 8 is the polar angle (the angle down from the north pole), 
and R is the radius of the earth. 

(b) The above method is incorrect, because the slight distortion of 
the earth causes the gravitational force to not point toward the 
center of the earth (except at the equator and the poles). This tilt 
in the force direction then changes the slope of the equipotential 
surface, and it turns out (although this is by no means obvious) 
that this effect is of the same order as the slope of the surface 
found in part (a). Your task: Assuming that the density of the 
earth is constant,'' and that the correct height takes the form 
of some constant factor f times the result found in part (a), !? 
show that f = 5/2.'° Do this by demanding that the potential at 
a pole equals the potential at the equator. Feel free to do things 
numerically. 


10.13. Southward deflection «««« 


A ball is dropped from height A (small compared with the radius 
of the earth) at a polar angle 6. Assume (incorrectly) that the earth 
is a perfect sphere. Show that a second-order Coriolis effect leads 
to a southward deflection (in the northern hemisphere) equal to 
(2/3)(w7h*/g) sin 0 cos 6.!4 

It turns out that the actual southward deflection is larger than this; 
it equals 4(w7h*/g) sin 8 cos. So apparently there are other effects 
at work. The main task of this problem is to show how the factor of 
2/3 turns into a factor of 4. In what follows, we will keep terms up to 
order œ? (or technically w*R/g) and order A/R. Also, it will be easiest 
to calculate southward distances relative to the point on the ground 


This isn’t true, but it’s a very difficult problem to solve with a nonconstant density, assuming that 
we even know what the density is as a function of radius. 

Satellite data show that the general shape found in Eq. (10.40) is essentially correct, up to some 
factor. This is known as the quadrupole shape; the (3 sin?@ — 2) term is often written as (1 — 
3 cos?0). I can’t think of a clean theoretical way to justify the assumption that the shape is of this 
form, so let’s just accept it. It still makes for a very nice problem. 

If AA is the difference between the equatorial and polar radii, then this factor of 5/2 takes the 
incorrect Ah of about 11 km (as you can show) in Eq. (10.40) and turns it into about 28 km, which 
is reasonably close to the observed value of 21.5 km. The discrepancy arises from the fact that the 
density of the earth isn’t constant; it decreases with the radius. 

By this we mean that the Coriolis effect causes the ball to land this far south of the point where a 
plumb bob (hanging from where the ball is dropped) touches the ground. Measuring the deflection 
relative to the point on the ground along the radius to the dropping point would be impractical, 
because there is no way of knowing where the center of the earth is. 


10.4 Problems 


along the radius to the dropping point; call this point P. But our final 
goal will be to determine the southward distance relative to a hanging 
plumb bob. 


(a) Show that the distance between the plumb bob and P is 
(w?Rh/g) sin @ cos 6(1 — h/R). 

(b) The fact that the gravitational force decreases with height implies 
that the ball takes more time than the standard ,/2//g to hit the 
ground. Show that the time equals ./2h/g(1 + 5//6R). 

(c) Lety be the height above the ground, and let z be the southward 
distance away from the radial line to the dropping point. Show 
that the centrifugal force yields a southward acceleration away 
from the radial line equal to 7 = œ? (R + y) sin@ cos 6. 

(d) Show that the gravitational force yields a northward acceleration 
back toward the radial line equal to Z = —g(z/R). 

(e) Combine parts (b), (c), and (d) to show that the centrifugal and 
gravitational forces lead to a southward deflection away from P 
equal to (w*Rh/g) sin 8 cos 6(1 + 7h/3R). Adding on the above 
Coriolis effect and subtracting off the position of the plumb bob 
then quickly yields the desired factor of 4. (This problem is based 
on Belorizky and Sivardiere (1987).) 


Section 10.3: Tides 


10.14. 


10.15. 


Bead on a hoop «« 


A bead of mass m is constrained to move on a frictionless hoop of radius 
r that is located a distance R from an object of mass M. Assume that 
R > r, and assume that M is much larger than the mass of the hoop, 
which is much larger than m. 


(a) If the hoop is held fixed and the bead is released from a point 
close to the rightmost point, as shown in Fig. 10.18, what is the 
frequency of small oscillations? 

(b) If the hoop is released and the bead starts at a point close to 
the rightmost point, what is the frequency of small oscillations? 
Assume that you grab M and move it to the right to keep it a 
distance R from the hoop. (The time scale of oscillations turns out 
to be on the same order as the time scale of the hoop reaching M, 
if M stays fixed.) 


Precession of the equinoxes xxx 


Because the earth bulges at the equator, and because its axis of rotation 
is tilted with respect to the plane of the ecliptic (the plane containing 
the sun and (nearly) the moon), the tidal forces from the sun and the 
moon produce a torque on the earth, which causes the axis of rotation 
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to precess. The rate of precession is slow; the period is about 26 000 
years. Derive (approximately) this result in the following way.'° 

We’ll be making some rough approximations here, but our goal is 
simply to understand intuitively what’s going on, and to get an answer 
that’s in the right ballpark. Assume that: (1) the axis of the earth is tilted 
at 23° with respect to the plane of the ecliptic; (2) the equatorial bulge 
can be approximated by two point masses located on the equator, at the 
highest and lowest points relative to the plane of the ecliptic; (3) the 
mass of each of these effective point masses comes from a patch on 
the earth with an area of, say, r?, where r is the radius of the earth 
(this is just a guess); (4) the patch has an essentially constant height of 
h ~ 21 km, which is the difference between the equatorial and polar 
radii; (5) the mass density of the patch is roughly the average density 
of the earth (which isn’t true); (6) the earth spends half of its time in 
the summer/winter orientation, and half in the spring/fall orientation; 
(7) the moon’s tidal effect is twice the sun’s. Numerical values of 
various constants can be found in Appendix J. 


10.5 Exercises 


Section 10.2: The fictitious forces 


10.16. Swirling down a drain + 


Does the Coriolis force cause the rotation you often see in water going 
down a drain? That is, does the water rotate in different directions in 
the northern and southern hemispheres? A rough order-of-magnitude 
argument is sufficient. See Shapiro (1962) for a discussion of this. 


10.17. Magnitude of getr * 
Consider (unrealistically) a perfectly spherical rotating planet whose g 
value is constant over the surface. What is the magnitude of geff as a 
function of 6? Give your answer to the leading-order correction in w. 


10.18. Oscillations across the equator « 


Consider (unrealistically) a perfectly spherical rotating planet whose g 
value is constant over the surface. A bead lies on a frictionless wire that 
lies in the north-south direction across the equator. The wire takes the 
form ofan arc ofa circle; all points are the same distance from the center 
of the earth. The bead is released from rest, a short distance from the 


15 It’s possible to derive this by taking into account the exact shape of the earth, but this is very 
complicated. It’s also possible to derive it to a good approximation by treating the earth’s equatorial 
bulge as a thin ring of mass at the equator, but this is likewise fairly involved; see Haisch (1981). 
So we’ll use a simplified point-mass model, which will still give a reasonable answer. 


10.19. 


10.20. 


10.21. 


10.22. 


10.5 Exercises 


equator. Because geff does not point directly toward the earth’s center, 
the bead will head toward the equator and undergo oscillatory motion. 
What is the frequency of these oscillations? 


Circular pendulum « 


Consider a circular pendulum of mass m and length £, as shown in 
Fig. 10.19. The mass swings around in a horizontal circle with the 
(massless) string always making an angle 0 with the vertical. Find the 
angular speed, w, of the mass. Solve this in: 


(a) The lab frame. Draw the free-body diagram for the mass, and 
write down the F = ma equations in the vertical and horizontal 
directions. 

(b) The rotating frame of the pendulum. Draw the free-body diagram 
for the mass, and write down the F = ma equations in the vertical 
and horizontal directions. 


Spinning bucket +» 


An upright bucket of water is spun at frequency w around its vertical 
symmetry axis. If the water is at rest with respect to the bucket, find 
the shape of the water’s surface. 


Corrections to gravity +x 


A mass is dropped from a point directly above the equator. Consider 
the moment when the object has fallen a distance d. If we consider only 
the centrifugal force, then you can quickly show that the correction to 
er at this point (relative to the release point) is an increase by wd. 
There is, however, also a second-order Coriolis effect. What is the sum 
of these corrections? !° 


Bug ona hoop +» 


A hoop of radius R is made to rotate at constant angular speed w around 
a diameter, as shown in Fig. 10.20. A small bug of mass m walks at 
constant angular speed Q around the hoop. Let F be the total force that 
the hoop applies to the bug when the bug is at the angle 0 shown, and 
let F| be the component of F that is perpendicular to the plane of the 
hoop. Find F, in two ways (ignore gravity in this problem): 

(a) Work in the lab frame: At the angle 6, find the rate of change of 
the bug’s angular momentum around the rotation axis, and then 
consider the torque on the bug. 

(b) Work in the rotating frame of the hoop: At the angle 0, find the 
relevant fictitious force, and then take it from there. 


16 The value of g also varies with height, and this produces an increase in geff equal to g(2d/R). 


You can show that this is much larger than the above centrifugal and Coriolis effects. 


483 


Fig. 10.19 


Fig. 10.20 


484 


Accelerating frames of reference 


10.23. 


10.24. 


10.25. 


10.26. 


10.27. 


Maximum normal force «« 


A frictionless hoop of radius R is made to rotate at constant angular 
speed w around a diameter. A bead on the hoop starts on this diame- 
ter and is then given a tiny kick. Let N be the total force that the hoop 
applies to the bead, and let N, be the component of N that is perpendic- 
ular to the plane of the hoop. Where is N} maximum? What is the mag- 
nitude of N as a function of position? (Ignore gravity in this problem.) 
Projectile with Coriolis +» 

Atapolar angle 0, a projectile is fired eastward at an inclination angle œ 
above the ground. Find the westward and southward deflections due to 
the Coriolis force. In terms of 6, what angle @max yields the maximum 
total distance of deflection? What is @max when 6 equals 60°, 45°, and 
(approximately) 0°? What about values of 0 larger than 60°? 
Free-particle motion +» 

A particle slides on a frictionless turntable that rotates counterclock- 
wise with constant frequency w. When viewed in an inertial frame, the 


particle simply travels in a straight line. But in the rotating frame of 
the turntable, show that the F = ma equations take the form, 


¥ = wx + 2w), 
Ï = wy — 20%, (10.41) 


and verify that the solutions to these differential equations are!’ 


x(t) = (A+ Bt) cos æt + (C + Dt) sin at, 
y(t) = —(A4 Bt) sin œt + (C + Dt) cos at. (10.42) 


Coin on a turntable xxx 


A coin stands upright at an arbitrary point on a rotating turntable, and 
spins (without slipping) at the required angular speed to make its cen- 
ter remain motionless in the lab frame. In the frame of the turntable, 
the coin rolls around in a circle with the same frequency as that of the 
turntable. In the frame of the turntable, show that 

(a) F = dp/dt, and 

(b) t = dL/dt (Hint: Coriolis). 
Precession viewed from rotating frame ++» 


Consider a top made of a wheel with all its mass on the rim. A massless 
rod (perpendicular to the plane of the wheel) connects the CM to a pivot. 


17 Tf you want, you can derive these solutions in the spirit of Chapter 4 by guessing exponential 


solutions to the F = ma equations. You will find that there are degenerate solutions, which lead 


to the (4 + Bt) type terms that we saw in the critically damped case in Section 4.3. 


10.5 Exercises 


Initial conditions have been set up so that the top undergoes precession, 
with the rod always horizontal. In the language of Fig. 9.30, we may 
write the angular velocity of the top as w = QÊ + w’x3 (where $; is 
horizontal here). 

Consider things in the frame rotating around the z axis with angular 
speed Q. In this frame, the top spins with angular speed w” around its 
fixed symmetry axis. Therefore, in this frame we must have t = 0, 
because L is constant. Verify explicitly that t = 0 (calculated with 
respect to the pivot) in this rotating frame (you will need to find the 
relation between w’ and Q). In other words, show that the torque 
due to gravity is exactly canceled by the torque due to the Coriolis 
force (you can quickly show that the centrifugal force provides no net 
torque). 


Section 10.3: Tides 


10.28. 


10.29. 


10.30. 


Maximum tangential force x 


At what angle from the horizontal is the tangential component of the 
tidal force in Eq. (10.35) maximum? 


Bead on a hoop +x 


A bead of mass m is constrained to move on a frictionless hoop of 
radius r that is located a distance R from an object of mass M. Assume 
R > r, and assume that M is much larger than the mass of the hoop, 
which is much larger than m. 


(a) If the hoop is held fixed and the bead is released from the top 
point, as shown in Fig. 10.21, what is its speed when it gets to 
the rightmost point on the hoop? 

(b) If the hoop is released and the bead starts at a point just slightly 
to the right of the top point, what is its speed with respect to the 
hoop when it gets to the rightmost point on the hoop? Assume 
that you grab M and move it to the right to keep it a distance R 
from the hoop. 


Facing the planet +» 


A bead of mass m is constrained to move on a frictionless hoop of radius 
r that orbits a planet of mass M, a distance R from it. Initial conditions 
have been set up so that the plane of the hoop is always perpendicular 
to the line from it to the planet. Assume R > r, and assume that the 
mass of the hoop is much larger than m. Draw the force lines in the 
reference frame of the hoop (in the spirit of Fig. 10.14). If the bead is 
released from a point close to the front point on the hoop, what is the 
frequency of small oscillations? 
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10.31. Roche limit +» 


A small spherical rock covered with sand falls in radially toward a 
planet. Let the planet have radius R and density pp, and let the rock 
have density pr. It turns out that when the rock gets close enough to 
the planet, the tidal force ripping the sand off the rock will be larger 
than the gravitational force attracting the sand to the rock. The cutoff 
distance is called the Roche limit.!* Show that it is given by (note the 
lack of dependence on the rock’s radius) 


29, \ 1/3 
a=r(*2) . (10.43) 
pr 


10.32. Roche limit with rotation x» 


If an object orbits a planet in a nonspinning manner (such as the seat 
of a ferris wheel), then the Roche limit is the same as for the radially 
falling object in the previous exercise (as you can show). However, 
show that if an object orbits a planet in such a way that the same side 
always faces the planet, then the Roche limit is given by 


30, \ 1/3 
a=r(%) ; (10.44) 
p: 


10.6 Solutions 


10.1. 


10.2. 


Which way down? 


There are actually (at least) four possible definitions for the point “below” you on the 
ground: (1) the point that lies along the line between you and the center of the earth, 
(2) the point that lies along the direction of the earth’s gravitational force, (3) the 
point where a hanging plumb bob rests (that is, the point that lies along the direction 
of the effective gravitational force), and (4) the point where a dropped object hits the 
ground. 

The third definition is the most reasonable, because it defines the upward direction 
in which buildings are constructed. At any rate, the third and fourth definitions are the 
only ones you can make any practical use of. The third differs from the second due 
to the centrifugal force, which makes geff point in a slightly southward direction (in 
the northern hemisphere) relative to the gravitational force g. It additionally differs 
from the first due to the fact that g isn’t radial (see the first remark at the end of 
Section 10.2.2). And it differs from the fourth due to the Coriolis force, which causes 
a falling object to be deflected slightly eastward. Note that all four definitions are 
equivalent at the poles. And the first three are equivalent at the equator. 


Longjumping in geff 
Let the jumper take off with speed v, at an angle 0. The time to the top of the motion 
is given by geff (t/2) = vsin, so the total time is £ = 2v sin O/gege. The distance 


'8 The Roche limit gives the radial distance below which loose objects won’t collect into larger blobs. 


Our moon (which is a sphere of rock and sand) lies outside the earth’s Roche limit. But Saturn’s 


rings (which consist of loose ice particles) lie inside its Roche limit. 


10.3. 


10.6 Solutions 


traveled is therefore the standard 
2v? sin 0 cos 0 = v? sin 20 
Seff Seff ` 


d=v,t = vt cos 0 = 


(10.45) 


This is maximum when @ = 7/4, as we well know. So we see that d « 1/gegr. 
Taking geff © 10 m/s? at the north pole, and geg + (10 — 0.05) m/s? at the equator, 
we find that the jump at the equator is approximately 1.005 times as long as the one 
at the north pole. So the longjumper gains about four centimeters. This would be 
completely washed out by even the tiniest wind effect. 


Remark: For a longjumper, the optimal angle of takeoff is undoubtedly not 2/4. 
The act of changing the direction abruptly from horizontal to such a large angle would 
entail a significant loss in speed. The optimal angle is some hard-to-determine angle 
less than 2/4. But this won’t change our general d « 1/ger¢ result (which follows 
from dimensional analysis). However, we’ve also made the assumption that the CM 
of the longjumper starts and ends at the same height, which is definitely not true in 
longjumping; it is lower at the end. This in fact does change the d « 1/gegr result. 
But using the result from Problem 3.17, we see that the effect is small (using values 
ofhilmandv*10m/s). & 


Leff VS. g 

The forces mg and Feent are shown in Fig. 10.22. The magnitude of Feent is 
mRo sin @, so the component of Feet perpendicular to mg is mRo sin cos@ = 
mRa? (sin 20) /2. For small Feent, maximizing the angle between geff and g is equiva- 
lent to maximizing this perpendicular component. Therefore, we obtain the maximum 
angle when sin 20 = 1 => 0 = 7/4. The maximum angle turns out to be 


$ © tang ~ (mRw* (sin 1/2)/2)/mg = Ra? /2g © 1.7- 1073, (10.46) 


which is about 0.1°. Because the earth bulges at the equator, the distance from the 
axis isn’t exactly R sin 0, the angle of g isn’t exactly 0 (see the first remark at the end 
of Section 10.2.2), and the magnitude of g isn’t exactly constant over the surface of 
the earth. But these effects are negligible, and the optimal 8 is still essentially 7/4. 


REMARK: The above solution is an approximate one which is valid only when the 
magnitude of Feent is much smaller than mg. We’ll now give an exact solution which 
is valid when the magnitude of Feent is comparable to mg, but which is valid only in 
the unrealistic case where the planet is a perfect sphere, even though it is spinning. 
No planet would ever take a spherical shape, because planets aren’t rigid. But you 
could imagine a large spherical rock. 

To solve the problem exactly, we can break Fyent into components parallel and 
perpendicular to g and make use of the parallel component, in addition to the perpen- 
dicular component we used above. If ġ is the angle between geff and g, then from 
Fig. 10.22 we have 


mRa” sin 8 cos 0 


tang = (10.47) 


mg — mRo sin?0 ` 
We can then maximize ¢ by taking a derivative. But we need to be careful if Ro? > g, 
in which case maximizing ġ doesn’t mean maximizing tan @. You can work this out, 
and we’ll instead give the following slick geometric solution. 

In Fig. 10.23, draw the Feent vectors for various 6, relative to mg (so we’ve chosen 
mg always to be vertical in this figure, in contrast with Feent always being horizontal 
in Fig. 10.22). Since the lengths of the Fcent vectors are proportional to sin 0, you 
can show that the tips of the Fen: vectors form a circle. The maximum ¢ is therefore 
achieved when geff is tangent to this circle, as shown in Fig. 10.24. In the limit where 
g > Ro?” (that is, in the limit of a small circle), we want the point of tangency to 


Fig. 10.22 


Fig. 10.23 
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be the rightmost point on the circle, so the maximum ¢ is achieved when 0 = 7/4, 
in which case ¢ © sing ~ (Ro? /2)/g, as we found above. But in the general case, 
Fig. 10.24 shows that the maximum ¢ is given by 


5mRa* 
(10.48) 


sin = = 
Pmax mg — 5mRo 

In the limit of small œw, this is approximately Rw*/2g, as above. Note that this 
reasoning holds only if Rw? < g. In the case where Rw” > g (that is, the circle 


extends above the top end of the mg segment), the maximum ¢ is simply 7, and it is 
achieved at 0 = 7/2. & 


10.4. Lots of circles 


(a) The fictitious force, Fr, on the mass has an Fen part and an Frans part, because 
the center of C2 is moving. So the fictitious force is 


Fr = mor? + Frans, (10.49) 


where rp is the position of the mass in the frame of C2. But Ftrans, which arises 
from the acceleration of the center of C2, equals the centrifugal force felt by a 
point on C1. Therefore, 


Frans = mor), (10.50) 


where r is the position of the center of C2 in the frame of C1. Substituting this 
into Eq. (10.49) gives 


Fr = mo” (r2 +rj)= ma R(t). (10.51) 


(b) The fictitious force, Ff, on the mass has an Feent part and an Ftrans part, because 
the center of the Nth circle is moving. So the fictitious force is 


Fr = mor ty + Frrans.v, (10.52) 


where ry is the position of the mass in the frame of Cy. But Firans,y equals the 
centrifugal force felt by a point on the (N — 1)th circle, plus the translation force 
coming from the movement of the center of the (V — 1)th circle. Therefore, 


Frrans.v = mary + Frrans,v-1- (10.53) 


Substituting this into Eq. (10.52) and successively rewriting the Fyrans,; terms 
in a similar manner, gives 


Fe = ma (ry +ty-1 + +r) = mo RGA). (10.54) 


This turns out so clean because F ent is linear in r, and also because all the w’s 
are the same. 


REMARK: A much easier way to see that Ff = mw? R (4) is the following. Since 
all the circles rotate with the same w, they may as well all be glued together. 
Such a rigid setup does indeed yield the same w for all the circles, just as in the 
case of the moon rotating once on its axis for every revolution it makes around 
the earth, thereby causing the same side to always face the earth. It is then clear 
that the mass simply moves in a circle at frequency w, yielding a fictitious 
centrifugal force of mw*R(t). And as a bonus, we see that the magnitude of 
R(t) is constant. & 


10.5. Mass on turntable 


In the lab frame, the net force on the mass is zero, because it is sitting at rest. (The 
normal force cancels the gravitational force.) But in the rotating frame, the mass 
travels in a circle of radius r with frequency œw. So its speed is v = wr. Therefore, in 
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the rotating frame there must be a net force of mv? /r = mw*r inward to account for 


the centripetal acceleration. And indeed, the mass feels a centrifugal force of mw*r 
outward, and a Coriolis force of 2mwv = 2mw2r inward, which sum to the desired 
force (see Fig. 10.25). 


Foor = 2mo2r 


REMARK: The net inward force in this problem is a little different from the force on a © 
person swinging around in a circle of radius r and frequency w in an inertial frame. If 
a skater, for example, maintains a circular path by holding a rope whose other end is 
attached to a pole, then she has to use her muscles to maintain the position of her torso V=or 
with respect to her arm, and her head with respect to her torso, etc. But if a person 
takes the place of the mass in the present problem, then she doesn’t need to exert any 
effort at all to keep her body moving in the circle (which is clear, when viewed from Fig. 10.25 
the inertial frame), because each atom in her body is moving at (essentially) the same 

speed and radius, and therefore feels the same centrifugal and Coriolis forces. So she 

doesn’t really feel the net force of mw*r, in the same sense that someone doesn’t feel 

gravity when in free-fall with no air resistance, because gravity acts on each bit of 

mass in the same way. (As mentioned on page 462, this similarity with gravity is 

what led Einstein to his Equivalence Principle.) & 


10.6. Released mass 


Let the x’ andy’ axes of the rotating frame coincide with the x and y axes of the inertial 
frame at the moment the mass is released (at £ = 0). Let the mass initially be located 
on the x’ axis. Then after a time f, the situation looks like that in Fig. 10.26. The speed 
of the mass is v = aa, so it has traveled a distance awt. The angle that its position 
vector makes with the inertial x axis is therefore tan—! wt, with counterclockwise 
taken to be positive. Hence, the angle that its position vector makes with the rotating 
x’ axis is 


(| tan! 


ot 


O(t) = —(wt — tan! ot). (10.55) P 
And the radius is 1 
r(t) = ay 1 + t. (10.56) 


For large t, we have r(t) ~ awt and 8 (t) ~ —wt + 2/2, which make sense because 
the particle approaches an inertial-frame angle of x /2. 


Fig. 10.26 


10.7. Coriolis circles 


By construction (with the surface being orthogonal to geff at all points), the normal 
force from the ice exactly cancels all effects of the gravitational and centrifugal forces 
in the rotating frame of the earth. We therefore need only concern ourselves with the 
Coriolis force, —2mw x V. 

Let the angle down from the north pole be 6. We’re assuming that the circle is 
small enough so that @ is essentially constant throughout the motion. The compo- 
nent of the Coriolis force that points horizontally along the surface has magnitude 
f = 2mv(wcos 6) and is perpendicular to the direction of motion. (The vertical com- 
ponent of the Coriolis force, which comes from the component of that points along 
the surface, simply modifies the required normal force.) Because this force is per- 
pendicular to the direction of motion, v does not change. Therefore, f = 2mvuw cos 0 
is constant. But a constant force perpendicular to the motion of a particle produces a 
circular path.!° The radius of the circle is given by 

mv v 


2mv é f ; 10.57 
mvw cos ; r 2o cos ( ) 


The frequency of the circular motion is 
pow 


@ = — =2wcosé. (10.58) 
F 


19 Ifyou want to be mathematical about this, see Footnote 34 in Chapter 9. 
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10.8. 


To get a rough idea of the size of the circle, you can show (using w ~ 7.3 - 1075 s7!) 
that r ~ 10km when v = 1m/s and 6 = 45°. Even the tiniest bit of friction will 
clearly make this effect impossible to see. In the special case of 0 ~ 7/2 (that is, 
near the equator), the component of the Coriolis force along the surface is negligible, 
so r becomes large, and w’ goes to 0. 


Remark: Inthe limit@ ~ 0(thatis, near the north pole), the Coriolis force essentially 
points along the surface. The above equations give r © v/(2), and w © 2w. For 
the special case where the center of the circle is the north pole, this w ~ 2 result 
might seem incorrect, because you might want to say that the circular motion should 
be achieved by having the puck remain motionless in the inertial fame, while the 
earth rotates beneath it (thus making w = w). The error in this reasoning is that the 
“level” earth is not spherical, due to the nonradial direction of ger. If the puck starts 
out motionless in the inertial frame, it will be drawn toward the north pole, due to the 
component of the gravitational force along the nonspherical “level” earth. In order 
not to fall toward the pole, the puck needs to travel with frequency œw (relative to the 
inertial frame) in the direction opposite”? to the earth’s rotation. The reason for this is 
that in the rotating frame of the puck, the puck feels the same centrifugal force that it 
would feel if it were at rest in the frame of the earth, spinning along with it, because 
these two frames have the same magnitude of w; the w’s point in opposite directions, 
but this doesn’t affect the centrifugal force. The puck therefore happily stays at the 
same 0 value on the “level” surface, just as a puck at rest on the earth does. The angular 
velocity of the puck with respect to the earth is therefore (~w) — (w) = —2w, where 
the minus sign signifies the backward direction. & 


t=dL/dt « 

Let r} be the position vector in the accelerating frame. (In terms of the quantities in 
Section 8.4.3, r; equals r; — ro.) The total angular momentum of an object in the 
accelerating frame is 


L=% ri x mit. (10.59) 
i 
Therefore, 
dL 
a Yy x mit, + yor, x mjt, 
i i 


— 0+ `r x pon 


L L 


i 
= J ri x (Fe a p 4 eo) (10.60) 
i 


The first term, > r; x F aul equals the total external torque measured relative to 


the origin of the accelerating frame, as desired. The second term, }_ r; x F uae 


equals the total torque from internal forces, which is zero by the same reasoning as 
in Section 8.4.3. The third term, }_ rj x F fictitious. is the tricky one. Since the frame 
isn’t rotating, we have at most the fictitious translation force. So this term equals 


yon x (—m¥o) = -X mir; x Fo = —Mriyų X Fo, (10.61) 


where r/,, is the position of the object’s CM in the accelerating frame, and ro is the 
position of the origin of the frame with respect to the inertial lab frame. This result 


20 Of course, the puck can also move with frequency w in the same direction as the earth’s rotation. 


But in this case, the puck simply sits at one spot on the earth. 


10.9. 


10.6 Solutions 


agrees with the second term in Eq. (8.45), where r(,, is written as R — ro. The three 
conditions under which the third term vanishes are therefore: (1) riy = 0, that is, the 
CM is located at the origin of the accelerating frame. The fictitious translation force 
acts just like a gravitational force, so as far as the torque is concerned, the translation 
force acts at the CM. Therefore, if the CM is located at the origin, the translation 
force has no lever arm and thus provides no torque. (2) fo = 0, that is, the origin is 
not accelerating, so there is no translation force. (3) rų is parallel to ¥9. This means 
that if the translation force is considered to be a gravitational force, then the CM lies 
directly “above” or “below” the origin. So there is no lever arm and thus no torque. 

If the frame is rotating in addition to translating, then in general there will be 
torques from the centrifugal, Coriolis, and azimuthal forces, even if the CM is chosen 
as the origin. This is true because these fictitious forces involve r (or r), which results 
in the torque not being linear in r (because there is already an r int = r x F). This 
then means that the r/,, position vector doesn’t arise in the calculation of dL/dt as it 
did in Eq. (10.61). 


Determining your frame 


Yes. You can determine w and dw/dt as follows. Simultaneously measure the forces 
on particles of mass m at rest at positions rı and r2. The centrifugal and azimuthal 
are the relevant forces, so the difference in the forces at the two locations is 


d 
AF =F; — F> = —mo x (ø x (r1 — r2))— m 7 x(t1—1). (10.62) 


Using the fact that the cross product of two vectors is perpendicular to each vector, 
we see that the magnitudes of the components of AF parallel and perpendicular to 
rı — r? are Fy = mo? |r} — r2| and F, = m(dw/dt)|r; — r2|. So we have 


F d F 
om gad ee, (10.63) 
m|rı — r2| dt mir, -r| 


These expressions give w and dw/dt in terms of measured quantities. Note that we 
needed to measure the force at two points here. Measuring the force of —mw x (w x 
r) — m(dw/dt) x r at only one point r doesn’t give us anything, because we don’t 
yet know where the origin is, so we don’t know the value of r. But with two points, 
the difference rı — r2 is independent of the location of the origin. 

If you want, you can check the result for w by finding the difference between 
the forces on two objects at essentially the same location, one of which is at rest 
and the other of which is moving with velocity v. Because the r values are the same, 
only the Coriolis term survives in the difference. This gives w = |AF|/(2mv). 

To find the center of the disk, measure the force on a particle at a given location. 
Break this force into orthogonal components in the ratio of dw/dt to w, by drawing 
a line at an angle of tan! ((dw/dt)/w*) with respect to the force. Whether this line 
is to the right or to the left of the force depends on whether the rı — r2 vector above 
was to the right or to the left of the AF vector. This line contains the component 
proportional to œ, which is the radial component. The line therefore passes through 
the center of the disk. So if we repeat the process with the particle at another location 
(not along the line) and draw a similar line, then the intersection of the two lines is 
the center of the disk. 


Remark: If we remove the restriction that there aren’t any real forces, then it isn’t 
possible to determine the three desired quantities, because someone could claim that 
w, dw/dt, and the location of the center are different from what you found, and he 
could then just say that there are a bunch of (contrived) real forces that conspire to 
make the total force be what you observe. & 
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10.10. Changing w’s direction 


(a) Let Q be the point on the axis of the cone directly above P, and let its height 
above P be h (see Fig. 10.27). Consider the situation an infinitesimal time t 
later. Let P’ be the point that is now directly below Q (see Fig. 10.28). The 
angular speed of the cone is w, so Q moves horizontally with speed vg = wh. 
Therefore, in the infinitesimal time t, Q moves a distance wht to the side. 

This distance wht is also (essentially) the horizontal distance between P and 
P’. Therefore, a little geometry tells us that P is now a height 


th 1 
Fig. 10.27 y(t) = h- VP- omy =h- b/T— oF w LO" = xo hye 


2 
(10.64) 


horizontal Q 


: ; above the table. Since P started on the table with zero speed, this means that 
end views : 
() 
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P undergoes an acceleration of wh in the vertical direction. A mass m located 

at P must therefore feel a real force (normal force or whatever) of Fp = moh 

in the upward direction, if it is to remain motionless with respect to the cone. 

(b) The precession frequency Q (which is how fast œ swings around the origin) 

is equal to the speed of Q divided by r, where r is the distance from Q to 

the z axis (that is, the radius of the circle Q travels in). Therefore, @ has 

magnitude vg/r = wh/r, and it points in the —Z direction, for the situation 

shown in Fig. 10.27. Hence, dw/dt = Q x w has magnitude w*h/r, and it 

points out of the page in Fig. 10.27 (or to the left in Fig. 10.28). Therefore, 

Faz = —m(d@/dt) x r has magnitude mah, and it points in the —z direction. 

A person of mass m at point P therefore interprets the situation as, “I am 

not accelerating with respect to the cone. Therefore, the net force on me in the 

cone frame is zero. And indeed, the upward normal force Fp from the cone, 

with magnitude mah, is exactly balanced by the mysterious downward force 
Fig. 10.28 Faz, also with magnitude moh.” 

Note that this azimuthal force is still basically the same effect as with the 
translation force, the centrifugal force, and the simpler case of the azimuthal 
force discussed in Section 10.2.4. In all of these cases, the “ground” accelerates, 
so you feel like you are getting flung in the opposite direction with respect to 
the accelerating frame. 


10.11. Unwinding string 


Consider the clockwise direction. This case is easily solved by working in the lab 
frame. After the glue breaks, the mass moves in the straight line of the tangent, because 
the string can’t provide a transverse force unless the mass has already diverged from 
this straight line, which it hasn’t. The speed at which the mass starts this straight-line 
motion (when the glue breaks) is Rw. And it continues to move at this speed because 
the angular speed of the wheel allows the string to unwind at a rate Rw, which is 
exactly the rate needed. Note that the string has zero tension in it, so it’s just like it’s 
not there. 

The counterclockwise direction is tricker, because simple linear motion isn’t con- 
sistent with the constraint that the mass stays tied to the string. There is now a tension 
in the string, and the mass undergoes a spiral motion, which makes things more dif- 
ficult. It’s possible to solve this problem with F = ma, and also with the Lagrangian 
method. But we’ll solve it here by using a slick argument in a rotating frame. Part 
of the trickiness of this approach is choosing which rotating frame to use. The most 
obvious frame is the rotating frame of the wheel, but this frame isn’t too helpful, 
because the mass undergoes a spiral motion which is hard to get a handle on (how- 
ever, see the fourth remark below). It would be nice to work in a frame where the 
mass undergoes a simple kind of motion. It turns out that if we consider the frame 
that rotates counterclockwise at 2m instead of w, things turn out to be simple. 


10.6 Solutions 


The general idea is the following. In this new frame that rotates counterclockwise 
at 2w, the wheel rotates clockwise at w. Therefore, if there were no such things as 
fictitious forces, then we would have exactly the same situation as in the clockwise 
case we solved above, so we would be done. The bad news, however, is that fictitious 
forces do exist. And so without any miraculous cancellation, these forces will lead 
to a transverse force, causing the string’s contact point to move one way or another, 
thereby causing the unwound length to increase at a rate other than Rw. But the good 
news is that such a miraculous cancellation does indeed occur. Let’s see why. 

In the frame that rotates counterclockwise at 2w, consider the moment right after the 
mass leaves the wheel, as shown in Fig. 10.29. The velocity of the mass at this time is 
v = Ra, directed to the right. What are the forces on the mass? There may be a tension 
(it happens to be zero right at the start, but then it grows, as we’ll see below), and then 
there are also the centrifugal and Coriolis forces. The centrifugal force is initially (right 
when the glue breaks) directed radially upward with magnitude mR(2w)* = 4mRa”. 
The Coriolis force is (as you can verify) initially directed downward with magnitude 
2m(2w)v = 2m(2w)(Rw) = 4mRo?. These two forces cancel, so the mass feels no 
transverse force, so it continues to move in a straight line to the right. 

What about a later time? Assume (in an inductive spirit) that the mass is still on the 
straight line determined by its initial motion and moving with speed Rw. The Coriolis 
force is still directed downward with magnitude 4mRw. And the centrifugal force is 
directed radially outward with magnitude mr(2w)*, where r is the present radius, as 
shown in Fig. 10.30. The vertical component of this force is obtained by multiplying 
by R/r, so we again obtain an upward vertical component of 4mRw*. This therefore 
cancels the Coriolis force, and we again have the result that there is no transverse 
force. So we see that if the mass is presently moving in the straight line determined by 
its initial motion, then it will continue to move in this straight line. And since it is by 
definition initially moving in this line, we see inductively that it moves in this line for 
all time, as we wanted to show. Note that the horizontal component of the centrifugal 
force needs to be canceled by the tension (because the string doesn’t stretch), so the 
tension is nonzero in this scenario, in contrast with the clockwise scenario above. 
Since the string is always taut, the speed of the mass is determined by the rate at 
which the wheel rotates, which means that the speed of the mass is always Rw in this 
frame. The rate of increase in the unwound string’s length is therefore Rw (in any 
frame). 


REMARKS: 

1. The tension equals the longitudinal component of the centrifugal force. This 
component is obtained by multiplying the centrifugal force by d/r, where d is 
the length of unwound string. The tension therefore equals 4mdw”, so it grows 
linearly with d. 

2. Even if the string is massive, and even if the density varies, the unwound length 
still increases at a rate Rw, because the above reasoning can be used with every 
atom in the string. This result is a bit surprising, because when viewed in the 
lab frame, it is by no means obvious that a massive string remains straight. 

3. If we hadn’t been told in the statement of the problem that the unwound string’s 
length increases at a rate Ro, then the problem would of course be more difficult. 
For all we’d know, the rate might not even be constant. Aside from making a 
lucky guess that the rate is Rw, we’d have to solve the problem with F = ma or 
the Lagrangian method. The former is rather tricky, but the latter isn’t so bad. 

4. There’s actually an even slicker solution for the counterclockwise case than the 
one we gave above. It goes as follows. First consider the simpler clockwise 
case, and look at the setup in the rotating frame of the wheel. It this case, the 
centrifugal and Coriolis forces conspire to make the mass move in a backward 
spiral, the beginning of which is shown in Fig. 10.31. We know from our lab- 
frame reasoning above that in the lab frame the string provides no force during 
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this motion, but that it does in fact remain straight. And the rate of change of 
the unwound length is Rw. 

Now consider the more difficult counterclockwise case in the rotating frame 
of the wheel. The only difference from the clockwise case is that the w vec- 
tor has switched direction, and now points out of the page. The centrifugal 
force is therefore still the same function of r, but the Coriolis force has 
switched signs, as a function of v. However, the centrifugal force is the only 
force that does work on the mass (because the Coriolis force and the ten- 
sion are perpendicular to the velocity), so the mass ends up moving at the 
same speed as a function of r as in the clockwise case. The string constrains 
the mass to move along the same path as in the clockwise case (because the 
string is straight in both cases), and the only effect of the Coriolis force is 
to increase the tension. The motion is therefore exactly the same (same path, 
same speed as a function of r), so the rate of change of the unwound length is 
again Rw. & 


10.12. Shape of the earth 


(a) The potential energy function derived from the sum of the gravitational and 


centrifugal forces must be constant along the surface. Otherwise, a piece of the 
earth would want to move along the surface, which would mean that we didn’t 
have the correct surface to begin with. 

If x is the distance from the earth’s axis, then the centrifugal force is 
F, = mx, directed outward. The potential energy function for this force 
is Ve = —mw*x? /2, up to an arbitrary additive constant. Under the assumption 
that the distortion of the earth doesn’t change the gravitational force (which isn’t 
correct, as we'll see below), the potential energy for the gravitation force is mgh, 
where we’ve arbitrarily chosen the original spherical surface to correspond to 
h = 0. The equal-potential condition is therefore 


mgh — =C, (10.65) 


where C is a constant to be determined. Using x = r sin @, we obtain 


wr? sin?0 
h = —\— +B, (10.66) 
2g 
where B = C/(mg) is another constant. We may replace the r here with the 
radius of the earth, R, with negligible error. 

Depending what the constant B is, this equation describes a whole family 
of surfaces. We can determine the correct value of B by demanding that the 
volume of the distorted earth be the same as it would be in its spherical shape if 
the centrifugal force were turned off. This is equivalent to demanding that the 
integral of A over the surface of the earth is zero. The integral of (a sin? + b) 
over the surface of the earth is (the integral is easy if we write sin? as 1—cos”@) 


T 
J (a(1 — cos?0) + b) 27 R? sin 6 do 
0 


JE 
-=f ( acos?0 + (a b)) 27 R? sin do 
0 


36 
= 2 R? (S$ (a+ b) cos o) I; 


2a ( 2a ea ») (10.67) 
= 3 T T . a 
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Therefore, we need b = —(2/3)a for this integral to be zero. Plugging this 
result into Eq. (10.66) gives 
Ro? 2 
h = R | — | Gsin“d — 2), (10.68) 
6g 
as desired. 
(b) For convenience, let the correct height be h = £B(3 sin?0 — 2), with 


B = fR(Rw* /6g), where f is the desired fraction. 

Consider the earth to be the superposition of the h = 0 sphere plus an 
effective shell of positive or negative mass, depending on the sign of h at 
a given location. The potential energy of a mass m at a given point on the 
surface of the distorted earth is the sum of the potentials due to (1) gravity 
from the sphere, (2) the centrifugal force, and (3) gravity from the shell. From 
Eq. (10.68), the standard mgh contributions from (1) at a pole and the equator 
are essentially mgp (—2) and mg f (1), respectively. The contributions from (2) 
at a pole and the equator are 0 and —mw*R?/2, respectively. The contributions 
from (3) are trickier. We need to calculate the integral — f Gm dM /£, where 
dM runs over the shell, and £ is the distance from m to each dM. The mass of 
a small element in the shell is 


dM = pdV = phdA = pB(Qsin26 — 2)(R d0)(Rsin0 do). (10.69) 


The distance from the north pole to a point with polar angle 0 equals 
£ = 2Rsin(6/2) = R/20 = cos). Using the Pythagorean theorem, the 
distance from a point on the equator, say (R,0,0), to a point with polar 
angle @ whose general form is (R sin 0 cos¢,Rsind sing, Rcos@) equals 
£ = RJ/2(1 — sin 8 cos ¢). Demanding that the total potential at the north 
pole be equal to the total potential at the equator then gives 


B(—2) +0 F i Gm - pop sin2@ — 2)(Rd0)(R sin 6 do) 
m, — _ 
k Of RVZ = cos) 


mo?R? f7 [° Gm. pBBsin?6 — 2)(R d0) (R sin dd) 

= mgB(1) - 5 
2 o Jo R/2(1 — sin 8 cos ġ) 
(10.70) 
Letting 6 = fR(Ro” /6g), and using 
GMg G(4nrR?p/3) 3g 

= = = 10.71 

E R2 R2 i Gp 4nR’ ( 7 ) 


we can rewrite Eq. (10.70) as 


mo? R2 = 3mefR Ra? 
7 megf. iz 
T 2n 3g Ro J : 
= f f — | mfR | — J) Gsin“6 — 2) (R d0) (R sin 0 dd) 
0 0 4r R 6g 


1 1 
; 10.72 
= (a — sinf cosġ) R/2(1— =z) (10:72) 


After canceling common factors, we end up with 
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Evaluating this integral numerically gives essentially 0.6, which then yields 
f = 5/2, as desired. 


REMARK: Thef = 5/2 result leads to a difference in polar and equatorial radii 
of Ah = (5/2) (R?œ? /6g)(3) ~ 28 000 m = 28 km. The fact that this is larger 
than the correct value of 21.5 km makes sense for the following reason. If all 
of the earth’s mass were concentrated at the center, then the thin distortion 
shell at the surface would have no effect on the potential, because it would be 
massless. So the naive calculation in part (a) would in fact be correct, and Ah 
would be about 11 km. The actual density of the earth decreases with radius, 
which means that the density lies somewhere between the case ofa concentrated 
center and the case of uniform density. The actual value of Ah should therefore 
lie somewhere between the corresponding AA values of 11 km and 28 km. And 
21.5km indeed does. & 


10.13. Southward deflection 


The Coriolis force is 2mwv sin 0 eastward. But v © gt, so the eastward acceleration is 
2wgt sin 0. Integrating this gives an eastward speed of wgt? sin 8. This eastward speed 
produces a Coriolis force in the direction away from the earth’s axis, so the accelera- 
tion in this direction is 2w(wg?? sin 0). The component of this acceleration along the 
surface of the earth (that is, in the southward direction) is 2w7 gt sin @ cos 0. Integrat- 
ing this gives a southward speed of (2/ 3J gt sin 0 cos 0. Integrating again gives a 
southward deflection of (1/6)w*gr* sin 8 cos 8. But gt?/2 ~ h => Ê ~ 2h/g. So 
the southward deflection due to the Coriolis force is (2/ 3) (wh? /g) sin@ cos 6. 

Now for the rest of the problem. To be precise, we’ll define 6 to be the polar angle 
at the point P on the radial line to the dropping point. However, we would still obtain 
the same answer if we defined 0 to be the polar angle at the location of the plumb 
bob, because the difference between these two angles is of order œ? (since it is due 
to the centrifugal contribution to the effective gravity). And this difference yields a 
negligible effect of order w* in the 4(w7h?/g) sin @ cos @ result, because this already 
contains a factor of w”. 


(a) At point P, the magnitude of Feent is moR sin @, so (using Fig. 10.22 in 
the solution to Problem 10.3) the component of Feent perpendicular to mg is 
F. i = mw*Rsin@ cos@. The angle that geg makes with the radial at P is 
therefore essentially equal to F5,./mg = (w*R/g) sin @ cos 6. However, the 
direction of the plumb bob line is determined by the gee vector at the location 
of the plumb bob (call this point B), and not at P; see Fig. 10.32. Now, the 
angle we just found between ge and g (that is, the radial) at P is the same as 
the angle between geff and g at B (at least to order w”, by the argument in the 
preceding paragraph). Therefore, since the g vector at B is tilted at the angle 8 
(shown) relative to the g vector at P, we see that geff at B makes an angle of 
g= (w°R/g) sin 0 cos — £ relative to the radial at P. The angle £ is given 
by d/R, where d is the distance between P and B. So in the thin triangle DPB, 
the relation d © hd gives 


dx h((@?R/g) sin 6 cos6 — d/R) 


is h(w*R/g) sin cos 0 
g 1+h/R 
wRh . h 
x sin 6 cos 6(1 = =) (10.74) 


(b) The gravitational acceleration at height y above the earth is 
GM GM 2 om ( J _ (1 j 
(R+yP RAY RA RI R’ 


(10.75) 


10.6 Solutions 


(There are also corrections due to the centrifugal and Coriolis forces, but these 
are negligible; see Exercise 10.21.) So we have ý = —g(1 — 2y/R). Writing y 
as v du/dy, and separating variables and integrating, gives 


[ovave-f'e(i-2)o 


v = —y2g(h - y) — Qg/R\UP — y?) 


x agh =-y(1 ani (10.76) 


2R 


Writing v = dy/dt, and separating variables and integrating, gives 


p 0 0 h+y 
d pt 
f ax 7 x R dy. (10.77) 
0 h 


V2g(h — y)( =e) h J2g(h—y) 


You can show that the “1” here gives the leading-order time of ,/2h/g. 
The additional time comes from the other term, which yields (with 
z=y/h) 


1 0 aty P hJh f? i+z 
Jie Jn Jhay O 2RS2g J Tz 


dz. (10.78) 


Looking up this integral gives 


hyh 2 0 2h / 5h 
At= 2642/1 | my aed ef 10.79 
Re 3 PON eh (a) (10:79) 


The total time is therefore ,/2h/g(1 + 5h4/6R), as we wanted to show. 

(c) At height y, the distance from the center of the earth is R + y. So from 
the F4,, reasoning in part (a), the acceleration in the southward direction 
is? = œ (R + y) sin 0 cosé. 

(d) If the ball is a distance z away from the radial line through P (call this line 
L), then the radial line to the ball makes an angle of approximately z/R with 
respect to L. The component of the gravitational force on the ball perpendicular 
to L is therefore Z = —g(z/R), where the minus sign signifies toward L (that 
is, northward). 

(e) Parts (c) and (d) give 


z= w(R + y)sin@ cos — g(z/R). (10.80) 


The R term here dominates, so to leading order we have Z = 
w*Rsindcos@ => z & (@Rsinð cos 6)t? /2. Also, to leading order, 
y © h — gt*/2. Plugging these values of z and y into Eq. (10.80) gives 


t t 
=o sind coso (R+ (n- =) — £) 


R? ht? t tt 
= z= 0% sind cos0 (5 +(- £) £ | (10.81) 
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Plugging in the total time t = |/2h/g(1 + 5h/6R) gives, to leading order, a 
total z value equal to 


ee eee R ais) 
z = œ sin cos0 | =- — = 
2 g 3R 


i C 2h g =) g 4k 
N2 g 4 g) 4 g 
a Rh . h/5 5 1 
= . snocoso (1+ 5(5 + 2 ~ 2)) 
o Rh . Th 
z sing cos6(1+ ==). (10.82) 


When we subtract off the plumb bob’s position in Eq. (10.74), the leading 
terms proportional to R cancel. Adding on the Coriolis result then gives a total 
southward deflection (relative to the plumb bob) equal to 

eh 2 


7 
sind coso (z (—1)4 3 


4 2 7,2 
j= eo” sind coso, (10.83) 


path of ball, latitude circle 


as we wanted to show. From the third line in Eq. (10.82), we see that the 7/3 
result can be broken up into a 5/3 that comes from the extra time it takes the 
ball to hit the ground due to the decrease in the gravitational force with altitude, 
a 5/6 that comes from the dependence of the centrifugal force on height, and 
a —1/6 that comes from the slightly northward component of the gravitational 
force. 


great circle / REMARK: We can also solve this problem by working in an inertial frame. In this 

frame, the ball has an initial sideways motion due to the rotation of the earth, and this 

motion causes the ball to travel in the path shown in Fig. 10.33. Because only gravity 

acts on the ball, its path lies in the plane determined by its initial velocity and radius. 

Fig. 10.33 The intersection of this plane with the surface of the earth is the great circle shown, 
and the ball hits the earth at a point A on this great circle. 

The distance £ that the ball travels along the great circle is essentially equal to the 
distance it travels in the eastward direction (that is, along the latitude circle). This 
eastward distance equals the distance that point P rotates due to the spinning of the 

latitude circle earth, plus the extra Coriolis eastward deflection door that we found in Eq. (10.18). 
(In the spirit of working in an inertial frame, let’s assume that we found this deflection 
by the method given in the remark after Eq. (10.18).) So we have £ ~ R sin 6 wt+deor- 
The task now is to determine the distance between A and the latitude circle through 
P. This distance should equal the result in Eq. (10.82) plus the southward Coriolis 
deflection. (We’ll then subtract off the distance from the pendulum bob to P, given 
in Eq. (10.74), as we did above.)”! This task is equivalent to answering the question: 
If a circle of radius r = R sin 0 (the latitude circle) sits on a plane (the plane of the 
great circle) and is inclined at an angle 0 with respect to the normal to the plane, how 
far above the plane is a point on the circle with an “x” coordinate equal to £ (with 
plane of great circle £ & r)? Using a Taylor series approximation for the length vr? — £? in the right 
triangle in Fig. 10.34, we see that if the circle is perpendicular to the plane, then the 
. desired distance is ¢7/2r. Tilting the circle by an angle 6 simply brings in a factor of 
Fig. 10.34 cos @, so the distance is (£? /2r)cos@ = (€2/2R sin 0) cos 8. 


21 Tf you want, you can derive the distance from the pendulum bob to P by working in the inertial 
frame, to maintain the spirit of this solution. The mo?R type term we found in part (a) above 
simply shows up through the centripetal acceleration instead of through the centrifugal force. 
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Using the time of flight obtained above in part (b), and also the decor from Eq. (10.18), 
we obtain a southward deflection relative to P equal to (dropping higher order terms) 
L cos 
z= - 
2R sind 
(R sinô wt + deor)? cos 6 
2R sind 


x Ro sin @ cos 6 17/2 + @deor Cos O t 


Ro sin@cos6 2h 5h 2whsind [2h 2h 
xy ` (1 1 ) + Ww cos, | — 
2 g 3R 3 g g 


2 


gon sind coso (1+ 5(5 + 5)) (10.84) 

g RAS 3/7 )° l 
As expected, the factor of 9/3 here equals the 7/3 from Eq. (10.82) plus the 2/3 
from the southward Coriolis deflection. Subtracting off the position of the plumb bob 
given in Eq. (10.74) yields the desired southward deflection from the plumb bob, 
(4071? /g)sinOcos@. & 


10.14. Bead on a hoop 


(a) The gravitational force on the bead is essentially GM@m/(R — r)?, directed 
essentially to the right. But to leading order, we can neglect the r here. If the 
bead is at an angle @ up from the horizontal, then we need to multiply this 
rightward force by sin@ ~ 0 to obtain the force component along the hoop. 
Therefore, the F = ma equation along the hoop is 

GMm6 R GM 
r mro w RZ (10.85) 
This is just the usual ./g/r result for a pendulum, in disguised form. 

(b) From Eq. (10.35), the tidal force is (GMm/R?) (2x, —y). The force along the 
hoop comes not only from the horizontal component of this (when multiplied 
by negative sin 0), but also from the vertical component (when multiplied by 
cos @). So the force along the hoop is (using x = r cos 0 and y = r sin 0) 

GMm GMm 
R3 R? 


(2¢ cos 0)(— sin 0) + (—r sin @) cos o) =— (3r sin @ cos 0). 
(10.86) 
Using sin 0 ~ 0 and cos @ ~ 1, the F = ma equation along the hoop is 


3GMmré a 3GM 
mY =mrb w i (10.87) 
R? R? 
Note that this is independent of r. It is smaller than the result in part (a) by a 


factor /37/R. 


10.15. Precession of the equinoxes 


We’ll calculate the effect due to the sun, and then multiply by 3 to obtain the total krsinB 
effect, because the moon’s effect is twice the sun’s. Consider the case where the earth equator 
is in the summer/winter orientation. From Eq. (10.35), the tidal force on the effective 
point mass m is (GMsm/ R? )(2x, —y). Both components are relevant here, so the tidal 


-- — 
y direction 


forces on the two masses are shown in Fig. 10.35, where r is the radius of the earth, m B to sun 
and k = GMgm/R?. The torque due to these forces has magnitude 
2(2kr cos B(r sin £) + kr sin £ (r cos B) = 6kr? sin bcos B, (10.88) Fig. 10.35 


and it is directed into the page. For the case where the earth is in the spring/fall 
orientation (that is, where the sun is positioned where your nose is, as you look 
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at Fig. 10.35), there is no torque, because the longitudinal component of the tidal 
force is zero, and the transverse component is radial. Each of the summer/winter and 
spring/fall cases is applicable for half the time (by assumption 6), so the time-averaged 


torque is 

Tsun = 5 (642 sin B cos B + 0) = 3kr? sin £ cos £. (10.89) 
Adding on the effect of the moon gives an average total torque of 

Tiotal = 9kr? sin B cos B = as cosh. (10.90) 


The angular momentum of the earth is /3w3. The “horizontal” component of this 
is hows sin £, so |dL/dt| = Qhw3 sin B, where Q is the frequency of precession. 
Equating this with the torque gives 

E 9GMsmr? cos B 


Q= 10.91 
RB ( ) 


But from assumptions 3, 4, and 5, we have m = pr°h, where p is the average density 
of the earth. And B = (2/5) (47r? p/3)r? = (82/15) pr>, so~ 
_ 9GMg(pr7h)r* cosB _ 135 GMsh cos £ 
R? (87 pr3/15)w3 87rR303; ` 


(10.92) 


Plugging in the numerical values gives 


135(6.67 - 1071! m? /kg s?) (2 - 10%? kg) (21 - 10° m) cos 23° 

~ 87 (6.4 - 106 m) (1.5 - 10!! m)3(7.3 - 1075 s7!) 

2m 20 

Q ~ 88-10-72 s- 
This answer actually came out a little too good. We had no right to expect that it would 
come out so close to 26 000 years, considering the various approximations we made. 
But we did have a right to expect it to come out within a factor of, say, 5 or so, because 
our approximations could be off by only so much. At any rate, Eq. (10.92) does at least 
give the correct dependence on the various parameters. Note that Eq. (10.40) says 
that A « ro%/g. Using this along with g = G(4m7r? pg/3)/r* in Eq. (10.92), and 
ignoring all numerical factors, gives Q « w3Ms/ (PER?) x w (Ms/Me), where Me 
is the mass of a colossal object whose density equals the earth’s and whose radius 
equals the earth-sun distance. (This relation also holds if we consider the moon 
instead of the sun.) 


88-10 s7! 


T= z © 7.1 - 10"! s © 23 000 years. (10.93) 


Remark: Interestingly, the time of T = 26 000 years is large enough to allow us to 
treat the axis of rotation as essentially constant, but small enough to have a noticeable 
effect over the ages. The star we see in the northern direction nowadays is a different 
northern star from the one people saw, say, 2000 years ago. And the signs of the 
zodiac that we see now are shifted by approximately one place compared with what 
they were 2000 years ago. The “precession of the equinoxes” name comes from the 
fact that if you consider, for example, a distant galaxy that the sun blocks out when the 
earth is in the spring equinox position now, and if you consider the analogous galaxy 
13 000 years ago, then the two galaxies are in opposite directions in the universe, 
relative to the earth. & 


22 The mass m is actually smaller than this, because the earth’s crust is less dense than its interior. 
But J; is smaller than this too, because the earth’s core is denser than the mantle. These effects 
somewhat cancel each other, because m appears in the numerator and J3 appears in the denominator. 
But we’re just doing things roughly anyway. 


Chapter 11 
Relativity (Kinematics) 


We now come to Einstein’s theory of relativity. This is where we find out 
that everything we’ve done so far in this book has been wrong. Well, per- 
haps “incomplete” would be a better word. The important point to realize is 
that Newtonian physics is a limiting case of the more correct relativistic theory. 
Newtonian physics works perfectly fine when the speeds we’re dealing with are 
much less than the speed of light, which is about 3 - 108 m/s. It would be silly, to 
put it mildly, to use relativity to solve a problem involving the length of a baseball 
trajectory. But in problems involving large speeds, or in problems where a high 
degree of accuracy is required, we must use the relativistic theory.! This is the 
subject of the remainder of this book. 

The theory of relativity is certainly one of the most exciting and talked-about 
topics in physics. It is well known for its “paradoxes,” which are quite con- 
ducive to discussion. There is, however, nothing at all paradoxical about it. The 
theory is logically and experimentally sound, and the whole subject is actually 
quite straightforward, provided that you proceed calmly and keep a firm hold of 
your wits. 

The theory rests upon certain postulates. The one that most people find coun- 
terintuitive is that the speed of light has the same value in any inertial (that is, 
nonaccelerating) reference frame. This speed is much greater than the speed of 
everyday objects, so most of the consequences of this new theory aren’t notice- 
able. If we instead lived in a world identical to ours except for the fact that 
the speed of light was 50 mph, then the consequences of relativity would be 
ubiquitous. We wouldn’t think twice about time dilations, length contractions, 
and so on. 

Ihave included a large number of puzzles and “paradoxes” in the problems and 
exercises. When attacking these, be sure to follow them through to completion, 
and don’t say, “I could finish this one if I wanted to, but all I’d have to do would 
be such-and-such, so I won’t bother,” because the essence of the paradox may 


' You shouldn’t feel too bad about having spent so much time learning about a theory that’s just 
the limiting case of another theory, because you’re now going to do it again. Relativity is also the 
limiting case of another theory (quantum field theory). And likewise, quantum field theory is the 
limiting case of yet another theory (string theory). And likewise ... well, you get the idea. Who 
knows, maybe it really is turtles all the way down. 
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very well be contained in the such-and-such, and you will have missed out on all 
the fun. Most of the paradoxes arise because different frames of reference seem 
to give different results. Therefore, in explaining a paradox, you not only need 
to give the correct reasoning, you also need to say what’s wrong with incorrect 
reasoning. 

There are two main topics in relativity. One is Special Relativity (which doesn’t 
deal with gravity), and the other is General Relativity (which does). We’ll deal 
mostly with the former, but Chapter 14 contains some of the latter. Special 
Relativity may be divided into two topics, kinematics and dynamics. Kinematics 
deals with lengths, times, speeds, etc. It is concerned only with the space and 
time coordinates of an abstract particle, and not with masses, forces, energy, 
momentum, etc. Dynamics, on the other hand, does deal with these quantities. 
This chapter covers kinematics. Chapter 12 covers dynamics. Most of the fun 
paradoxes fall into the kinematics part, so the present chapter is the longer of the 
two. In Chapter 13, we’ll introduce the concept of 4-vectors, which ties much of 
the material in Chapters 11 and 12 together. 


11.1 Motivation 


Although it was obviously a stroke of genius that led Einstein to his theory 
of relativity, it didn’t just come out of the blue. A number of things going on 
in nineteenth-century physics suggested that something was amiss. There were 
many efforts made by many people to explain away the troubles that were arising, 
and at least a few steps had been taken toward the correct theory. But Einstein 
was the one who finally put everything together, and he did so in a way that 
had consequences far beyond the realm of the specific issues that people were 
trying to understand. Indeed, his theory turned our idea of space and time on its 
head. But before we get into the heart of the theory, let’s look at two of the major 
problems in late nineteenth-century physics. 


11.1.1 Galilean transformations, Maxwell's equations 


Imagine standing on the ground and watching a train travel by with constant 
speed v in the x direction. Let the train frame be S’ and the ground frame be S, as 
shown in Fig. 11.1. Consider two events that happen on the train. For example, 
one person claps her hands, and another person stomps his feet. If the space 
and time separations between these two events in the frame of the train are Ax’ 
and Az’, what are the space and time separations, Ax and At, in the frame of 
the ground? Ignoring what we’ll be learning about relativity in this chapter, the 
answers are “obvious” (well, in that incorrectly obvious sort of way, as we’ll see 


2 If you can’t wait to get to the postulates and results of Special Relativity, you can go straight to 
Section 11.2. The present section can be skipped on a first reading. 
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in Section 11.4.1). The time separation, Aż, is the same as on the train, so we have 
At = At’. We know from everyday experience that nothing strange happens with 
time. When you see people exiting a train station, they’re not fiddling with their 
watches, trying to recalibrate them with a ground-based clock. 

The spatial separation is a little more exciting, but still nothing too compli- 
cated. The train is moving, so everything in it (in particular, the second event) 
gets carried along at speed v during the time A+’ between the two events. So we 
have Ax = Ax’ + vAt’. As a special case, if the two events happen at the same 
place on the train (so that Ax’ = 0), then we have Ax = vA?’. This makes sense, 
because the spot on the train where the events occur simply travels a distance 
vAt by the time the second event happens. The Galilean transformations are 
therefore 

Ax = Ax’ + vAfť', 


(11.1) 
At = Af’. 


Also, nothing interesting happens in the y and z directions, so we have Ay = Ay’ 
and Az = Az’. 

The principle of Galilean invariance says that the laws of physics are invariant 
under the above Galilean transformations. Alternatively, it says that the laws 
of physics hold in all inertial frames.’ This is quite believable. For example, 
Newton’s second law holds in all inertial frames, because the constant relative 
velocity between any two frames implies that the acceleration of a given particle 
is the same in all frames. 


Remarks: Note that the Galilean transformations aren’t symmetric in x and t. This isn’t auto- 
matically a bad thing, but it turns out that it will in fact be a problem in Special Relativity, 
where space and time are treated on a more equal footing. We’ll find in Section 11.4.1 that the 
Galilean transformations are replaced by the Lorentz transformations (at least in the world we 
live in), and the latter are indeed symmetric in x and ¢ (up to factors of the speed of light, c). 

Note also that Eq. (11.1) deals only with the differences in x and t between two events, and 
not with the values of the coordinates themselves. The values of the coordinates of a single 
event depend on where you pick your origin, which is an arbitrary choice. The coordinate 
differences between two events, however, are independent of this choice, and this allows us to 
make the physically meaningful statement in Eq. (11.1). It makes no sense for a physical result 
to depend on the arbitrary choice of origin, and so the Lorentz transformations we derive later 
on will also involve only differences in coordinates. & 


One of the great triumphs of nineteenth-century physics was the theory of 
electromagnetism. In 1864, James Clerk Maxwell wrote down a set of equations 
that collectively described everything that was known about the subject. These 
equations involve the electric and magnetic fields through their space and time 
derivatives. We won’t worry about the specific form of the equations here," but it 


3 It was assumed prior to Einstein that these two statements say the same thing, but we will soon see 
that they do not. The second statement is the one that remains valid in relativity. 

4 Maxwell’s original formulation involved a large number of equations, but these were later written 
more compactly, using vectors, as four equations. 
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turns out that if you transform them from one frame to another via the Galilean 
transformations, they end up taking a different form. That is, if you’ve written 
down Maxwell’s equations in one frame (where they take their standard nice- 
looking form), and if you then replace the coordinates in this frame by those 
in another frame, using Eq. (11.1), then the equations look different (and not 
so nice). This presents a major problem. If Maxwell’s equations take a nice 
form in one frame and a not-so-nice form in every other frame, then why is 
one frame special? Said in another way, Maxwell’s equations predict that light 
moves with a certain speed c. But which frame is this speed measured with 
respect to? The Galilean transformations imply that if the speed is c with respect 
to a given frame, then it is not c with respect to any other frame. The proposed 
special frame where Maxwell’s equations are nice and the speed of light is c 
was called the frame of the ether. We’ll talk in detail about the ether in the next 
section, but what experiments showed was that light surprisingly moved with 
speed c in every frame, no matter which way the frame was moving through the 
supposed ether. 

There were therefore two possibilities. Either something was wrong with 
Maxwell’s equations, or something was wrong with the Galilean transforma- 
tions. Considering how “obvious” the latter are, the natural assumption in the 
late nineteenth century was that something was wrong with Maxwell’s equations, 
which were quite new, after all. However, after a good deal of effort by many peo- 
ple to make Maxwell’s equations fit with the Galilean transformations, Einstein 
finally showed that the trouble was in fact with the latter. More precisely, in 1905 
he showed that the Galilean transformations are a special case of the Lorentz 
transformations, valid only when the speed involved is much less than the speed 
of light.° As we’ll see in Section 11.4.1, the coefficients in the Lorentz transfor- 
mations depend on both v and the speed of light c, where the c’s appear in various 
denominators. Since c is quite large (about 3 - 108 m/s) compared with everyday 
speeds v, the parts of the Lorentz transformations involving c are negligible, for 
any typical v. This is why no one prior to Einstein realized that the transforma- 
tions had anything to do with the speed of light. Only the terms in Eq. (11.1) were 
noticeable. 


As he pondered the long futile fight 
To make Galileo’s world right, 

In a new variation 

On the old transformation, 

It was Einstein who first saw the light. 


> It was well known that Maxwell’s equations were invariant under the Lorentz transformations 
(in contrast with their noninvariance under the Galilean transformations), but Einstein was the 
first to recognize the full meaning of these transformations. Instead of being relevant only to 
electromagnetism, the Lorentz transformations replaced the Galilean transformations universally. 


11.1 Motivation 


In short, the reasons why Maxwell’s equations were in conflict with the 
Galilean transformations are: (1) The speed of light is what determines the scale 
on which the Galilean transformations break down; (2) Maxwell’s equations 
inherently involve the speed of light, because light is an electromagnetic wave. 


11.1.2 Michelson—Morley experiment 


As mentioned above, it was known in the late nineteenth century, after Maxwell 
wrote down his equations, that light is an electromagnetic wave and that it moves 
with a speed of about 3 - 108 m/s. Now, every other wave that people knew about 
at the time needed a medium to propagate in. Sound waves need air, ocean waves 
need water, waves on a string of course need the string, and so on. It was therefore 
natural to assume that light also needed a medium to propagate in. This proposed 
medium was called the ether. However, if light propagates in a given medium, 
and if the speed in this medium is c, then the speed in a reference frame moving 
relative to the medium will be different from c. Consider, for example, sound 
waves in air. If the speed of sound in air is Vsound, and if you run toward a sound 
source with speed vyoy, then the speed of the sound waves with respect to you 
(assuming it’s a windless day) is Usound + Vyou. Equivalently, if you are standing 
downwind and the speed of the wind is vind, then the speed of the sound waves 
with respect to you is Usound + Uwind- 

If this ether really exists, then a reasonable thing to do is to try to measure 
one’s speed with respect to it. This can be done in the following way (we’ll 
work in terms of sound waves in air here).° Let vs be the speed of sound in 
air. Imagine two people standing on the ends of a long platform of length L 
that moves at speed vp with respect to the reference frame in which the air is 
at rest. One person claps, the other person claps immediately when he hears 
the first clap (assume that the reaction time is negligible), and then the first 
person records the total time elapsed when she hears the second clap. What is 
this total time? Well, the answer is that we can’t say without knowing in which 
direction the platform is moving. Is it moving parallel to its length, or transverse 
to it (or somewhere in between)? Let’s look at these two basic cases. For both 
of these, we’ll view the setup and do the calculation in the frame in which the 
air is at rest. 

Consider first the case where the platform moves parallel to its length. In the 
frame of the air, assume that the person at the rear is the one who claps first. Then 
it takes a time of L/(vs — vp) for the sound to reach the front person. This is true 
because the sound must close the initial gap of L at a relative speed of vs — vp, 


é As we’ll soon see, there is no ether, and light travels at the same speed with respect to any frame. 
This is a rather bizarre fact, and it takes some getting used to. It’s hard enough to get away from the 
old way of thinking, even without any further reminders, so I can’t bring myself to work through 
this method in terms of light waves in an ether. I’ll therefore work in terms of sound waves in air. 
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as viewed in the air frame.’ By similar reasoning, the time for the sound to return 
to the rear person is L/(vs + Up). The total time is therefore 


"OE L $ E _ 2Lus (11.2) 
— op UstUp v2 —v2" ` 


This correctly equals 2L/vs when vp = 0, and infinity when vp > vs. 

Now consider the case where the platform moves perpendicular to its length. 
In the frame of the air, we have the situation shown in Fig. 11.2. Since the sound 
moves diagonally,® the “vertical” component is (by the Pythagorean theorem) 


[v2 — UE. This is the relevant component as far as traveling the length of the 


platform goes, so the total time is 


pa, (11.3) 


Again, this correctly equals 2L/vs when vp = 0, and infinity when vp — vs. 

The times in Eqs. (11.2) and (11.3) are not equal. As an exercise, you can 
show that of all the possible orientations of the platform relative to the direction 
of motion, tı is the largest possible time, and fp is the smallest. Therefore, if 
you are on a large surface that is moving with respect to the air, and if you 
know the values of L and vs, then if you want to figure out what vp is (assume 
that it doesn’t occur to you to toss a little piece of paper to at least find the 
direction of the wind), all you have to do is repeat the above setup with someone 
standing at various points along the circumference of a given circle around you. 
If you take the largest total time that occurs and equate it with ¢, then Eq. 
(11.2) will give you vp. Alternatively, you can equate the smallest time with 
t2, and Eq. (11.3) will yield the same vp. Note that if vp « vs, we can apply 
Taylor series approximations to the above two times. For future reference, these 
approximations give the difference in times as 


2L 1 1 Log 
At=tr-h= x ; (11.4) 


— v2 /v2 
vs \1 — 5/05 /1 — v2/v2 


The above setup is the general idea that Michelson and Morley used in 
1887 to measure the speed of the earth through the supposed ether[] There is, 
however, a major complication with light that doesn’t arise with sound — the 
speed of light is so large that any time intervals that are individually measured 
will have inevitable measurement errors that are far larger than the difference 


7 Alternatively, relative to the initial back of the platform, the position of the sound wave is ust, and 
the position of the front person is L + vpt. Equating these gives t = L/(vs — vp). 

8 The sound actually moves in all directions, of course, but it’s only the part of the wave that moves 
in a particular diagonal direction that ends up hitting the other person. 

? See Handschy (1982) for the data and analysis of the experiment. 
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between f; and t2. Therefore, individual time measurements give essentially no 
information. Fortunately, there is a way out of this impasse. 

Consider two of the above “platform” scenarios arranged to be at right angles 
with respect to each other, with the same starting point. This can be arranged 
by having a (monochromatic) light beam encounter a beam splitter that sends 
two beams off at 90° angles. The beams then hit mirrors and bounce back to 
the beam splitter where they (partially) recombine before hitting a screen, as 
shown in Fig. 11.3. The fact that light is a wave, which is what got us into this 
mess in the first place, is now what saves the day. The wave nature of light 
implies that the recombined light beam produces an interference pattern on the 
screen. At the center of the pattern, the beams will constructively or destructively 
interfere (or something in between), depending on whether the two light beams 
are in phase or out of phase when they recombine. This interference pattern is 
extremely delicate. The slightest change in travel times of the beams will cause 
the pattern to noticeably shift. 

If the whole apparatus is rotated around, so that the experiment is performed at 
various angles, then the maximum amount that the interference pattern changes 
can be used to determine the speed of the earth through the ether (vp in the 
platform setup above). In one extreme case, the time in one arm is longer than 
the time in the other by Lv*/c?, where we have changed notation in Eq. (11.4) 
so that vp — v is the speed of the earth, and vs —> c is the speed of light. But in 
the other extreme case, the time in this arm is shorter by Lv” /c?. So the maximal 
interference shift corresponds to a time difference of 2Lv?/c?. 

However, when Michelson and Morley performed their experiment, they 
observed no interference shift as the apparatus was rotated around. Their setup 
did in fact allow enough precision to measure a nontrivial earth speed through the 
ether, if such a speed existed. So if the ether did exist, their results implied that 
the speed of the earth through it was zero. This result, although improbable, was 
technically fine; it might simply have been the case that they happened to do their 
experiment when the relative speed was zero. However, when they performed 
their experiment half a year later, when the earth’s motion around the sun caused 
it to be moving in the opposite direction, they still measured zero speed. It wasn’t 
possible for both of these results to be zero (assuming that the ether exists), so 
something must have been wrong with the initial reasoning. Many people over 
the years tried to explain this null result, but none of the explanations were sat- 
isfactory. Some led to incorrect predictions in other setups, and some seemed to 
work fine but were a bit ad hoc.'° The correct explanation, which followed from 


10 The most successful explanation (and one that was essentially correct, although the reason why it 
was correct wasn’t known until Einstein fully explained things) was the Lorentz—FitzGerald con- 
traction. These two physicists independently proposed that lengths are contracted in the direction 
of the motion by precisely the right factor, namely y 1 — v2/c2, to make the travel times in the 
two arms of the Michelson—Morley setup equal, thus yielding the null result. 
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Einstein’s 1905 theory of relativity, was that the ether doesn’t exist.'! In other 
words, light doesn’t need a medium to propagate in; it doesn’t move with respect 
to a certain special reference frame, but rather it moves with respect to whoever 
is looking at it. 


The findings of Michelson—Morley 
Allow us to say very surely, 

“Tf this ether is real, 

Then it has no appeal, 

And shows itself off rather poorly.” 


REMARKS: 


1. We assumed above that the lengths of the two arms in the apparatus were equal. In practice, 


there is absolutely no hope of constructing the lengths to be equal, up to an error that is 
sufficiently small compared with the wavelength of the light. But fortunately this doesn’t 
matter. We’re concerned not with the difference in the travel times associated with the two 
arms, but rather with the difference in these differences as the apparatus is rotated around. 
Using Eqs. (11.2) and (11.3) with different lengths Lı and Z2, you can quickly show that 
the maximum interference shift corresponds to a time of (L1 +L)? / 2, assuming v & c. 


2. Assuming that the lengths of the arms are approximately equal, let’s plug in some rough 


numbers to see how much the interference pattern shifts. The Michelson—Morley setup 
had arms with lengths of about 10 m. And we’ll take v to be on the order of the speed of the 
earth around the sun, which is about 3 - 10+ m/s. We then obtain a maximal time change 
of t = 2Lv7/c3 ~ 7. 107!6 s. The large negative exponent here might make us want to 
throw in the towel, thinking that the effect is hopelessly small. But we should be careful 
about calling a dimensionful quantity “small.” We need to know what other quantity with 
the dimensions of time we’re comparing it with. The distance that light travels in the 
time t is ct = (3 - 108 m/s)(7- 107!6 s) ~ 2 - 1077 m, and this happens to be a perfectly 
reasonable fraction of a wavelength of visible light, which is around A = 6-107’ m, give 
or take. So we have ct/A ~ 1/3. The time we want to compare 2Lv? /c} with is therefore 
the time it takes light to travel one wavelength, namely A/c, and these two times turn out 
to be roughly the same size. The interference shift of about a third of a cycle was well 
within the precision of the Michelson—Morley setup. So if the ether had really existed, 
they definitely would have been able to measure the speed of the earth through it. 


. One proposed explanation of the observed null effect was “frame dragging.” Perhaps the 


earth drags the ether along with it, thereby yielding the observed zero relative speed. This 
frame dragging is quite plausible, because in the platform example above, the platform 
drags a thin layer of air along with it. And more mundanely, a car completely drags the 
air in its interior along with it. But it turns out that frame dragging is inconsistent with 
stellar aberration, which is the following effect. 

Depending on the direction of the earth’s instantaneous velocity as it orbits around the 
sun, a given star might (depending on its location) appear at slightly different places in 
the sky when viewed at two times, say, six months apart. This is due to the fact that your 
telescope must be aimed at a slight angle relative to the actual direction to the star, because 
as the star’s light travels down the telescope, the telescope moves slightly in the direction 
of the earth’s motion. The ratio of the earth’s speed around the sun to the speed of light 
is about 1074, so the effect is small. But it is large enough to be noticeable, and it has 


1L Although we’ve presented the Michelson—Morley experiment here for pedagogical purposes, the 
consensus among historians is that Einstein actually wasn’t influenced much by the experiment, 
except indirectly through Lorentz’s work on electrodynamics. See Holton (1988). 
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indeed been measured. Note that it is the velocity of the telescope that matters here, and 
not its position. !? 

However, if frame dragging were real, then the light from the star would get dragged 
along with the earth and would therefore travel down a telescope that was pointed directly 
at the star, in disagreement with the observed fact that the telescope must point at the slight 
angle mentioned above. Or even worse, the dragging might produce a boundary layer of 
turbulence which would blur the stars. The existence of stellar aberration therefore implies 
that frame dragging doesn’t occur. & 


11.2 The postulates 


Let’s now start from scratch and see what the theory of Special Relativity is all 
about. We’ ll take the route that Einstein took and use two postulates as the basis 
of the theory. We’ ll start with the speed-of-light postulate: 


e The speed of light has the same value in any inertial frame. 


I don’t claim that this statement is obvious, or even believable. But I do claim 
that it’s easy to understand what the statement says (even if you think it’s too silly 
to be true). It says the following. Consider a train moving along the ground at 
constant velocity. Someone on the train shines a light from one point on the train 
to another. Let the speed of the light with respect to the train be c (~3 - 10°m/s). 
Then the above postulate says that a person on the ground also sees the light 
move at speed c. 

This is a rather bizarre statement. It doesn’t hold for everyday objects. If a 
baseball is thrown on a train, then the speed of the baseball is different in the 
different frames. The observer on the ground must add the velocity of the ball 
(with respect to the train) and the velocity of the train (with respect to the ground) 
to obtain the velocity of the ball with respect to the ground. !? 

The truth of the speed-of-light postulate cannot be demonstrated from first 
principles. No statement with any physical content in physics (that is, one that 
isn’t purely mathematical, such as, “two apples plus two apples gives four 
apples”) can be proven. In the end, we must rely on experiment. And indeed, 
all the consequences of the speed-of-light postulate have been verified countless 


12 This aberration effect is not the same as the parallax effect in which the direction of the actual 
position of an object changes, depending on the location of the observer. For example, people at 
different locations on the earth see the moon at different angles (that is, they see the moon in line 
with different distant stars). Although stellar parallax has been measured for nearby stars (as the 
earth goes around the sun), its angular effect is much smaller than the angular effect from stellar 
aberration. The former decreases with distance, whereas the latter doesn’t. For further discussion 
of aberration, and of why it is only the earth’s velocity (or rather, the change in its velocity) that 
matters, and not also the star’s velocity (since you might think, based on the title of this chapter, 
that it is the relative velocity that matters), see Eisner (1967). 

Actually, this isn’t quite true, as the velocity-addition formula in Section 11.5.1 shows. But it’s 


true enough for the point we’re making here. 
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times during the past century. As discussed in the previous section, the most 
well-known of the early experiments on the speed of light was the one performed 
by Michelson and Morley. And in more recent years, the consequences of the pos- 
tulate have been verified continually in high-energy particle accelerators, where 
elementary particles reach speeds very close to c. The collection of all the data 
from numerous experiments over the years allows us to conclude with near cer- 
tainty that our starting assumption of an invariant speed of light is correct (or is 
at least the limiting case of a more accurate theory). 

There is one more postulate in the Special Relativity theory, namely the 
“relativity” postulate (also called the Principle of Relativity). It is much more 
believable than the speed-of-light postulate, so you might just take it for granted 
and forget to consider it. But like any postulate, of course, it is crucial. It can be 
stated in various ways, but we’ll simply word it as: 


e All inertial frames are “equivalent.” 


This postulate basically says that a given inertial frame is no better than any 
other. There is no preferred reference frame. That is, it makes no sense to say 
that something is moving; it makes sense only to say that one thing is moving 
with respect to another. This is where the “Relativity” in Special Relativity comes 
from. There is no absolute frame; the motion of any frame is defined only relative 
to other frames. 

This postulate also says that if the laws of physics hold in one inertial frame 
(and presumably they do hold in the frame in which I now sit),'* then they hold 
in all others. It also says that if we have two frames S and S’, then S should see 
things in S’ in exactly the same way as S’ sees things in S, because we can just 
switch the labels of S and S” (we'll get our money’s worth out of this statement 
in the next few sections). It also says that empty space is homogeneous (that is, 
all points look the same), because we can pick any point to be, say, the origin of a 
coordinate system. It also says that empty space is isotropic (that is, all directions 
look the same), because we can pick any axis to be, say, the x axis of a coordinate 
system. 

Unlike the first postulate, this second one is entirely reasonable. We’ ve gotten 
used to having no special places in the universe. We gave up having the earth as 
the center, so let’s not give any other point a chance, either. 

Copernicus gave his reply 

To those who had pledged to deny. 
“All your addictions 

To ancient convictions 


Won’t bring back your place in the sky.” 


14 Technically, the earth is spinning while revolving around the sun, and there are also little vibrations 
in the floor beneath my chair, etc., so I’m not really in an inertial frame. But it’s close enough 
for me. 
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The second postulate is nothing more than the familiar principle of Galilean 
invariance, assuming that the latter is written in the “The laws of physics hold in 
all inertial frames” form, and not in the form that explicitly mentions the Galilean 
transformations, which are inconsistent with the speed-of-light postulate. 

Everything we’ve said here about the second postulate refers to empty space. 
If we have a chunk of mass, then there is certainly a difference between the 
position of the mass and a point a meter away. To incorporate mass into the theory, 
we would have to delve into General Relativity. But we won’t have anything to 
say about that in this chapter. We will deal only with empty space, containing 
perhaps a few observant souls sailing along in rockets or floating aimlessly on 
little spheres. Though it may sound boring at first, it will turn out to be more 
exciting than you’d think. 


REMARK: Given the second postulate, you might wonder if we even need the first. If all inertial 
frames are equivalent, shouldn’t the speed of light be the same in any frame? Well, no. For all 
we know, light might behave like a baseball. A baseball certainly doesn’t have the same speed 
with respect to different frames, and this doesn’t ruin the equivalence of the frames. 

It turns out (see Section 11.10) that nearly all of Special Relativity can be derived by 
invoking only the second postulate. The first postulate simply fills in the last bit of necessary 
information by stating that something has the same finite speed in every frame. It’s actually 
not important that this thing happens to be light. It could be mashed potatoes or something else 
(well, it has to be massless, as we’ ll see in Chapter 12, so they’d have to be massless potatoes, 
but whatever), and the theory would come out the same. So to be a little more minimalistic, 
it’s sufficient to state the first postulate as, “There is something that has the same speed in any 
inertial frame.” It just so happens that in our universe this thing is what allows us to see.!> & 


11.3 The fundamental effects 


The most striking effects of our two postulates are (1) the loss of simultane- 
ity, (2) length contraction, and (3) time dilation. In this section, we’ll discuss 
these three effects using some time-honored concrete examples. In the following 
section, we’ ll derive the Lorentz transformations using these three results. 


11.3.1 Loss of simultaneity 


Consider the following setup. In A’s reference frame, a light source is placed 
midway between two receivers, a distance Z’ from each (see Fig. 11.4). The light 
source emits a flash. From 4’s point of view, the light hits the two receivers at 
the same time, ¢’/c seconds after the flash. Now consider another observer, B, 
who travels to the left at speed v. From her point of view, does the light hit the 
receivers at the same time? We will show that it does not. 


15 To go a step further, it’s actually not even necessary for there to exist something that has the same 
speed in any frame. The theory will still come out the same if we write the first postulate as, “There 
is a limiting speed of an object in any frame.” (See Section 11.10 for a discussion of this.) There’s 
no need to have something that actually travels at this speed. It’s conceivable to have a theory that 
contains no massless objects, so that everything travels slower than this limiting speed. 


511 


512 


Fig. 11.5 


Relativity (Kinematics) 


In B’s reference frame, the situation looks like that in Fig. 11.5. The receivers 
(along with everything else in A’s frame) move to the right at speed v, and the 
light travels in both directions at speed c with respect to B (not with respect to the 
light source, as measured in B’s frame; this is where the speed-of-light postulate 
comes into play). Therefore, the relative speed (as viewed by B) of the light and 
the left receiver is c + v, and the relative speed of the light and the right receiver 
isc —v. 


REMARK: Yes, it’s legal to just add and subtract these speeds to obtain the relative speeds as 
viewed by B. If v equals, say, 2 - 108 m/s, then in one second the left receiver moves 2 - 108 m 
to the right, while the left ray of light moves 3 - 10° m to the left. This means that they are now 
5 - 108 m closer than they were a second ago. In other words, the relative speed (as measured 
by B) is 5 - 108 m/s, which is simply c + v. (Note that this implies that it’s perfectly legal for 
the relative speed of two things, as measured by a third, to take any value up to 2c.) Both the v 
and c here are measured with respect to the same person, namely B, so our intuition works fine. 
We don’t need to use the “velocity-addition formula,” which we’ll derive in Section 11.5.1, 
and which is relevant in a different setup. I include this remark here just in case you’ve seen 
the velocity-addition formula and think it’s relevant in this setup. But if it didn’t occur to you, 
then nevermind. & 


Let £ be the distance from the source to the receivers, as measured by B.!6 
Then in B’s frame, the light hits the left receiver at 4 and the right receiver at 
tr, where 

€ £ 


l TEY an T ars ( ) 


These are not equal if v Æ 0. (The one exception is when £ = 0, in which case 
the two events happen at the same place and same time in all frames.) The moral 
of this exercise is that it makes no sense to say that one event happens at the same 
time as another, unless you state which frame you’re talking about. Simultaneity 
depends on the frame in which the observations are made. 


Of the many effects, miscellaneous, 
The loss of events, simultaneous, 
Allows B to claim 

There’s no pause in £’s frame, 


REMARKS: 
1. The invariance of the speed of light was used in saying that the two relative speeds above 


were c + v and c — v. If we were talking about baseballs instead of light beams, then the 
relative speeds wouldn’t look like this. If up is the speed at which the baseballs are thrown 


16 We’ll see in Section 11.3.3 that £ is not equal to Z’, due to length contraction. But this won’t be 
important for what we’re doing here. The only fact we need for now is that the light source is 
equidistant from the receivers, as measured by B. This is true because space is homogeneous, 
which implies that the length-contraction factor must be the same everywhere. More on this in 
Section 11.3.3. 
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in A’s frame, then B sees the balls move at speeds vp — v to the left and vp +v to the right. !7 
These are not equal to up, as they would be in the case of the light beams. The relative 
speeds between the balls and the left and right receivers are therefore (vp — v) + v = vp 
and (vp + v) — v = vp. These are equal, so B sees the balls hit the receivers at the same 
time, as we know very well from everyday experience. 

2. It is indeed legal in Eq. (11.5) to obtain the times by simply dividing £ by the relative 
speeds, c + v and c — v. But if you want a more formal method, then you can use this 
reasoning: In B’s frame, the position of the right photon is given by ct, and the position 
of the right receiver (which had a head start of £) is given by £ + vt. Equating these two 
positions gives f, = £/(c — v). Likewise for the left photon. 

. There is always a difference between the time an event happens and the time someone 
sees the event happen, because light takes time to travel from the event to the observer. 
What we calculated above were the times at which the events really happen. If we wanted 
to, we could calculate the times at which B sees the events occur, but such times are rarely 
important, and in general we won’t be concerned with them. They can easily be calculated 
by adding on a (distance)/c time difference for the path of the photons to B’s eye. Of 
course, if B actually did the above experiment to find fr and 4, she would do it by writing 
down the times at which she saw the events occur, and then subtracting off the relevant 
(distance) /c time differences to find when the events really happened. 

To sum up, the ¢, Æ t result in Eq. (11.5) is due to the fact that the events truly occur 
at different times in B’s frame. /t has nothing to do with the time it takes light to travel to 
your eye. In this chapter, we will often use sloppy language and say things like, “What 
time does B see event Q happen?” But we don’t really mean, “When do B’s eyes register 
that O happened?” Instead, we mean, “What time does B know that event Q happened in 
her frame?” If we ever want to use “see” in the former sense, we will explicitly say so (as 
in Section 11.8 on the Doppler effect). & 


U 


Where this last line is not so extraneous. 


H 
Example (Rear clock ahead): Two clocks are positioned at the ends of a train 
of length Z (as measured in its own frame). They are synchronized in the train frame. O © 
The train travels past you at speed v. It turns out that if you observe the clocks at L 
simultaneous times in your frame, you will see the rear clock showing a higher reading 
than the front clock (see Fig. 11.6). By how much? g 


Solution: As above, let’s put a light source on the train, but let’s now position it so Fig. 11.6 
that the light hits the clocks at the ends of the train at the same time in your frame. As 
above, the relative speeds of the photons and the clocks are c + v and c — v (as viewed 
in your frame). We therefore need to divide the train into lengths in this ratio, in your 
frame. But since length contraction (discussed in Section 11.3.3) is independent of 
position, this must also be the ratio in the train frame. So in the train frame, you 


can quickly show that two numbers that are in this ratio, and that add up to L, are L(c+v) L(c-v) 
L(c + v)/2c and L(c — v)/2c. Zc FG 
The situation in the train frame therefore looks like that in Fig. 11.7. The light must 
travel an extra distance of L(c + v)/2c — L(c — v)/2c = Lv/c to reach the rear clock. 
The light travels at speed c (as always), so the extra time is Lu/ c°. Therefore, the rear 


Fig. 11.7 


17 The velocity-addition formula in Section 11.5.1 shows that these formulas aren’t actually correct. 
But they’re close enough for our purposes here. 
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clock reads Lu/ c? more when it is hit by the backward photon, compared with what 
the front clock reads when it is hit by the forward photon. 

Now, let the instant you look at the clocks be the instant the photons hit them (that’s 
why we chose the hittings to be simultaneous in your frame). Then from the previous 
paragraph, you observe the rear clock reading more than the front clock by an amount, 


L 
(difference in readings) = es ; (11.6) 
c 


Note that the Z that appears here is the length of the train in its own frame, and not 
the shortened length that you observe in your frame (see Section 11.3.3). Appendix G 
gives a number of other derivations of Eq. (11.6), although they rely on material yet 
to come in this chapter and Chapter 14. 


REMARKS: 


1. This Lu/c? result has nothing to do with the fact that the rear clock takes more time 
to pass you. 

2. The result does not say that you see the rear clock ticking at a faster rate than the front 
clock. They run at the same rate (both have the same time-dilation factor relative to 
you; see Section 11.3.2). The rear clock is simply a fixed time ahead of the front 
clock, as seen by you. 

3. The fact that the rear clock is ahead of the front clock in your frame means that in 
the train frame the light hits the rear clock after it hits the front clock. 

4. It’s easy to forget which of the clocks is the one that is ahead. But a helpful mnemonic 
for remembering “rear clock ahead” is that both the first and fourth letters in each 
word form the same acronym, “rea,” which is an anagram for “car,” which is sort of 
like a train. Sure. & 


11.3.2 Time dilation 


We present here the classic example of a light beam traveling vertically on a train. 
Let there be a light source on the floor of the train and a mirror on the ceiling, 
which is a height A above the floor. Let observer A be at rest on the train, and 
let observer B be at rest on the ground. The speed of the train with respect to 
the ground is v.!* A flash of light is emitted. The light travels up to the mirror, 
bounces off it, and then heads back down. Assume that after the light is emitted, 
we replace the source with a mirror, so that the light keeps bouncing up and down 
indefinitely. 

In A’s frame, the train is at rest. The path of the light is shown in Fig. 11.8. 
It takes the light a time A/c to reach the ceiling and then a time A/c to return to 


'8 Technically, the words, “with respect to ...,” should always be included when talking about speeds, 
because there is no absolute reference frame, and hence no absolute speed. But in the future, when 
it’s clear what we mean (as in the case of a train moving with respect to the ground), we’ll 
occasionally be sloppy and drop the “with respect to ...” 
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the floor. The roundtrip time is therefore 
u = —. (11.7) 


In B’s frame, the train moves at speed v. The path of the light is shown in 
Fig. 11.9. The crucial fact to remember is that the speed of light in B’s frame 
is still c. This means that the light travels along its diagonally upward path at 
speed c. (The vertical component ofthe speed is not c, as would be the case if light 
behaved like a baseball.) Since the horizontal component of the light’s velocity 
is v,!° the vertical component must be Vc? — v2, as shown in Fig. 11.10.7° The 
time it takes to reach the mirror is therefore h//c? — v?,! so the roundtrip time is 


= at 11.8 
ae re 
Dividing Eq. (11.8) by Eq. (11.7) gives 
ts = VU, (11.9) 
where 
1 


Y (11.10) 


This y factor is ubiquitous in Special Relativity. Note that it is always greater 
than or equal to 1. This means that the roundtrip time is longer in B’s frame than 
in A’s frame. 

What are the implications of this? For concreteness, let v/c = 3/5, so y = 
5/4. Then we may say the following. If A is standing next to the light source, and 
if B is standing on the ground, and if A claps his hands at t4 = 4 second intervals, 
then B observes claps at tg = 5 second intervals (after having subtracted off the 
time for the light to travel to her eye, of course). This is true because both A and 
B must agree on the number of roundtrips the light beam takes between claps. 
If we assume, for convenience, that a roundtrip takes one second in A’s frame 
(yes, that would be a tall train), then the four roundtrips between claps take five 
seconds in B’s frame, using Eq. (11.9). 


REMARK: We just made the claim that both A and B must agree on the number of roundtrips 
between claps. However, since A and B disagree on so many things (whether two events are 


19 Yes, it’s still v. The light is always located on the vertical line between the source and the mirror. 
Since both of these objects move horizontally at speed v, the light does also. 

20 The Pythagorean theorem is indeed valid here. It’s valid for distances, and since speeds are just 
distances divided by time, it’s valid also for speeds. 

21 We’ve assumed that the height of the train in B’s frame is still h. Although we’ll see in Section 
11.3.3 that there is length contraction along the direction of motion, there is none in the direction 
perpendicular to the motion (see Problem 11.1). 


Fig. 11.10 
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simultaneous, the rate at which clocks tick, and the length of things, as we’ll see below), you 
might be wondering if there’s anything they agree on. Yes, there are still frame-independent 
statements we can hang on to. If a bucket of paint flies past you and dumps paint on your 
head, then everyone agrees that you are covered in paint. Likewise, if A is standing next to the 
light clock and claps when the light reaches the floor, then everyone agrees on this. If the light 
is actually a strong laser pulse, and if A’s clapping motion happens to bring his hands right 
over the mirror when the pulse gets there, then everyone agrees that his hands get burned by 
the laser.  & 


What if we have a train that doesn’t contain one of our special light clocks? It 
doesn’t matter. We could have built one if we wanted to, so the same results con- 
cerning the claps must hold. Therefore, light clock or no light clock, B observes 
A moving strangely slowly. From B’s point of view, A’s heart beats slowly, his 
blinks are lethargic, and his sips of coffee are slow enough to suggest that he 
needs another cup. 


The effects of dilation of time 

Are magical, strange, and sublime. 

In your frame, this verse, 

Which you'll see is not terse, 

Can be read in the same amount of time it takes someone 


else in another frame to read a similar sort of rhyme. 


Our assumption that A is at rest on the train was critical in the above derivation. 
If A is moving with respect to the train, then Eq. (11.9) doesn’t hold, because 
we cannot say that both A and B must agree on the number of roundtrips the 
light beam takes between claps, because of the problem with simultaneity. More 
precisely, if A is at rest on the train right next to the light source, then there are 
no issues with simultaneity, because the distance L in Eq. (11.6) is zero. And if 
A is at rest at a fixed distance from the source, then consider a person A’ at rest 
on the train right next to the source. The distance L between A and J’ is nonzero, 
so from the loss of simultaneity, B sees their two clocks read different times. But 
this difference is constant, so B sees A’s clock tick at the same rate as A’’s clock. 
Equivalently, we can just build a second light clock in a little box and have A 
hold it, and it will have the same speed v (and thus yield the same y factor) as 
the first clock. 

However, if A is moving with respect to the train, then we have a problem. 
If A’ is again at rest next to the source, then the distance L between A and A’ 
is changing, so B can’t use the reasoning in the previous paragraph to conclude 
that A’s and A’’s clocks tick at the same rate. And in fact they do not, because as 
above, we can build another light clock and have A hold it. In this case, A’s speed 
is what goes into the y factor in Eq. (11.10), but this is different from A’’s speed 
(which is the speed of the train). 


REMARKS: 


1. The time dilation result derived in Eq. (11.9) is a bit strange, no doubt, but there doesn’t 
seem to be anything downright incorrect about it until we look at the situation from A’s 


w 
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point of view. A sees B flying by at a speed v in the other direction. The ground is no 
more fundamental than a train, so the same reasoning applies. The time dilation factor, y, 
doesn’t depend on the sign of v, so A sees the same time dilation factor that B sees. That 
is, A sees B’s clock running slow. But how can this be? Are we claiming that A’s clock is 
slower than B’s, and also that B’s clock is slower than A’s? Well... yes and no. 

Remember that the above time-dilation reasoning applies only to a situation where 
something is motionless in the appropriate frame. In the second situation (where A sees B 
flying by), the statement t4 = ytz holds only when the two events (say, two ticks on B’s 
clock) happen at the same place in B’s frame. But for two such events, they are certainly 
not in the same place in A’s frame, so the tg = yt, result in Eq. (11.9) does not hold. The 
conditions of being motionless in each frame never both hold for a given setup (unless 
v = 0, in which case y = 1 and ¢, = ts). So, the answer to the question at the end of the 
previous paragraph is “yes” if you ask the questions in the appropriate frames, and “no” 
if you think the answer should be frame independent. 


. Concerning the fact that A sees B’s clock run slow, and B sees A’s clock run slow, consider 


the following statement. “This is a contradiction. It is essentially the same as saying, ‘I 
have two apples on a table. The left one is bigger than the right one, and the right one is 
bigger than the left one.’ ” How would you respond to this? Well, it is not a contradiction. 
Observers A and B are using different coordinates to measure time. The times measured in 
each of their frames are quite different things. The seemingly contradictory time-dilation 
result is really no stranger than having two people run away from each other into the 
distance, and having them both say that the other person looks smaller. In short, we are 
not comparing apples and apples. We are comparing apples and oranges. A more correct 
analogy would be the following. An apple and an orange sit on a table. The apple says to 
the orange, “You are a much uglier apple than I am,” and the orange says to the apple, 
“You are a much uglier orange than I am.” 


. One might view the statement, “A sees B’s clock running slow, and also B sees A’s clock 


running slow,” as somewhat unsettling. But in fact it would be a complete disaster for 
the theory if A and B viewed each other in different ways. A critical fact in the theory of 
relativity is that A sees B in exactly the same way as B sees A. 

In everything we’ve done so far, we’ve assumed that A and B are in inertial frames, 
because these are the frames that the postulates of Special Relativity deal with. However, 
it turns out that the time dilation result in Eq. (11.9) holds even if A is accelerating, as long 
as B isn’t. In other words, if you’re looking at a clock that is undergoing a complicated 
accelerating motion, then to figure out how fast it’s ticking in your frame, all you need to 
know is its speed at any instant; its acceleration is irrelevant (this has plenty of experimental 
verification). If, however, you are accelerating, then all bets are off, and it’s not valid for 
you to use the time dilation result when looking at a clock. It’s still possible to get a handle 
on such situations, but we’ll wait until Chapter 14 todo so. & 


Example (Twin paradox): Twin A stays on the earth, while twin B flies quickly 
to a distant star and back (see Fig. 11.11). Show that B is younger than A when they 
meet up again. 


Solution: From A’s point of view, B’s clock is running slow by a factor y, on 
both the outward and return parts of the trip. Therefore, B is younger than A when 
they meet up again. This is the answer, and that’s that. So if getting the right answer 
is all we care about, then we can pack up and go home. But our reasoning leaves 
one large point unaddressed. The “paradox” part of this example’s title comes from 
the following alternate reasoning. Someone might say that in B’s frame, A’s clock is 
running slow by a factor y, and so A is younger than B when they meet up again. 
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It’s definitely true that when the two twins are standing next to each other (that 
is, when they are in the same frame), we can’t have both B younger than A, and 
A younger than B. So what is wrong with the reasoning at the end of the previous 
paragraph? The error lies in the fact that there is no “one frame” that B is in. The 
inertial frame for the outward trip is different from the inertial frame for the return 
trip. The derivation of our time-dilation result applies only to one inertial frame. 

Said in a different way, B accelerates when she turns around, and our time-dilation 
result holds only from the point of view of an inertial observer.?” The symmetry in the 
problem is broken by the acceleration. If both A and B are blindfolded, they can still 
tell who is doing the traveling, because B will feel the acceleration at the turnaround. 
Constant velocity cannot be felt, but acceleration can be. (However, see Chapter 14 
on General Relativity. Gravity complicates things.) 

The above paragraphs show what is wrong with the “4 is younger” reasoning, but 
they don’t show how to modify it quantitatively to obtain the correct answer. There are 
many different ways of doing this, and you can tackle some of them in the problems 
(Exercise 11.67, Problems 11.2, 11.19, 11.24, and various problems in Chapter 14). 
Also, Appendix H gives a list of all the possible resolutions to the twin paradox that 
I can think of. 


Example (Muon decay): Elementary particles called muons (which are identical 
to electrons, except that they are about 200 times as massive) are created in the upper 
atmosphere when cosmic rays collide with air molecules. The muons have an average 
lifetime of about 2 - 1076 seconds”? (then they decay into electrons and neutrinos), 
and move at nearly the speed of light. Assume for simplicity that a certain muon is 
created at a height of 50 km, moves straight downward, has a speed v = 0.99998 c, 
decays in exactly T = 2 - 1076 seconds, and doesn’t collide with anything on the 
way down.* Will the muon reach the earth before it (the muon!) decays? 


Solution: The naive thing to say is that the distance traveled by the muon is 
d = vT ~ (3-108 m/s)(2- 1076s) = 600m, and that this is less than 50km, 
so the muon doesn’t reach the earth. This reasoning is incorrect, because of the time- 
dilation effect. The muon lives longer in the earth frame, by a factor of y, which is 
y = 1/1—v2/c? ~% 160 here. The correct distance traveled in the earth frame is 
therefore v(y T) ~ 100km. Hence, the muon travels the 50 km, with room to spare. 
The real-life fact that we actually do detect muons reaching the surface of the earth 
in the predicted abundances (while the naive d = vT reasoning would predict that 


22 For the entire outward and return parts of the trip, B does observe A’s clock running slow, but 
enough strangeness occurs during the turning-around period to make A end up older. Note, how- 
ever, that a discussion of acceleration is not required to quantitatively understand the paradox, as 
Problem 11.2 shows. 

23 This is the “proper” lifetime, that is, the lifetime as measured in the frame of the muon. 

24 Tn the real world, the muons are created at various heights, move in different directions, have 
different speeds, decay in lifetimes that vary according to a standard half-life formula, and may 
very well bump into air molecules. So technically we’ve got everything wrong here. But that’s no 
matter. This example will work just fine for the present purpose. 


11.3 The fundamental effects 


we shouldn’t see any) is one of the many experimental tests that support the relativity 
theory. 


11.3.3 Length contraction 


Consider the following setup. Person A stands on a train which he measures to 
have length 2’, and person B stands on the ground. The train moves at speed v 
with respect to the ground. A light source is located at the back of the train, and 
a mirror is located at the front. The source emits a flash of light that heads to the 
mirror, bounces off, then heads back to the source. By looking at how long this 
process takes in the two reference frames, we can determine the length of the 
train as measured by B.*> In A’s frame (see Fig. 11.12), the roundtrip time for 
the light is simply 


O 


u= (11.11) 


g 

Things are a little more complicated in B’s frame (see Fig. 11.13). Let the 
length of the train, as measured by B, be £. For all we know at this point, £ may 
equal £’, but we’ll soon find that it does not. The relative speed (as measured 
by B) of the light and the mirror during the first part of the trip is c — v. The 
relative speed during the second part is c + v. During each part, the light must 
close a gap with initial length £. Therefore, the total roundtrip time is 


pa £ £ € 2c _ U, (11.12) 
or ctu tae c f 
But we know from Eq. (11.9) that 
lay = Y la: (11.13) 


This is a valid statement, because the two events we are concerned with (light 
leaving back, and light returning to back) happen at the same place in the train 
frame, so it’s legal to use the time-dilation result in Eq. (11.9). Substituting the 
results for t4 and t from Eqs. (11.11) and (11.12) into Eq. (11.13), we find 


L 


e (11.14) 
Y 


Note that we could not have used this setup to find the length contraction if we 
had not already found the time dilation in Eq. (11.9). 


25 The third remark below gives another (quicker) derivation of length contraction. But we’ll go 
through the present derivation because the calculation is instructive. 
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Since y > 1, we see that B measures the train to be shorter than A measures. 


The term proper length is used to describe the length of an object in its rest frame. 
So l’ is the proper length of the above train, and the length in any other frame is less 
than or equal to ¢’. This length contraction is often called the Lorentz—FitzGerald 
contraction, for the reason given in Footnote 10. 


Relativistic limericks have the attraction 
Of being shrunk by a Lorentz contraction. 
But for readers, unwary, 

The results may be scary, 

When a fraction ... 


REMARKS: 


1. The length-contraction result in Eq. (11.14) is for lengths along the direction of the rel- 


ative velocity. There is no length contraction in the perpendicular direction, as shown in 
Problem 11.1. 


2. As with time dilation, this length contraction is a bit strange, but there doesn’t seem 


to be anything actually paradoxical about it, until we look at things from A’s point of 
view. To make a nice symmetrical situation, let’s say B is standing on an identical train, 
which is motionless with respect to the ground. A sees B flying by at speed v in the other 
direction. Neither train is any more fundamental than the other, so the same reasoning 
applies, and A sees the same length contraction factor that B sees. That is, A measures 
B’s train to be short. But how can this be? Are we claiming that A’s train is shorter 
than B’s, and also that B’s train is shorter than A’s? Is the actual setup the one shown in 
Fig. 11.14, or is it the one shown in Fig. 11.15? Which does it really look like? Well ... it 
depends. 

The word “is” in the above paragraph is a very bad word to use and is generally the 
cause of all the confusion (but it is ok in this paragraph, thankfully). There is no such 
thing as “‘is-ness” when it comes to lengths. It makes no sense to say what the length of 
the train really is. It makes sense only to say what the length is in a given frame. The 
situation doesn’t really look like one thing in particular. The look depends on the frame 
in which the looking is being done. 

Let’s be a little more specific. How do you measure a length? You write down the 
coordinates of the ends of something measured simultaneously, and then you take the 
difference. But the word “simultaneous” here should send up all sorts of red flags. 
Simultaneous events in one frame are not simultaneous events in another. Stated more 
precisely, here is what we are claiming: Let B write down simultaneous coordinates 
of the ends of A’s train, and also simultaneous coordinates of the ends of her own 
train. Then the difference between the former is smaller than the difference between 
the latter. Likewise, let A write down simultaneous coordinates of the ends of B’s train, 
and also simultaneous coordinates of the ends of his own train. Then the difference 
between the former is smaller than the difference between the latter. There is no contra- 
diction here, because the times at which A and B are writing down the coordinates don’t 
have much to do with each other, due to the loss of simultaneity. You can be quanti- 
tative about this in Problem 11.3. As with time dilation, we are comparing apples and 
oranges. 

. There is an easy argument to show that time dilation implies length contraction, and vice 
versa. Let B stand on the ground, next to a stick of length £. Let A fly past the stick at 
speed v. In B’s frame, it takes A a time of £/v to traverse the length of the stick. Therefore 
(assuming that we have demonstrated the time-dilation result), a watch on A’s wrist will 
advance by a time of only £/yv while he traverses the length of the stick. 


Nn 


11.3 The fundamental effects 


How does A view the situation? He sees the ground and the stick fly by at speed v. 
The time between the two ends passing him is €/yv (because that is the time elapsed 
on his watch). To get the length of the stick in his frame, he simply multiplies the 
speed by the time. That is, he measures the length to be (€/yv)v = ¢/y, which is 
the desired contraction. The same argument also shows that length contraction implies 
time dilation. 


. As mentioned earlier, the length contraction factor y is independent of position on the 


object. That is, all parts of the train are contracted by the same amount. This follows 
from the fact that all points in space are equivalent. Equivalently, we could put a large 
number of small replicas of the above source—mirror system along the length of the 
train. They would all produce the same value for y, independent of the position on 
the train. 


. If you still want to ask, “Is the contraction really real?” then consider the following 


hypothetical undertaking. Imagine a sheet of paper moving sideways past the Mona Lisa, 
skimming the surface. A standard sheet of paper is plenty large enough to cover her face, 
so if the paper is moving slowly, and if you take a photograph at the appropriate time, 
then in the photo you'll see her entire face covered by the paper. However, if the sheet 
is flying by sufficiently fast, and if you take a photograph at the appropriate time, then 
in the photo you’ll see a thin vertical strip of paper covering only a small fraction of her 
face. So you'll still see her smiling at you. & 


Example (Passing trains): Two trains, A and B, each have proper length Z and 
move in the same direction. A’s speed is 4c/5, and B’s speed is 3c/5. A starts behind 
B (see Fig. 11.16). How long, as viewed by person C on the ground, does it take for 
A to overtake B? By this we mean the time between the front of A passing the back 
of B, and the back of A passing the front of B. 


Solution: Relative to C on the ground, the y factors associated with A and B are 
5/3 and 5/4, respectively. Therefore, their lengths in the ground frame are 3L/5 and 
4L/5. While overtaking B, A must travel farther than B, by an excess distance equal 
to the sum of the lengths of the trains, which is 7L/5. The relative speed of the two 
trains (as viewed by C on the ground) is the difference of the speeds, which is c/5. 
The total time is therefore 

_IL/5 _ TL 


T ae (11.15) 


Cc 


Example (Muon decay, again): Consider the “Muon decay” example from 
Section 11.3.2. From the muon’s point of view, it lives for a time of T = 2- 1076 
seconds, and the earth is speeding toward it at v = 0.99998c. How, then, does 
the earth (which travels only d = vT œ~ 600m before the muon decays) reach the 
muon? 


Solution: The important point here is that in the muon’s frame, the distance to the 
earth is contracted by a factor y ~ 160. Therefore, the earth starts only 50 km/160 ~ 
300 m away. Since the earth can travel a distance of 600 m during the muon ’s lifetime, 
the earth collides with the muon, with time to spare. 

As stated in the third remark above, time dilation and length contraction are inti- 
mately related. We can’t have one without the other. In the earth’s frame, the muon’s 


Fig. 11.16 
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arrival on the earth is explained by time dilation. In the muon’s frame, it is explained 
by length contraction. 


Observe that for muons created, 

The dilation of time is related 

To Einstein’s insistence 

Of shrunken-down distance 

In the frame where decays aren’t belated. 


An extremely important strategy in solving relativity problems is to plant 
yourself in a frame and stay there. The only thoughts running through your head 
should be what you observe. That is, don’t try to use reasoning along the lines 
of, “Well, the person I’m looking at in this other frame sees such-and-such.” 
This will almost certainly cause an error somewhere along the way, because you 
will inevitably end up writing down an equation that combines quantities that 
are measured in different frames, which is a no-no. Of course, you might want 
to solve another part of the problem by working in another frame, or you might 
want to redo the whole problem in another frame. That’s fine, but once you decide 
which frame you’re going to use, make sure you put yourself there and stay there. 

Another very important strategy is to draw a picture of the setup (in whatever 
frame you’ve chosen) at every moment when something significant happens, as 
we did in Fig. 11.13. Once we drew the pictures there, it was clear what we 
needed to do. But without the pictures, we almost certainly would have gotten 
confused. 

At this point you might want to look at the “Qualitative relativity questions” in 
Appendix F, just to make sure we’re all on the same page. Some of the questions 
deal with material we haven’t covered yet, but most are relevant to what we’ve 
done so far. 

This concludes our treatment of the three fundamental effects. In the next 
section, we'll combine all the information we’ve gained and use it to derive the 
Lorentz transformations. But one last comment before we get to those: 


Lattice of clocks and meter sticks 
In everything we’ve done so far, we’ve taken the route of having observers sitting 
in various frames, making various measurements. But as mentioned earlier, this 
can cause some ambiguity, because you might think that the time when light 
reaches the observer is important, whereas what we are generally concerned with 
is the time when something actually happens. 

A way to avoid this ambiguity is to remove the observers and then imag- 
ine filling up space with a large rigid lattice of meter sticks and synchronized 
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clocks. Different frames are defined by different lattices; assume that the lattices 
of different frames can somehow pass freely through each other. All the meter 
sticks in a given frame are at rest with respect to all the others, so we don’t have 
to worry about issues of length contraction within each frame. But the lattice of 
a frame moving past you is squashed in the direction of motion, because all the 
meter sticks in that direction are contracted. 

To measure the length of an object in a given frame, we just need to determine 
where the ends are (at simultaneous times, as measured in that frame) with respect 
to the lattice. As far as the synchronization of the clocks within each frame 
goes, this can be accomplished by putting a light source midway between any 
two clocks and sending out signals, and then setting the clocks to a certain 
value when the signals hit them. Alternatively, a more straightforward method 
of synchronization is to start with all the clocks synchronized right next to each 
other, and then move them very slowly to their final positions. Any time-dilation 
effects can be made arbitrarily small by moving the clocks sufficiently slowly. 
This is true because the time-dilation y factor is second order in v, but the time 
it takes a clock to reach its final position is only first order in 1/v. 

This lattice way of looking at things emphasizes that observers are not impor- 
tant, and that a frame is defined simply as a lattice of space and time coordinates. 
Anything that happens (an “event”) is automatically assigned a space and time 
coordinate in every frame, independent of any observer. The concept ofan “event” 
will be very important in the next section. 


11.4 The Lorentz transformations 
11.4.1 The derivation 


Consider a coordinate system, S’, moving relative to another system, S (see 
Fig. 11.17). Let the constant relative speed of the frames be v. Let the corre- 
sponding axes of S and S” point in the same direction, and let the origin of S” 
move along the x axis of S, in the positive direction. Nothing exciting happens 
in the y and z directions (see Problem 11.1), so we’ll ignore them. 

Our goal in this section is to look at two events (an event is anything that 
has space and time coordinates) in spacetime and relate the Ax and At of the 
coordinates in one frame to the Ax’ and At?’ of the coordinates in another. We 
therefore want to find the constants 4, B, C, and D in the relations, 


Ax =AAx'+BAr?, 
At=CAt+DAx’. 


(11.16) 


The four constants here will end up depending on v (which is constant, given the 
two inertial frames). But we won’t explicitly write this dependence, for ease of 
notation. 


S 


Fig. 11.17 
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REMARKS: 


1. We have assumed in Eq. (11.16) that Ax and At are linear functions of Ax’ and At’. And 
we have also assumed that A, B, C, and D are constants, that is, they depend at most on 
v, and not on x, t, x’, t’. 

The first of these assumptions is justified by the fact that any finite interval can be built 
up from a series of many infinitesimal ones. But for an infinitesimal interval, any terms 
such as (A?’)*, for example, are negligible compared with the linear terms. Therefore, if 
we add up all the infinitesimal intervals to obtain a finite one, we will be left with only 
the linear terms. Equivalently, it shouldn’t matter whether we make a measurement with, 
say, meter sticks or half-meter sticks. 

The second assumption can be justified in various ways. One is that all inertial frames 
should agree on what “nonaccelerating” motion is. That is, if Ax’ = u’ At’, then we should 
also have Ax = u At, for some constant u. This is true only if the above coefficients 
are constants, as you can check. Another justification comes from the second of our two 
relativity postulates, which says that all points in (empty) space are indistinguishable. With 
this in mind, let’s assume that we have a transformation of the form, say, Ax = A Ax’ + 
B At + Ex’ Ax’. The x’ in the last term implies that the absolute location in spacetime 
(and not just the relative position) is important. Therefore, this last term cannot exist. 

2. Ifthe relations in Eq. (11.16) turned out to be the usual Galilean transformations (which are 
the ones that hold for everyday relative speeds v) then we would have Ax = Ax’ + v At, 
and At = Af’ (that is, A = C = 1, B = v, and D = 0). We will find, however, that under 
the assumptions of Special Relativity, this is not the case. The Galilean transformations 
are not the correct transformations. But we will show below that the correct transforma- 
tions do indeed reduce to the Galilean transformations in the limit of slow speeds, as they 
must. & 


The constants A, B, C, and D in Eq. (11.16) are four unknowns, and we can 


solve for them by using four facts we found above in Section | 1.3. The four facts 
we will use are: 


Effect Condition Result Eq. in text 
1 Time dilation x =0 t=yt' (11.9) 
2 Length contraction t= x =x/y (11.14) 
3 Relative v of frames x=0 x = —ut' 
4 Rear clock ahead t=0 t = —vx'/c* (11.6) 


We have taken the liberty of dropping the A’s in front of the coordinates, lest 
things get too messy. We will often omit the A’s, but it should be understood that 
x really means Ax, etc. We are always concerned with the difference between 
coordinates of two events in spacetime. The actual value of any coordinate is 
irrelevant, because there is no preferred origin in any frame. 

You should pause for a moment and verify that the four “results” in the above 
table are in fact the proper mathematical expressions for the four effects, given 
the stated “conditions.”*° My advice is to keep pausing until you’re comfortable 


26 We can state the effects in other ways too, by switching the primes and unprimes. For example, 
time dilation can be written as “t = yt when x = 0.” But we’ve chosen the above ways of writing 
things because they will allow us to solve for the four unknowns in the most efficient way. 


11.4 The Lorentz transformations 


with all the entries in the table. Note that the sign in the “rear clock ahead” effect 
is indeed correct, because the front clock shows less time than the rear clock. So 
the clock with the higher x’ value is the one with the lower ¢’ value. 

We can now use our four facts in the above table to quickly solve for the 
unknowns A, B, C, and D in Eq. (11.16). 


Fact (1) gives C = y. 

Fact (2) gives A = y. 

Fact (3) gives B/A =v => B = yv. 

Fact (4) gives D/C = u/c? => D = yv/c’. 


Equations (11.16), which are known as the Lorentz transformations, are therefore 
given by 


Ax = y (Ax +v Af’), 
At=y(Ať +v Ax'/®), 


(11.17) 
Ay = Ay’, 


Az = Az’, 


where 

1 
V1 —v*/c? 
We have tacked on the trivial transformations for y and z, but we won’t bother 
writing these in the future. Also, we'll drop the A’s from now on, but remember 
that they’re always really there. 


If we solve for x’ and ¢’ in terms of x and ¢ in Eq. (11.17), then we see that the 
inverse Lorentz transformations are given by 


y (11.18) 


x = y (x z vt), 
(11.19) 
t = y(t—vx/c’). 


Of course, which ones you label as the “inverse” transformations depends on 
your point of view. But it’s intuitively clear that the only difference between the 
two sets of equations is the sign of v, because S is simply moving backward with 
respect to S’. 

The reason why the derivation of Eqs. (11.17) was so quick is that we 
already did most of the work in Section 11.3 when we derived the fundamental 
effects. If we wanted to derive the Lorentz transformations from scratch, that is, 
by starting with the two postulates in Section 11.2, then the derivation would be 
longer. In Appendix I we give such a derivation, where it is clear what infor- 
mation comes from each of the postulates. The procedure there is somewhat 
cumbersome, but it’s worth taking a look at, because we will invoke the result in 
a very cool way in Section 11.10. 
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REMARKS: 


1. In the limit v « c (or more precisely, in the limit vx’ /c? « t', which means that even if v 


is small, we have to be careful that x’ isn’t too large), Eqs. (11.17) reduce tox = x’+ vt and 
t = t', which are the good old Galilean transformations. This must be the case, because we 
know from everyday experience (where v < c) that the Galilean transformations work 
just fine. 


. Equations (11.17) exhibit a nice symmetry between x and ct. With 8 = v/c, we have 


x= y(x' + B(ct')), 


(11.20) 
ct = y((ct’) + px’). 


Equivalently, in units where c = 1 (for example, where one unit of distance equals 3 - 108 
meters, or where one unit of time equals 1/(3 - 108) seconds), Eqs. (11.17) take the 
symmetric form, 


x= yr’ +01), 


(11.21) 
t= y(t +x’). 


. In matrix form, Eq. (11.20) can be written as 


x \_( vy yvB x! 
ORo 12 
This looks similar to a rotation matrix. More about this in Section 11.9, and in 
Problem 11.27. 


. We did the above derivation in terms of a primed and an unprimed system. But when 


you’re doing problems, it’s usually best to label your coordinates with subscripts such 
as A for Alice, or T for train. In addition to being more informative, this notation is less 
likely to make you think that one frame is more fundamental than the other. 


. It’s easy to get confused about the sign on the right-hand side of the Lorentz transforma- 


tions. To figure out if it should be a plus or a minus, write down x4 = y (xg + vtg), and 
then imagine sitting in system A and looking at a fixed point in B. This fixed point satisfies 
(putting the A’s back in to avoid any mixup) Axg = 0, which gives Axg = +yvAtpg. 
So if the point moves to the right (that is, if it increases as time increases), then pick the 
“+.” And if it moves to the left, then pick the “—.” In other words, the sign is determined 
by which way A (the person associated with the coordinates on the left-hand side of the 
equation) sees B (ditto for the right-hand side) moving. 


. One very important thing we must check is that two successive Lorentz transformations 


(from Sı to S2 and then from S2 to $3) again yield a Lorentz transformation (from S1 
to $3). This must be true because we showed that any two frames must be related by 
Eqs. (11.17). If we composed two L.T.’s (along the same direction) and found that the 
transformation from Sı to $3 was not of the form of Eqs. (11.17), for some new v, then the 
whole theory would be inconsistent, and we would have to drop one of our postulates.’ 
You can show that the combination of an L.T. (with speed vı) and an L.T. (with speed v2) 
does indeed yield an L.T., and it has speed (vı + v2)/(1 + vj v2/C?). This is the task of 
Exercise 11.47, and also Problem 11.27 (which is stated in terms of rapidity, introduced 


27 This statement is true only for the composition of two L.T.’s in the same direction. If we composed 
an L.T. in the x direction with one in the y direction, the result would interestingly not be an L.T. 
along some new direction, but rather the composition of an L.T. along some direction and a 
rotation through some angle. This rotation results in what is known as the Thomas precession. 
See the appendix of Muller (1992) for a quick derivation of the Thomas precession. For further 
discussion, see Costella et al. (2001) and Rebilas (2002). 
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in Section 11.9). This resulting speed is one that we'll see again when we get to the 
velocity-addition formula in Section 11.5.1. & 


Example: A train with proper length L moves at speed 5c/13 with respect to the 
ground. A ball is thrown from the back of the train to the front. The speed of the ball 
with respect to the train is c/3. As viewed by someone on the ground, how much time 
does the ball spend in the air, and how far does it travel? 


Solution: The y factor associated with the speed 5c/13 is y = 13/12. The two 
events we are concerned with are “ball leaving back of train” and “ball arriving at front 
of train.” The spacetime separation between these events is easy to calculate on the 
train. We have Axr = L, and Atr = L/(c/3) = 3L/c. The Lorentz transformations 
giving the coordinates on the ground are therefore 


~ yop tony = 2 E (22) 
eee ee se Bikes) 30 


13 /3L  (5c/13)L 11L 
tg = y (tr + vxr/c?) = ( / )- 3c 


(11.23) 


12 \ c c2 


In a given problem, such as the above example, one of the frames usually 
allows for a quick calculation of Ax and Aż, so you simply have to mechanically 
plug these quantities into the L.T.’s to obtain Ax’ and Af’ in the other frame, 
where they may not be as obvious. 

Relativity is a subject in which there are usually many ways to doa problem. If 
you’re trying to find some Ax’s and At?’s, then you can use the L.T.’s, or perhaps 
the invariant interval (introduced in Section 11.6), or maybe a velocity-addition 
approach (introduced in Section 11.5.1), or even the sending-of-light-signals 
strategy used in Section 11.3. Depending on the specific problem and what your 
personal preferences are, certain approaches will be more enjoyable than others. 
But no matter which method you choose, you should take advantage of the 
plethora of possibilities by picking a second method to double-check your answer. 
Personally, I find the L.T.’s to be the perfect option for this, because the other 
methods are generally more fun when solving a problem for the first time, while 
the L.T.’s are usually quick and easy to apply (perfect for a double-check).** 


The excitement will build in your voice, 
As you rise from your seat and rejoice, 
“A Lorentz transformation 

Provides information, 

As an alternate method of choice!” 


28 I would, however, be very wary of solving a problem using only the L.T.’s, with no other check, 
because it’s very easy to mess up a sign in the transformations. And since there’s nothing to do 
except mechanically plug in numbers, there’s not much opportunity for an intuitive check, either. 
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11.4.2 The fundamental effects 


Let’s now see how the Lorentz transformations imply the three fundamental 
effects (namely, loss of simultaneity, time dilation, and length contraction) dis- 
cussed in Section 11.3. Of course, we just used these effects to derive the L.T.’s, 
so we know everything will work out. We’ll just be going in circles. But since 
these fundamental effects are, well, fundamental, let’s belabor the point and 
discuss them one more time, with the starting point being the L.T.’s. 


Loss of simultaneity 

Let two events occur simultaneously in frame S’. Then the separation between 
them, as measured by S”, is (x’, t’) = (x’,0). As usual, we are not bothering to 
write the A’s in front of the coordinates. Using the second of Eqs. (11.17), we see 
that the time between the events, as measured by S, is t = yux’/c*. This is not 
equal to zero (unless x’ = 0). Therefore, the events do not occur simultaneously 
in frame S. 


Time dilation 

Consider two events that occur in the same place in S’. Then the separation 
between them is (x’, t) = (0, t’). Using the second of Eqs. (11.17), we see that 
the time between the events, as measured by S, is 


t=yt (ifx’ =0). (11.24) 


The factor y is greater than or equal to 1, so ¢ > t’. The passing of one second on 
S’’s clock takes more than one second on S’s clock. S sees S’ drinking his coffee 
very slowly. 

The same strategy works if we interchange S and S’. Consider two events 
that occur in the same place in S. The separation between them is (x, t) = (0,1). 
Using the second of Eqs. (11.19), we see that the time between the events, as 
measured by S’, is 


t =yt (ifx= 0). (11.25) 
Therefore, t/ > t. Another way to derive this is to use the first of Eqs. (11.17) to 
write x’ = —v¢', and then substitute this into the second equation. 


REMARK: — If we write down the two above equations by themselves, t = yt’ and t’ = yt, they 
appear to contradict each other. This apparent contradiction arises from the omission of the 
conditions they are based on. The former equation is based on the assumption that x’ = 0. The 
latter equation is based on the assumption that x = 0. They have nothing to do with each other. 
It would perhaps be better to write the equations as 


(t= yt)voo, and (f = yt)x=0, (11.26) 


but this is somewhat cumbersome. & 


11.5 Velocity addition 


Length contraction 
This proceeds just like the time dilation above, except that now we want to set 
certain time intervals equal to zero, instead of certain space intervals. We want 
to do this because to measure a length, we calculate the distance between two 
points whose positions are measured simultaneously. That’s what a length is. 
Consider a stick at rest in S’, where it has length 2’. We want to find the length £ 
in S. Simultaneous measurements of the coordinates of the ends of the stick in S 
yield a separation of (x, t) = (x, 0). Using the first of Eqs. (11.19), we have 


x =yx (ift=0). (11.27) 


But x is by definition the length in S. And x’ is the length in S’, because the stick 
isn’t moving in S’.?” Therefore, £ = €'/y. And since y > 1, we have £ < V, 
so S' sees the stick shorter than S” sees it. 

Now interchange S and S”. Consider a stick at rest in S, where it has length £. 
We want to find the length in S’. Measurements of the coordinates of the ends of 
the stick in S’ yield a separation of (x’, t’) = (x’, 0). Using the first of Eqs. (11.17), 
we have 


x=yx (if =0). (11.28) 


But x’ is by definition the length in S”. And x is the length in S, because the stick 
is not moving in S. Therefore, V = £/y, so V < £. 


Remark: As with time dilation, if we write down the two equations by themselves, £ = ¢'/y 
and ¢’ = ¢/y, they appear to contradict each other. But as before, this apparent contradiction 
arises from the omission of the conditions they are based on. The former equation is based on 
the assumptions that t = 0 and that the stick is at rest in S’. The latter equation is based on the 
assumptions that ¢’ = 0 and that the stick is at rest in S. They have nothing to do with each 
other. We should really write, 


(x=x'/y)=0, and (x =x/y)y=0, (11.29) 


and then identify x’ in the first equation with £’ only after invoking the further assumption that 
the stick is at rest in S’. Likewise for the second equation. But this isa pain. & 


11.5 Velocity addition 
11.5.1 Longitudinal velocity addition 


Consider the following setup. An object moves at speed vı with respect to 
frame S’. And frame S’ moves at speed v2 with respect to frame S, in the same 
direction as the motion of the object (see Fig. 11.18). What is the speed, u, of the 
object with respect to frame S? 


2° The measurements of the ends made by S are not simultaneous in the S’ frame. In the S” frame, 
the separation between the events is (x’, t’), where both x’ and ¢’ are nonzero. This doesn’t satisfy 
our definition of a length measurement in S” (because t’ Æ 0), but the stick isn’t moving in S’, so 
S’ can measure the ends whenever he feels like it, and he will always get the same difference. So 
x’ is indeed the length in the S’ frame. 


Fig. 11.18 
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The Lorentz transformations can be used to easily answer this question. The 
relative speed of the frames is v2. Consider two events along the object’s path (for 
example, say it makes two beeps). We are given that Ax’/A?’ = v1. Our goal is 
to find u = Ax/At. The Lorentz transformations from S’ to S, Eqs. (11.17), are 


Ax = y2(Ax’ + vA), and At = y2(At’ + v2Ax'/c?), (11.30) 


where y2 = 1/,/1 — v3/c?. Therefore, 


= he Ax’ + v At 
= At At’ + v2 Ax! /c? 
Ax'/At’ + v2 
1 + u2(Ax’/At')/c? 
V1 + U2 


= ——_—\_ <—<_—, 11.31 
1+ v,v2/c? ( ) 


This is the velocity-addition formula, for adding velocities along the same line. 
Let’s look at some of its properties. 


e It is symmetric with respect to vı and v2, as it should be, because we could switch the 
roles of the object and frame S. 

e For vjv2 & c?, it reduces to u ~ vy + v2, which we know holds perfectly well for 
everyday speeds. 

e Ifv; = c or v = c, then we find u = c, as should be the case, because anything that 
moves with speed c in one frame moves with speed c in another. 

e The maximum (or minimum) of u in the region —c < vj,v2 < c equals c (or —c), 
which can be seen by noting that du/dv, and du/dv2 are never zero in the interior of 
the region. 


If you take any two velocities that are less than c and add them according to 
Eq. (11.31), then you will obtain a velocity that is again less than c. This shows 
that no matter how much you keep accelerating an object (that is, no matter how 
many times you give the object a speed vı with respect to the frame moving at 
speed v2 that it was just in), you can’t bring the speed up to the speed of light. 
We’ ll give another argument for this result in Chapter 12 when we discuss energy. 


For a bullet, a train, and a gun, 
Adding the speeds can be fun. 
Take a trip down the path 

Paved with Einstein’s new math, 
Where a half plus a half isn’t one. 


REMARK: Consider the two scenarios shown in Fig. 11.19. If the goal is to find the velocity of 
A with respect to C, then the velocity-addition formula applies to both scenarios, because the 
second scenario is the same as the first one, as observed in B’s frame. 


11.5 Velocity addition 


The velocity-addition formula applies when we ask, “If A moves at v with respect to B, 
and B moves at v2 with respect to C (which means, of course, that C moves at speed v2 with 
respect to B), then how fast does A move with respect to C?” The formula does not apply if we 
ask the more mundane question, “What is the relative speed of A and C, as viewed by B?” The 
answer to this is just vı + v2. 

In short, if the two velocities are given with respect to the same observer, say B, and if 
you are asking for the relative velocity as measured by B, then you simply have to add the 
velocities.°° But if you are asking for the relative velocity as measured by A or C, then you 
have to use the velocity-addition formula. It makes no sense to add velocities that are measured 
with respect to different observers. Doing so would involve adding things that are measured 
in different coordinate systems, which is meaningless. In other words, taking the velocity of 
A with respect to B and adding it to the velocity of B with respect to C, hoping to obtain the 
velocity of A with respect to C, is invalid. & 


Example (Passing trains, again): Consider again the scenario in the “Passing 


trains” example in Section 11.3.3. 


(a) How long, as viewed by A and as viewed by B, does it take for A to overtake B? 

(b) Let event E4 be “the front of A passing the back of B”, and let event E> be “the 
back of A passing the front of B.” Person D walks at constant speed from the 
back of B to the front (see Fig. 11.20), such that he coincides with both events 
E; and E2. How long does the “overtaking” process take, as viewed by D? 


Solution: 


(a) First consider B’s point of view. From the velocity-addition formula, B sees A 
move with speed 


4c 3c 
z-57 _ 5e 


>> =. (11.32) 
4 3 

1-3-3 13 
The y factor associated with this speed is y = 13/12. Therefore, B sees 
A’s train contracted to a length 122/13. During the overtaking, A must travel a 
distance equal to the sum of the lengths of the trains in B’s frame (see Fig. 11.21), 
which is L + 12L/13 = 25L/13. Since A moves at speed 5c/13, the total time 
in B’s frame is 

25L/13 5L 


tg = ars, 11.33 
a eB e (iia) 


The exact same reasoning holds from A’s point of view, so we have 
u= i= 5L/c. 


30 Note that the resulting speed can certainly be greater than c. If I see a ball heading toward me at 
0.9c from the right, and another one heading toward me at 0.9c from the left, then the relative 
speed of the balls in my frame is 1.8c. In the frame of one of the balls, however, the relative speed 
is (1.8/1.81)c © (0.9945)c, from Eq. (11.31). 


Fig. 11.20 


B's frame 


Fig. 11.21 
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(b) Look at things from D’s point of view. D is at rest, and the two trains move 
with equal and opposite speeds v (see Fig. 11.22), because otherwise the second 
event E> wouldn’t be located at D. The relativistic addition of v with itself is 
the speed of A as viewed by B. But from part (a), we know that this relative 
speed equals 5c/13. Therefore, 


2v 5c 


E 
= =E, 11.34 
14/2 B Tag oo 


where we have ignored the unphysical solution, v = 5c. The y factor associated 
with v = c/5 is y = 5/(2V6). So D sees both trains contracted to a length 
2./6L/5. During the overtaking, each train must travel a distance equal to its 
length, because both events, E4 and £4, take place right at D. The total time 
in D’s frame is therefore 

_ 2V6L/5 _ 2V6L 


tp = = F 11.35 
D c/5 F ( ) 


Remarks: There are a few double-checks we can perform. The speed of D with respect 
to the ground can be obtained either via B’s frame by relativistically adding 3c/5 and c/5, 
or via A’s frame by subtracting c/5 from 4c/5. These both give the same answer, namely 
5c/7, as they must. (The c/5 speed can in fact be determined by this reasoning, instead of 
using Eq. (11.34).) The y factor between the ground and D is therefore 7 /2./6. We can 
then use time dilation to say that someone on the ground sees the overtaking take a time 
of (7/2./6)tp (we can say this because both events happen right at D). Using Eq. (11.35), 
this gives a ground-frame time of 7L/c, in agreement with Eq. (11.15). Likewise, the y 
factor between D and either train is 5/2./6. So the time of the overtaking as viewed by 
either A or B is (5/2/6)tp = 5L/c, in agreement with Eq. (11.33). 

Note that we cannot use simple time dilation to relate the ground to A or B, because 
the two events don’t happen at the same place in the train frames. But since both events 
happen at the same place in D’s frame, namely right at D, it’s legal to use time dilation to 
go from D’s frame to any other frame. & 


11.5.2 Transverse velocity addition 


Consider the following general two-dimensional situation. An object moves with 
velocity (w, uy) with respect to frame S’. And frame S’ moves with speed v 
with respect to frame S, in the x direction (see Fig. 11.23). What is the velocity, 
(ux, Uy), Of the object with respect to frame S? 

The existence of motion in the y direction doesn’t affect the preceding deriva- 
tion of the speed in the x direction, so Eq. (11.31) is still valid. In the present 


notation, it becomes 


/ 
u, HU 


Ta ae (11.36) 


ux 


11.6 The invariant interval 


To find u, we can again make easy use of the Lorentz transformations. 
Consider two events along the object’s path. We are given that Ax’/Ad’ = uk, 
and Ay/At! = Uy. Our goal is to find wu, = Ay/At. The relevant Lorentz 
transformations from S’ to S in Eq. (11.17) are 


Ay=Ay’, and At=y(At' +vAx'/c’). (11.37) 
Therefore, 
Ay Ay’ 
Uy = = 7 
At y (At + vAx'/c*) 
a Ay’ /At’ 
y+ v(Ax'/At’)/c?) 
ul 
a e (11.38) 


y (l +ulu/c?) ` 


REMARK: In the special case where ui = 0, we have u, = ul, /y. When u, is small and v is 
large, this result can be seen to be a special case of time dilation, in the following way. Consider 
a series of equally spaced lines parallel to the x axis (see Fig. 11.24). Imagine that the object’s 
clock ticks once every time it crosses a line. Since u/, is small, the object’s frame is essentially 
frame S’. So if S flies by to the left, then the object is essentially moving at speed v with respect 
to S. Therefore, S sees the clock run slow by a factor y. This means that S sees the object cross 
the lines at a slower rate, by a factor y (because the clock still ticks once every time it crosses 
a line; this is a frame-independent statement). Since distances in the y direction are the same 
in the two frames, we conclude that u, = u’, /y. This y factor will be very important when we 
deal with momentum in Chapter 12. 

To sum up: if you run in the x direction past an object, then its y speed is slower in your 
frame (or faster, depending on the relative sign of u/. and v). Strange indeed, but no stranger 
than other effects we’ve seen. Problem 11.16 deals with the special case where ui = 0, but 
where u’, is not necessarily small. &æ 


11.6 The invariant interval 


Consider the quantity, 


(As)? = c?(At)? — (Ax). (11.39) 


Technically, we should also subtract off (Ay)? and (Az)*, but nothing exciting 
happens in the transverse directions, so we’ll ignore them. Using Eq. (11.17), 
we can write (As)? in terms of the S’ coordinates, Ax’ and At’. The result is 
(dropping the A’s) 


ap a = c(t + vx! ey (x! ait ot’)? 


1 — v2/c? 1 — v2 /c? 
T t? (e _ v?) —x?(1 _ vu? /c*) 
1—v2/c? 


= ct? —x?, (11.40) 
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We see that the Lorentz transformations imply that the quantity c7/? — x? doesn’t 
depend on the frame. This result is more than we bargained for, for the following 
reason. The speed-of-light postulate says that if c?#’? — x? = 0, then °t — 
x? =0. But Eq. (11.40) says that if c*7’2 — x? = b, then c? — x? = b, for 
any value of b, not just zero. This is, as you might guess, very useful. There are 
enough things that change when we go from one frame to another, so it’s nice 
to have a frame-independent quantity that we can hang on to. The fact that s? is 
invariant under Lorentz transformations of x and ¢ is exactly analogous to the fact 
that 7? is invariant under rotations in the x-y plane. The coordinates themselves 
change under the transformation, but the special combination of c*#* — x? for 
Lorentz transformations, or x? + y? for rotations, remains the same. All inertial 
observers agree on the value of s*, independent of what they measure for the 
actual coordinates. 


“Potato?! Potahto!” said she, 

“And of course it’s tomahto, you see. 
But the square of ct 

Minus x? will be 

Always something on which we agree.” 


Anote on terminology: The separation in the coordinates, (cAt, Ax), is usually 
referred to as the spacetime interval, while the quantity (As)? = c? (At)? — (Ax)? 
is referred to as the invariant interval (or technically the square of the invariant 
interval). At any rate, just call it s*, and people will know what you mean. The 
invariance of s* is actually just a special case of more general results involving 
inner products and 4-vectors, which we’ll discuss in Chapter 13. Let’s now 
look at the physical significance of s? = c?t? — x?; there are three cases to 
consider. 


Case 1: s? > 0 (timelike separation) 

In this case, we say that the two events are timelike separated. We have c7t? > x’, 
and so |x/t| < c. Consider a frame S” moving at speed v with respect to S. The 
Lorentz transformation for x is 


x’ = y(x — vt). (11.41) 


Since |x/t| < c, there exists a v that is less than c (namely v = x/t) that makes 
x’ = 0. In other words, if two events are timelike separated, it is possible to 
find a frame S’ in which the two events happen at the same place. (In short, the 
condition |x/t| < c means that it is possible for a particle to travel from one event 
to the other.) The invariance of s? then gives s? = ct’? — x’? = c*t’”. So we see 
that s/c is the time between the events in the frame in which the events occur at 
the same place. This time is called the proper time. 


11.6 The invariant interval 


Case 2: s? <0 (spacelike separation) 

In this case, we say that the two events are spacelike separated.°! We have 
ct? < x’, and so |t/x| < 1/c. Consider a frame S’ moving at speed v with 
respect to S. The Lorentz transformation for t’ is 


t = y(t —vx/c’). (11.42) 


Since |t/x| < 1/c, there exists a v that is less than c (namely v = c’t/x) that 
makes ¢’ = 0. In other words, if two events are spacelike separated, it is possible 
to find a frame S” in which the two events happen at the same time. (This statement 
is not as easy to see as the corresponding one in the timelike case above. But if 
you draw a Minkowski diagram, described in the next section, it becomes clear.) 
The invariance of s? then gives s? = c?t’? — x? = —x’?. So we see that |s| is the 
distance between the events in the frame in which the events occur at the same 
time. This distance is called the proper distance, or proper length. 


Case 3:s* =0 (lightlike separation) 

In this case, we say that the two events are lightlike separated. We have c7t? = x’, 
and so |x/t| = c. This holds in every frame, so in every frame a photon emitted 
at one of the events will arrive at the other. It is not possible to find a frame S’ 
in which the two events happen at the same place or the same time, because the 
frame would have to travel at the speed of light. 


Example (Time dilation): An illustration of the usefulness of the invariance of s? 
is a derivation of time dilation. Let frame S’ move at speed v with respect to frame S. 
Consider two events at the origin of S”, separated by time ¢’. The separation between 
the events is 


in S: (x,t) = (0,1), 


(11.43) 
in S: (x,t) = (vt, t). 
The invariance of s? implies ct? — 0 = t — v*t*. Therefore, 
t 
t = ——— (11.44) 


This method makes it clear that the time-dilation result rests on the assumption that 
/ 
x =0. 


Example (Passing trains, yet again): Consider again the scenario in the 
“Passing trains” examples in Sections 11.3.3 and 11.5.1. Verify that the s* between 
the events £ and £ is the same in all of the frames, A, B, C, and D (see Fig. 11.25). 


31 Tes fine that s? is negative in this case, which means that s is imaginary. We can take the absolute 
value of s if we want to obtain a real number. 


Fig. 11.25 
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Solution: The only quantity that we’! need that we haven’t already found in the two 
examples above is the distance between £1 and E> in C’s frame (the ground frame). 
In this frame, train A travels at a rate 4c/5 for a time tc = 7L/c, covering a distance 
of 28Z/5. But event E2 occurs at the back of the train, which is a distance 3L/5 
behind the front end (this is the contracted length in the ground frame). Therefore, 
the distance between events £; and E> in the ground frame is 28Z/5 — 3L/5 = SL. 
You can also apply the same line of reasoning using train B, in which the 5L result 
takes the form, (3c/5)(7L/c) + 4L/5. 

Putting the previous results together, we have the following separations between 
the events in the various frames: 


A B C D 


At 5L/c 5L/e TL/e 2V6L/c 
Ax -L L SL 0 


From the table, we see that As? = c2 At? — Ax? = 241? for all four frames, as 
desired. We could have worked backwards, of course, and used the s? = 242? result 
from frames A, B, or D, to deduce that Ax = 5Z in frame C. In Problem 11.10, you 
are asked to perform the tedious task of checking that the values in the above table 
satisfy the Lorentz transformations between the six different pairs of frames. 


11.7 Minkowski diagrams 


Minkowski diagrams (sometimes called “spacetime” diagrams) are extremely 
useful in seeing how coordinates transform between different reference frames. 
If you want to produce exact numbers in a problem, you’ll probably have to use 
one of the strategies we’ve encountered so far. But as far as getting an overall 
intuitive picture of a setup goes (if there is in fact any such thing as intuition in 
relativity), there is no better tool than a Minkowski diagram. Here’s how you 
make one. 

Let frame S’ move at speed v with respect to frame S (along the x axis, as 
usual, and ignore the y and z components). Draw the x and ct axes of frame S.°? 
What do the x’ and ct’ axes of S’ look like, superimposed on this diagram? That 
is, at what angles are the axes inclined, and what is the size of one unit on these 
axes? (There is no reason why one unit on the x’ and ct’ axes should have the 
same length on the paper as one unit on the x and ct axes.) We can answer these 
questions by using the Lorentz transformations, Eqs. (11.17). We’ll first look at 
the ct’ axis, and then the x’ axis. 


32 We choose to plot ct instead of t on the vertical axis, so that the trajectory of a light beam lies at 
a nice 45° angle. Alternatively, we could choose units where c = 1. 


11.7 Minkowski diagrams 


ct' -axis angle and unit size 

Look at the point (x’, ct’) = (0, 1), which lies on the cf’ axis, one ct’ unit from 
the origin (see Fig. 11.26). Equations (11.17) tell us that this point is the point 
(x, ct) = (yv/c, y). The angle between the ct’ and ct axes is therefore given by 
tan 01 = x/ct = v/c. With B = v/c, we have 


tan 6, = £. (11.45) 


Alternatively, the ct’ axis is simply the “worldline” of the origin of S’. (A worldline 
is the path an object takes as it travels through spacetime.) The origin moves at 
speed v with respect to S. Therefore, points on the ct’ axis satisfy x/t = v, or 
x/ct = v/e. 

On the paper, the point (x’, ct’) = (0, 1), which we just found to be the point 
(x, ct) = (yv/c, y), isa distance y y 1 + v?/c? from the origin. Therefore, using 
the definitions of 6 and y, we see that 


t' unit 1+ 8? 
one c om _ +£ , (11.46) 
one ct unit 1 — p? 


as measured on a grid where the x and ct axes are orthogonal. This ratio 
approaches infinity as 6 —> 1. And it of course equals 1 if 8 = 0. 


x'-axis angle and unit size 

The same basic argument holds here. Look at the point (x’, ct’) = (1,0), which 
lies on the x’ axis, one x’ unit from the origin (see Fig. 11.26). Equations (11.17) 
tell us that this point is the point (x, ct) = (y, yv/c). The angle between the x’ 
and x axes is therefore given by tan 62 = ct/x = v/c. So, as in the ct’-axis case, 


tan 6) = £. (11.47) 


On the paper, the point (x’, ct’) = (1,0), which we just found to be the point 
(x,ct) = (y,yv/c), is a distance y./1 + v2/c* from the origin. So, as in the 


ct'-axis case, 
one x’ unit  /1+ £? (11.48) 
onexunit \ 1— 2’ f 


as measured on a grid where the x and ct axes are orthogonal. Both the x’ and 
ct’ axes are therefore stretched by the same factor, and tilted in by the same 
angle, relative to the x and ct axes. This “squeezing in” of the axes in a Lorentz 
transformation is different from what happens in a rotation, where both axes 
rotate in the same direction. 


Remarks: If u/c = B = 0, then 6; = 6) = 0, so the ct’ and x’ axes coincide with the ct and 
x axes, as they should. If £ is very close to 1, then the x’ and ct’ axes are both very close to 
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the 45° light-ray line. Note that since 6; = 62, the light-ray line bisects the x’ and cr’ axes. 
Therefore (as we verified above), the scales on these axes must be the same, because a light 
ray must satisfy x’ = ct’. & 

We now know what the x’ and ct’ axes look like. Given any two points in 
a Minkowski diagram (that is, given any two events in spacetime), we can just 
read off the Ax, Act, Ax’, and Act’ quantities that our two observers measure, 
assuming that our graph is accurate enough. Although these quantities must of 
course be related by the Lorentz transformations, the advantage of a Minkowski 
diagram is that you can actually see geometrically what’s going on. 

There are very useful physical interpretations of the cr’ and x’ axes. If you 
stand at the origin of S’, then the ct’ axis is the “here” axis, and the x’ axis is the 
“now” axis (the line of simultaneity). That is, all events on the ct’ axis take place 
at your position (the ct’ axis is your worldline, after all), and all events on the x’ 
axis take place simultaneously (they all have £? = 0). 


Example (Length contraction): For both parts of this problem, use a Minkowski 


diagram where the axes in frame S are orthogonal. 


(a) The relative speed of S’ and S is v (along the x direction). A meter stick lies 
along the x’ axis and is at rest in S’. If S measures its length, what is the result? 

(b) Now let the meter stick lie along the x axis and be at rest in S. If S’ measures 
its length, what is the result? 


Solution: 


(a) Without loss of generality, pick the left end of the stick to be at the origin in S”. 
Then the worldlines of the two ends are shown in Fig. 11.27. The distance AC 
is 1 meter in the S’ frame, because A and C are the endpoints of the stick at 
simultaneous times in the S’ frame; this is how a length is measured. And since 
one unit on the x’ axis has length //1 + B2/,/1 — B2, this is the length on the 
paper of the segment AC. 

How does S measure the length of the stick? He writes down the x coordinates 
of the ends at simultaneous times (as measured by him, of course), and takes 
the difference. Let the time he makes the measurements be £ = 0. Then he 
measures the ends to be at the points A and B.?? Now it’s time to do some 
geometry. We have to find the length of segment AB in Fig. 11.27, given that 


segment AC has length y 1 + B2/,/1 — 62. We know that the primed axes are 
tilted at an angle 0, where tan 0 = £. Therefore, CD = (AC) sin 0. And since 


33 If S measures the ends in a dramatic fashion by, say, blowing them up, then S” will see the right 
end blow up first (the event at B has a negative t’ coordinate, because it lies below the x’ axis), and 
then a little while later S’ will see the left end blow up (the event at A has ¢’ = 0). So S measures 
the ends at different times in the S” frame. This is part of the reason why S’ should not be at all 
surprised that S’s measurement is smaller than one meter. 


11.8 The Doppler effect 


ZBCD = 98, we have BD = (CD) tan@ = (AC) sin 9 tan 0. Therefore (using 
tan = £), 


AB = AD — BD = (AC) cos 0 — (AC) sin 8 tan 0 
= (AC) cos (1 — tan?0) 


_ fits? 1 (1 — 2) 
V 1-6 14 82 
=/1-p2. (11.49) 


Therefore, S measures the meter stick to have length y 1 — 82, which is the 
standard length-contraction result. 


(b) The stick is now at rest in S, and we want to find the length that S’ measures. 
Pick the left end of the stick to be at the origin in S. Then the worldlines of the 
two ends are shown in Fig. 11.28. The distance AB is 1 meter in the S frame. 

In measuring the length of the stick, S’ writes down the x’ coordinates of the 
ends at simultaneous times (as measured by him), and takes the difference. Let 
the time he makes the measurements be ¢’ = 0. Then he measures the ends to 
be at the points A and E. Now we do the geometry, which is easy in this case. 
The length of AE is simply 1/cos@ = y 1 + 82. But since one unit along the 
x’ axis has length y 1 + 62/,/1 — B? on the paper, we see that AE is y 1 — 82 
of one unit in the S’ frame. Therefore, S’ measures the meter stick to have 
length y 1 — £2, which again is the standard length-contraction result. 


The analysis used in the above example also works for time intervals. 
The derivation of time dilation, using a Minkowski diagram, is the task of 
Exercise 11.62. And the derivation of the Lv/c” rear-clock-ahead result is the 
task of Exercise 11.63. 


11.8 The Doppler effect 
11.8.1 Longitudinal Doppler effect 


Consider a source that emits flashes at frequency f” (in its own frame) while 
moving directly toward you at speed v, as shown in Fig. 11.29. With what fre- 
quency do the flashes hit your eye? In these Doppler-effect problems, you must 
be careful to distinguish between the time at which an event occurs in your frame, 
and the time at which you see the event occur. This is one of the few situations 
where we are concerned with the latter. 

There are two effects contributing to the longitudinal Doppler effect. The first 
is relativistic time dilation. There is more time between the flashes in your frame, 
which means that they occur at a smaller frequency. The second is the everyday 
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Doppler effect (as with sound), arising from the motion of the source. Successive 
flashes have a smaller (or larger, if v is negative) distance to travel to reach your 
eye. This effect increases (or decreases, if v is negative) the frequency at which 
the flashes hit your eye. 

Let’s now be quantitative and find the observed frequency. The time between 
emissions in the source’s frame is At! = 1/f’. The time between emissions in 
your frame is then At = y Av’, by the usual time dilation. So the photons of one 
flash have traveled a distance (in your frame) of cAt = cy A?’ by the time the 
next flash occurs. During this time between emissions, the source has traveled a 
distance vAt = vy Ať toward you in your frame. Therefore, at the instant the 
next flash occurs, the photons of this next flash are a distance (in your frame) of 
cAt — vAt = (c — v)y At’ behind the photons of the previous flash. This result 
holds for all adjacent flashes. The time, AT, between the arrivals of the flashes 
at your eye is 1/c times this distance, so we have 


o1 B i l- r_ jl= Pps 
SETE eae A 


where 6 = v/c. Therefore, the frequency you see is 


o 1 _ [A+ By 
L= Tg (11.51) 


If B > 0 (that is, the source is moving toward you), then f >f’. The everyday 
Doppler effect wins out over the time-dilation effect. In this case we say that 
the light is “blueshifted,” because blue light is at the high-frequency end of the 
visible spectrum. The light need not have anything to do with the color blue, of 
course; by “blue” we just mean that the frequency is increased. If 6 < 0 (that 
is, the source is moving away from you), then f < f’. Both effects serve to 
decrease the frequency. In this case we say that the light is “redshifted,” because 
red light is at the low-frequency end of the visible spectrum. 


REMARK: We can also derive Eq. (11.51) by working in the frame of the source. In this frame, 
the distance between successive flashes is cA’. And since you are moving toward the source at 
speed v, the relative speed of you and a given flash is c + v. So the time between your running 
into successive flashes is cA¢’ /(c +v) = At’ /(1 + B), as measured in the frame of the source. 
But your clock runs slow in this frame, so a time of only AT = (1/y)At’/(1 + £) elapses on 
your watch, which you can show agrees with the time in Eq. (11.50). 

We certainly needed to obtain the same result by working in the frame of the source, because 
the principle of relativity states that the result can’t depend on which object we consider to be 
the one at rest; there is no preferred frame. This is different from the situation with the standard 
nonrelativistic Doppler effect (relevant to a siren moving toward you), because the frequency 
there does depend on whether you or the source is the one that is moving. The reason for this is 
that when we say “moving” here, we mean moving with respect to the rest frame of the air, which 
is the medium that sound travels in. We therefore do in fact have a preferred frame of reference, 
unlike in relativity (where there is no “ether”). Using the arguments given above for the two 
different frames, but without the y factors, you can show that the two nonrelativistic Doppler 
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results are f = f’/(1 — B) if the source is moving toward a stationary you, and f = (1 + yf” 
if you are moving toward the stationary source. Here £ is the ratio of the speed of the moving 
object to the speed of sound. In view of these two different results, the relativistic Doppler 
effect can be considered to be a simpler effect, in the sense that there is only one frequency to 
remember. & 


11.8.2 Transverse Doppler effect 


Let’s now consider a two-dimensional situation. Consider a source that emits 
flashes at frequency f” (in its own frame), while moving across your field of 
vision at speed v. There are two reasonable questions we can ask about the 
frequency you observe: 


e Case 1: At the instant the source is at its closest approach to you, with what frequency 
do the flashes hit your eye? 

e Case 2: When you see the source at its closest approach to you, with what frequency 
do the flashes hit your eye? 


The difference between these two scenarios is shown in Fig. 11.30 and 
Fig. 11.31, where the source’s motion is taken to be parallel to the x axis. In 
the first case, the photons you see must have been emitted at an earlier time, 
because the source moves during the nonzero time it takes the light to reach you. 
In this scenario, we are dealing with photons that hit your eye when (as measured 
in your frame) the source crosses the y axis. You therefore see the photons come 
in at an angle with respect to the y axis. 

In the second case, you see the photons come in along the y axis (by the 
definition of this scenario). At the instant you observe one of these photons, the 
source is at a position past the y axis. Let’s find the observed frequencies in these 
two cases. 


Case 1: Let your frame be S, and let the source’s frame be S”. Consider the 
situation from S’’s point of view. S” sees you moving across his field of vision at 
speed v. The relevant photons hit your eye when you cross the y’ axis (defined to 
be the axis that passes through the source) of the S” frame. S” sees you get hit by 
a flash every Ar’ = 1/f” seconds in his frame. (This is true because when you 
are very close to the y’ axis, all points on your path are essentially equidistant 
from the source. So we don’t have to worry about any longitudinal effects, in the 
S’ frame.) This means that you get hit by a flash every AT = At'/y = 1/(f’y) 
seconds in your frame, because S’ sees your clock running slow. Therefore, the 
frequency in your frame is 
1 ; f: 

f a mr (11.52) 
Hence, f is greater than f”. You see the flashes at a higher frequency than S’ emits 
them. 


Fig. 11.30 


(case 2) 


Fig. 11.31 
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Case 2: Again, let your frame be S, and let the source’s frame be S’. Consider 
the situation from your point of view. Because of time dilation, a clock on the 
source runs slow (in your frame) by a factor of y. So you get hit by a flash 
every AT = y Afť' = y/f' seconds in your frame. (We have used the fact that 
the relevant photons are emitted from points that are essentially equidistant from 
you. So they all travel the same distance, and we don’t have to worry about any 
longitudinal effects, in your frame.) When you see the source cross the y axis, 
you therefore observe a frequency of 


1 1 f' LICA 
f= AF yar 7 =f y l= pA (11.53) 


Hence, f is smaller than f”. You see the flashes at a lower frequency than S’ emits 
them. 


When people talk about the “transverse Doppler effect,” they sometimes mean 
Case |, and they sometimes mean Case 2. The title “transverse Doppler” is 
therefore ambiguous, so you should remember to state exactly which scenario 
you are talking about. Other cases that are “in between” these two can also be 
considered. But they get a bit messy. 


REMARKS: 


1. The two scenarios may alternatively be described, respectively (as you can convince 
yourself), in the following ways (see Fig. 11.32). 


e Case 1: A receiver moves with speed v in a circle around a source. What frequency 
does the receiver register? 

e Case 2: A source moves with speed v in a circle around a receiver. What frequency 
does the receiver register? 


These two setups make it clear that the results in Eqs. (11.52) and (11.53) arise from a 
simple time-dilation argument used by the inertial object at the center of each circle. 
These setups involve accelerating objects. We must therefore invoke the fact (which has 
plenty of experimental verification) that if an inertial observer looks at a moving clock, 
then only the instantaneous speed of the clock is important in computing the time-dilation 
factor. The acceleration is irrelevant.** 

2. Beware of the following incorrect reasoning for Case 1, leading to an incorrect version of 
Eq. (11.52). “S sees things in S’ slowed down by a factor y (that is, At = y At’), by the 
usual time-dilation effect. Hence, S sees the light flashing at a slower pace. Therefore, 
f = f'/y ” This reasoning puts the y in the wrong place. Where is the error? The error 
lies in confusing the time at which an event occurs in S’s frame, with the time at which S 
sees (with his eyes) the event occur. The flashes certainly occur at a lower frequency in 
S, but due to the motion of S’ relative to S, it turns out that the flashes meet S’s eye at a 
faster rate, because the source is moving slightly toward S while it is emitting the relevant 
photons. You can work out the details from S’s point of view.°> 


34 The acceleration is, however, very important if things are considered from an accelerating object’s 
point of view. But we’ll wait until Chapter 14 on General Relativity to talk about this. 

35 This is a fun exercise (Exercise 11.66), but it should convince you that it’s much easier to look at 
things in the frame in which there are no longitudinal effects, as we did in our solutions above. 
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Alternatively, the error can be stated as follows. The time dilation result, At = y AY’, 
rests on the assumption that the Ax’ between the two events is zero. This applies fine 
to two emissions of light from the source. However, the two events in question are the 
absorption of two light flashes by your eye (which is moving in S”), so At = yA?’ is not 
applicable. Instead, Av’ = y At is the relevant result, valid when Ax = 0. (But we still 
need to invoke the fact that all the relevant photons travel the same distance, which means 
that we don’t have to worry about any longitudinal effects.) & 


11.9 Rapidity 
11.9.1 Definition 
Let us define the rapidity, ¢, by 


tanh p = B= (11.54) 


A few properties of the hyperbolic trig functions are given in Appendix A. 
In particular, tanh@ = (e? — e~%)/(e? + e~*). The rapidity defined in Eq. 
(11.54) is very useful in relativity because many of our expressions take on a 
particularly nice form when written in terms of it. Consider, for example, the 
velocity-addition formula. Let 6; = tanh ġı and £2 = tanh ¢2. Then if we add 
f and £2 using the velocity-addition formula, Eq. (11.31), we obtain 


Bi + f2 _ tanh ¢) + tanh go 
1+ 6162 1+ tanh ¢@; tanh 2 


where we have used the addition formula for tanh ø, which you can prove by 
writing things in terms of the exponentials, e+?. Therefore, while the velocities 
add in the strange manner of Eq. (11.31), the rapidities add by standard addition. 

The Lorentz transformations also take a nice form when written in terms of 
the rapidity. Our friendly y factor can be written as 


1 1 
v1 - p? 7 Ji — tanh* 


= tanh ($1 + %2), (11.55) 


y = cosh ¢. (11.56) 


Also, 
B _ tanh @ 
v1 =p? Ji — tanh? ¢ 


Therefore, the Lorentz transformations in matrix form, Eqs. (11.22), become 


x \  (cosh@ sinh¢ yf 
2 7 ( sinh @ cosh ø ) G) (11.58) 


This transformation looks similar to a rotation in a plane, which is given by 


x cos@ sind x 
= 11: 


yÉ = sinh ¢. (11.57) 
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except that we now have hyperbolic trig functions instead of trig functions. 
The fact that the interval s? = c?t? — x? does not depend on the frame is 
clear from Eq. (11.58), because the cross terms in the squares cancel, and 
cosh? — sinh? = 1. (Compare with the invariance of r? = x? + y? for rota- 
tions in a plane, where the cross terms from Eq. (11.59) likewise cancel, and 
cos26 + sin?@ = 1.) 

The quantities associated with a Minkowski diagram also take a nice form 
when written in terms of the rapidity. The angle between the S and S’ axes 
satisfies 


tan = $ = tanh¢. (11.60) 


And the size of one unit on the x’ or ct’ axes is, from Eq. (11.46), 


2 2 
— — ta 


For large ¢, this is approximately equal to e? //2. 


11.9.2 Physical meaning 


The fact that the rapidity makes many of our formulas look nice and pretty is 
reason enough to consider it. But in addition, it turns out to have a very meaningful 
physical interpretation. Consider the following setup. A spaceship is initially at 
rest in the lab frame. At a given instant, it starts to accelerate. Let a be the 
proper acceleration, which is defined as follows. Let ¢ be the time coordinate 
in the spaceship’s frame.*° If the proper acceleration is a, then at time t + dt, 
the spaceship is moving at speed a dt relative to the frame it was in at time f. 
An equivalent definition is that the astronaut feels a force of ma applied to his 
body by the spaceship. If he is standing on a scale, then the scale shows a reading 
of F = ma. 

What is the relative speed of the spaceship and the lab frame at (the spaceship’s) 
time ¢? We can answer this question by considering two nearby times and using 
the velocity-addition formula, Eq. (11.31). From the definition of a, Eq. (11.31) 
gives, with vı = a dt and v2 = v(t), 


v(t) + adt 


Mies as ava” 


(11.62) 


36 This frame is of course changing as time goes by, because the spaceship is accelerating. The time 
t is simply the spaceship’s proper time. Normally, we would denote this by ¢’, but we don’t want 
to have to keep writing the primes over and over in the following calculation. 
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Expanding this to first order in dt yields?” 


dv v? v dv t 
— = 1- — — = dt. 11.6 
dt a( =) =x [ 1 — v? /c? 1 a ( 3) 


Separating variables and integrating gives, using f dz/(1 —z?) = tanh“ !z,’”’ and 
assuming that a is constant, 


v(t) = c tanh(at/c). (11.64) 


For small a or small ¢ (more precisely, for at/c « 1), we obtain u(t) ~ at, as 
we should (because tanh z ~ z for small z, which follows from the exponential 
form of tanh). And for at/c >> 1, we obtain v(t) ~ c, as we should. If a happens 
to be a function of time, a(t), then we can’t take the a outside the integral in 
Eq. (11.63), so we instead end up with the general formula, 


t 
v(t) = ctanh (< a(tyat). (11.65) 
c Jo 


The rapidity @, as defined in Eq. (11.54), is therefore given by 


t 
g(t) = zf a(t) dt. (11.66) 
c Jo 


Note that whereas v has c as a limiting value, @ can become arbitrarily large. 
Looking at Eq. (11.65) we see that the ¢ associated with a given v is 1/mc times 
the time integral of the force (felt by the astronaut) that is needed to bring the 
astronaut up to speed v. By applying a force for an arbitrarily long time, we can 
make ¢ arbitrarily large. 

The integral f a(t) dt may be described as the naive, incorrect speed. That is, 
it is the speed that the astronaut might think he has, if he has his eyes closed and 
knows nothing about the theory of relativity. And indeed, his thinking would be 
essentially correct for small speeds. This quantity f a(t) dt = f F(t) dt/m looks 
like a reasonably physical thing, so it seems like it should have some meaning. 
And indeed, although it doesn’t equal v, all you have to do to get v is take a tanh 
and throw in some factors of c. 

The fact that rapidities add via simple addition when using the velocity- 
addition formula, as we saw in Eq. (11.55), is evident from Eq. (11.65). There is 
really nothing more going on here than the fact that 


h ti b2 
/ a(t) a= | ac ar f a(t) dt. (11.67) 
to to 


ti 


x Equivalently, just take the derivative of (v + w)/(1 + vw/c?) with respect to w, and then set 
w=0. 

28 Alternatively, you can use 1/(1 z?) = 1/(2(1 —z)) + 1/(2(1 +2)), and then integrate to obtain 
some logs, which then yield the tanh. You can also use the result of Problem 11.17 to find v(t). 


See the remark in the solution to that problem (after trying to solve it, of course!). 
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To be explicit, let a force be applied from fo to tı that brings a mass up to a speed 
of (dropping the c’s) 8, = tanh ¢; = tanh ( A adt), and then let an additional 
force be applied from ż; to t that adds on an additional speed of 62 = tanh ġ2 = 
tanh ( J a dt), relative to the frame at tı. Then the resulting speed may be looked 
at in two ways: (1) it is the result ofrelativistically adding the speeds 6; = tanh ġı 
and f2 = tanh go, and (2) it is the result of applying the force from tọ to t (you 
get the same final speed, of course, whether or not you bother to record the speed 
along the way at t1), which is 6 = tanh ( K a dt) = tanh(¢; + %2), where the 
second equality comes from the statement, Eq. (11.67), that integrals simply add. 
Therefore, the relativistic addition of tanh ¢; and tanh ¢2 gives tanh(¢; + ¢2), 
as we wanted to show. Note that this reasoning doesn’t rely on the fact that the 
function here is a tanh. It could be anything. If we lived in a world where the 
speed were given by, for example, 6 = tan( f adt), then the rapidities would 
still add via simple addition. It’s just that in our world we have a tanh. (We’ll see 
in the next section that a “tan” world would have issues, anyway.) 


11.10 Relativity without c 


In Section | 1.2, we introduced the two postulates of Special Relativity, namely the 
speed-of-light postulate and the relativity postulate. Appendix | gives a derivation 
of the Lorentz transformations that works directly from these two postulates and 
doesn’t use the three fundamental effects, which were the basis of the derivation 
in Section 11.4.1. It’s interesting to see what happens if we relax these postulates. 
It’s hard to imagine a reasonable (empty) universe where the relativity postulate 
doesn’t hold, but it’s easy to imagine a universe where the speed of light depends 
on the reference frame. Light could behave like a baseball, for example. So 
let’s drop the speed-of-light postulate now and see what we can say about the 
coordinate transformations between frames, using only the relativity postulate. 
For further discussion of this topic, see Lee and Kalotas (1975) and references 
therein. 

In Appendix I, the form of the transformations, just prior to invoking the 
speed-of-light postulate, is given in Eq. (1.8) as 


x = Ay(x’ + vf), 


1 1 (11.68) 
= Ay (4+ 2(1 zy): 


We’ll put a subscript on A in this section, to remind us of the v dependence. 
Can we say anything about A, without invoking the speed-of-light postulate? 
Indeed we can. Define Vy by 


1 1 1 1 
= (: ) => A= ——— (11.69) 


I= 02/2 


11.10 Relativity without c 
We have picked the positive square root because when v = 0 we should have 


x =x’ and t = f’. Equations (11.68) now become 


1 
x= =i 4+ vf’), 


J1—v2/V2 
1 v, ') 
t= EE Ws 
JI —w2/V2 (73 


All we’ve done so far is make a change of variables. But we now make the 
following claim. 


(11.70) 


Claim 11.1 V2 is independent of v. 


Proof: As stated in the last remark in Section 11.4.1, we know that two suc- 
cessive applications of the transformations in Eqs. (11.70) must again yield a 
transformation of the same form. Consider a transformation characterized by 
velocity vı, and another one characterized by velocity v2. For simplicity, define 


Vi = o V2 = VWa; 
1 1 11.71 
= I aaa Leek — 
(1-2/9; (1-3/4 


To calculate the composite transformation, it is easiest to use matrix notation. 
Looking at Eqs. (11.70), we see that the composite transformation is given by 
the matrix 


y2 y2U2 V1 yıvı 1+ A v2 + V2 hee 
ny v2 yı yZ y Zr uap 14 Eh an) 
2 


y 


The composite transformation must still be of the form of Eqs. (11.70). But this 
implies that the upper-left and lower-right entries of the composite matrix must 
be equal. Therefore, v? = Ve Since this holds for arbitrary vı and v2, we see 
that V? must be a constant. So it is independent of v. a 


Denote the constant value of V? by V. Then the coordinate transforma- 


tions in Eq. (11.70) become 


1 
x = ———— (x' + vf’), 


y1 = v2/V2 
= 1 1 U 
= V1 —v2/V2 (: j 73"): 


We have obtained this result using only the relativity postulate. These trans- 
formations have the same form as the Lorentz transformations in Eqs. (11.17). 


(11.73) 
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The only extra information in Eqs. (11.17) is that V equals the speed of light, c. 
It is remarkable that we were able to prove so much by using only the relativity 
postulate. 

We can say a few more things. There are four basic possibilities for the value 
of V7. However, two of these are not physical. 


e V? = œœ: This gives the Galilean transformations, x = x’ + vt’ and t = f'. 

e 0 < V? < oo: This gives transformations of the Lorentz type. V is the limiting speed 
of an object. Experiments show that this case is the one that corresponds to the world 
we live in. 

e V? = 0: This case isn’t physical, because any nonzero value of v makes the y factor 
imaginary (and infinite). Nothing could ever move. 

e V2 <0: It turns out that this case is also not physical. You might be concerned that the 
square of V is less than zero, but this is fine because V appears in the transformations 
(11.73) only through its square. There is no need for V to actually be the speed of 
anything. The trouble is that the nature of Eqs. (11.73) implies the possibility of time 
reversal. This opens the door for causality violation and all the other problems associated 
with time reversal. We therefore reject this case. To be more explicit, define b? = —V2, 
where b is a positive number. Then Eqs. (11.73) may be written in the form, 


x =x'cos6 + (bt’) sind, 


. , (11.74) 
bt = —x' sin + (bt') cosé@, 


where tan 0 = v/b. Also, —2/2 < 6 < 2/2, because the positive sign of the x’ and ¢’ 
coefficients in Eqs. (11.73) implies that the cos @ in Eqs. (11.74) satisfies cos 0 > 0. We 
have the normal trig functions in Eqs. (11.74) instead of the hyperbolic trig functions 
in the Lorentz transformation in Eqs. (11.58). This present transformation is simply a 
rotation of the axes through an angle @ in the plane. The S axes are rotated counter- 
clockwise (as you can check) by an angle 6 relative to the S” axes. Equivalently, the S’ 
axes are rotated clockwise by an angle 0 relative to the S axes. 

Equations (11.74) satisfy the requirement that the composition of two transformations 
is again a transformation of the same form. Rotation by 6), and then by 62, yields a 
rotation by 0; + 62. However, if 6; and 62 are positive, and if the resulting rotation is 
through an angle 8 that is greater than 90°, then we have a problem. The tangent of such 
an angle is negative. Therefore, tan 0 = v/b implies that v is negative. 

This situation is shown in Fig. 11.33. Frame S” moves at velocity v2 > 0 with respect 
to frame S’, which moves at velocity vj > 0 with respect to frame S. From the figure, 
we see that someone standing at rest in frame S” (that is, someone whose worldline is 
the br’ axis) is going to have some serious issues in frame S. For one, the bt” axis has 
a negative slope in frame S, which means that as ¢ increases, x decreases. The person is 
therefore moving at a negative velocity with respect to S. Adding two positive velocities 
and obtaining a negative one is clearly absurd. But even worse, someone standing at 
rest in S” is moving in the positive direction along the br” axis, which means that he is 
traveling backwards in time in S. That is, he will die before he is born. This is not good. 


11.11 Problems 


An equivalent method of dismissing this case, given in Lee and Kalotas (1977), but one 
that doesn’t specifically refer to causality violation, is to note that the transformations 
in Eqs. (11.74) don’t form a closed group. In other words, successive applications 
of the transformations can eventually yield a transformation that isn’t of the form in 
Eqs. (11.74), due to the —7/2 < 0 < m/2 restriction. (In contrast, the rotations in a 
plane form a closed group, because there are no restrictions on what 6 can be.) This 
argument is equivalent to the time-reversal argument above, because — 7 /2 < 0 < 1/2 
is equivalent to cos @ > 0, which is equivalent to the statement that the coefficients of 


t and ¢’ in Eqs. (11.74) have the same sign. 


Note that all of the finite 0 < V? < 00 possibilities are essentially the same. 
Any difference in the numerical value of V can be absorbed into the definitions 
of the unit sizes for x and t. Given that V is finite, it has to be something, so 
it doesn’t make sense to put much importance on its numerical value. There is 
therefore only one decision to be made when constructing the spacetime structure 
of an (empty) universe. You just have to say whether V is finite or infinite, that 
is, whether the universe is Lorentzian or Galilean. Equivalently, all you have to 
say is whether or not there is an upper limit on the speed of any object. If there 
is, then you can simply postulate the existence of something that moves with this 
limiting speed. In other words, to create your universe, you simply have to say, 
“Let there be light.” 


11.11 Problems 
Section 11.3: The fundamental effects 


11.1. No transverse length contraction x 


Two meter sticks, A and B, move past each other as shown in Fig. 11.34. 
Stick A has paint brushes on its ends. Use this setup to show that in the 
frame of one stick, the other stick still has a length of one meter. 


11.2. Explaining time dilation «x 


Two planets, A and B, are at rest with respect to each other, a distance L 
apart, with synchronized clocks. A spaceship flies at speed v past planet 
A toward planet B and synchronizes its clock with A’s right when it 
passes A (they both set their clocks to zero). The spaceship eventually 
flies past planet B and compares its clock with B’s. We know, from 
working in the planets’ frame, that when the spaceship reaches B, B’s 
clock reads L/v. And the spaceship’s clock reads L/y v, because it runs 
slow by a factor of y when viewed in the planets’ frame. 

How would someone on the spaceship quantitatively explain to you 
why B’s clock reads L/v (which is more than its own L/y v), considering 
that the spaceship sees B’s clock running slow? 


A 


Fig. 11.34 
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11.3. 


11.5. 


11.7. 


Explaining length contraction +» 


Two bombs lie on a train platform, a distance L apart. As a train passes 
by at speed v, the bombs explode simultaneously (in the platform frame) 
and leave marks on the train. Due to the length contraction of the train, 
we know that the marks on the train are a distance yL apart when viewed 
in the train’s frame, because this distance is what is length-contracted 
down to the given distance L in the platform frame. 

How would someone on the train quantitatively explain why the marks 
are a distance y L apart, considering that the bombs are a distance of only 
L/y apart in the train frame? 


. A passing stick «x 


A stick of length L moves past you at speed v. There is a time interval 
between the front end coinciding with you and the back end coinciding 
with you. What is this time interval in 


(a) your frame? (Calculate this by working in your frame.) 

(b) your frame? (Work in the stick’s frame.) 

(c) the stick’s frame? (Work in your frame. This is the tricky one.) 
(d) the stick’s frame? (Work in the stick’s frame.) 


Rotated square «x 


A square with side L flies past you at speed v, in a direction parallel 
to two of its sides. You stand in the plane of the square. When you see 
the square at its nearest point to you, show that it looks to you like it is 
rotated, instead of contracted. (Assume that L is small compared with 
the distance between you and the square.) 


. Train in a tunnel xx 


A train and a tunnel both have proper lengths L. The train moves toward 
the tunnel at speed v. A bomb is located at the front of the train. The 
bomb is designed to explode when the front of the train passes the far 
end of the tunnel. A deactivation sensor is located at the back of the train. 
When the back of the train passes the near end of the tunnel, the sensor 
tells the bomb to disarm itself. Does the bomb explode? 


Seeing behind the stick +x 


A ruler is positioned perpendicular to a wall, and you stand at rest with 
respect to the ruler and the wall. A stick of length L flies by at speed v. 
It travels in front of the ruler, so that it obscures part of the ruler from 
your view. When the stick hits the wall it stops. Which of the following 
two reasonings is correct (and what is wrong with the incorrect one)? 

In your reference frame, the stick is shorter than L. Therefore, right 
before it hits the wall, you are able to see a mark on the ruler that is 
closer than L units to the wall (see Fig. 11.35). 


11.11 Problems 551 


But in the stick’s frame, the marks on the ruler are closer together. 
Therefore, when the wall hits the stick, the closest mark to the wall that 
you can see on the ruler is greater than Z units (see Fig. 11.35). 


11.8. Cookie cutter +x 


Cookie dough (chocolate chip, of course) lies on a conveyor belt that 
moves at speed v. A circular stamp stamps out cookies as the dough 
rushes by beneath it. When you buy these cookies in a store, what shape 
are they? That is, are they squashed in the direction of the belt, stretched 
in that direction, or circular? 


11.9. Getting shorter «x 


Two balls move with speed v along a line toward two people standing 
along the same line. The proper distance between the balls is yZ, and 
the proper distance between the people is L. Due to length contraction, 
the people measure the distance between the balls to be L, so the balls 
pass the people simultaneously (as measured by the people), as shown in ý ý 
Fig. 11.36. Assume that the people’s clocks both read zero at this time. 
If the people catch the balls, then the resulting proper distance between 
the balls becomes L, which is shorter than the initial proper distance of ri t 
yL. Your task: By working in the frame in which the balls are initially at 

rest, explain how the proper distance between the balls decreases from 
yL to L. Do this in the following way. 


Fig. 11.36 


(a) Draw the beginning and ending pictures for the process. Indicate 
the readings on both clocks in the two pictures, and label all 
relevant lengths. 

(b) Using the distances labeled in your pictures, how far do the people 
travel? Using the times labeled in your pictures, how far do the 
people travel? Show that these two methods give the same result. 

(c) Explain in words how the proper distance between the balls 


decreases from yZ to L. 4cI5 


Section 11.4: The Lorentz transformations D B 3c/5 
—> 


11.10. A bunch of L.T.’s x 
Verify that the values of Ax and At in the table in the “Passing trains” g C 
example in Section 11.6 satisfy the L.T.’s between the six pairs of 
frames, namely AB, AC, AD, BC, BD, and CD (see Fig. 11.37). Fig. 11.37 


Section 11.5: Velocity addition 
4cl5 ? 3c/5 


e —> e — e ——> 


11.11. Equal speeds « A C B 


A and B travel at 4c/5 and 3c/5 with respect to the ground, as shown 
in Fig. 11.38. How fast should C travel so that she sees A and B Fig. 11.38 
approaching her at the same speed? What is this speed? 
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v ? 
e—> e- > ° 
A C B 
Fig. 11.39 
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11.14. 


Fig. 11.40 
eB 
11.15. 
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Fig. 11.41 
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Fig. 11.42 11.17. 


More equal speeds «x 


A travels at speed v with respect to the ground, and B is at rest, as 
shown in Fig. 11.39. How fast should C travel so that she sees A and B 
approaching her at the same speed? In the ground frame (8’s frame), 
what is the ratio of the distances CB and AC (assume that A and C 
arrive at B at the same time)? The answer to this is very nice and clean. 
Can you think of a simple intuitive explanation for the result? 


Equal transverse speeds x 


In the lab frame, an object moves with velocity (ux, uy), and you move 
with velocity v in the x direction. What should v be so that you also 
see the object move with velocity u, in your y direction? One solution 
is of course v = 0. Find the other one. 


Relative speed « 


In the lab frame, two particles move with speed v along the paths 
shown in Fig. 11.40. The angle between the trajectories is 20. What is 
the speed of one particle, as viewed by the other? (Note: This problem 
is posed again in Chapter 13, where it can be solved in a much simpler 
way, using 4-vectors.) 


Another relative speed +» 


In the lab frame, particles A and B move with speeds u and v along 
the paths shown in Fig. 11.41. The angle between the trajectories is 0. 
What is the speed of one particle, as viewed by the other? (Note: This 
problem is posed again in Chapter 13, where it can be solved in a much 
simpler way, using 4-vectors.) 


Transverse velocity addition +x 


For the special case of u! = 0, the transverse velocity-addition for- 
mula, Eq. (11.38), yields u, = u’,/y. Derive this in the following way: 
In frame S’, a particle moves with velocity (0, u’), as shown in the first 
picture in Fig. 11.42. Frame S moves to the left with speed v, so the sit- 
uation in S looks like what is shown in the second picture in Fig. 11.42, 
with the y speed now u. Consider a series of equally spaced dotted 
lines, as shown. The ratio of the times between passes of the dotted 
lines in frames S and S’ is Ts /Ts = u’/u. Derive another expression 
for this ratio by using time-dilation arguments, and then equate the two 
expressions to solve for u in terms of u’ and v. 


Many velocity additions +» 


An object moves at speed vj /c = 6; with respect to S1, which moves 
at speed £2 with respect to $2, which moves at speed 63 with respect to 
S3, and so on, until finally Sy_; moves at speed By with respect to Sy 
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(see Fig. 11.43). Show by induction that the speed, call it Bix), of the 
object with respect to Sy can be written as 


ay = PAA PR 
()) =>? 
PEP, 

a n (11.75) 


where Py = [[a + Bi), and Py = [[a — Bi). 


i=l i=l 


11.18. Velocity addition from scratch «« 


A ball moves at speed vı with respect to a train. The train moves at 
speed v2 with respect to the ground. What is the speed of the ball with 
respect to the ground? Solve this problem (that is, derive the velocity- 
addition formula, Eq. (11.31)) in the following way (don’t use time 
dilation, length contraction, etc; use only the relativity postulate and 
the fact that the speed of light is the same in any inertial frame): 

Let the ball be thrown from the back of the train. At the same instant, 
a photon is released next to it (see Fig. 11.44). The photon heads to the 
front of the train, bounces off a mirror, heads back, and eventually runs 
into the ball. In both the frame of the train and the frame of the ground, 
calculate the fraction of the way along the train where the meeting 
occurs, and then equate these fractions. 


Fig. 11.44 


11.19. Modified twin paradox x+» 


Consider the following variation of the twin paradox. A, B, and C each 
have a clock. In A’s reference frame, B flies past A with speed v to 
the right. When B passes A, they both set their clocks to zero. Also, in 
A’s reference frame, C starts far to the right and moves to the left with 
speed v. When B and C pass each other, C sets his clock to read the 
same as B’s. Finally, when C passes A, they compare the readings on 
their clocks. At this moment, let A’s clock read T4, and let C’s clock 
read Tc. 


(a) Working in A’s frame, show that Te = T,/y, where y = 


1/f1 — v2 /c?. 
(b) Working in B’s frame, show again that Te = Ty/y. 
(c) Working in C’s frame, show again that Te = T,/y. 


Section 11.6: The invariant interval 


11.20. Throwing on a train +» 


A train with proper length L moves at speed c/2 with respect to the 
ground. A ball is thrown from the back to the front, at speed c/3 with 
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respect to the train. How much time does this take, and what distance 
does the ball cover, in: 


(a) The train frame? 

(b) The ground frame? Solve this by 
i. Using a velocity-addition argument. 
ii. Using the Lorentz transformations to go from the train frame 

to the ground frame. 

(c) The ball frame? 

(d) Verify that the invariant interval is indeed the same in all three 
frames. 

(e) Show that the times in the ball frame and ground frame are 
related by the relevant y factor. 

(f) Ditto for the ball frame and train frame. 

(g) Show that the times in the train frame and ground frame are not 
related by the relevant y factor. Why not? 


Section 11.7: Minkowski diagrams 


11.21. 


11.22. 


11.23. 


11.24. 


Anew frame « 


In a given reference frame, Event 1 happens at x = 0, ct = 0, and 
Event 2 happens at x = 2, ct = 1 (in units of some specified length). 
Find a frame in which the two events are simultaneous. 


Minkowski diagram units « 


Consider the Minkowski diagram in Fig. 11.45. In frame S, the hyper- 
bola c?t? — x? = 1 is drawn. Also drawn are the axes of frame S’, 
which moves past S with speed v. Use the invariance of the interval 
s? = c*t? — x? to derive the ratio of the unit sizes on the ct’ and ct axes, 


and check your result with Eq. (11.46). 


Velocity addition via Minkowski «« 


An object moves at speed vı with respect to frame S”. Frame S” moves 
at speed v2 with respect to frame S (in the same direction as the motion 
of the object). What is the speed, u, of the object with respect to frame 
S? Solve this problem (that is, derive the velocity-addition formula) 
by drawing a Minkowski diagram with frames S and S’, drawing the 
worldline of the object, and doing some geometry. 


Clapping both ways +» 


Twin A stays on the earth, and twin B flies to a distant star and back. For 
both of the following setups, draw a Minkowski diagram that explains 
what is happening. 


11.25. 


11.26. 
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(a) Throughout the trip, B claps in such a way that his claps occur 
at equal time intervals At in A’s frame. At what time intervals 
do the claps occur in B’s frame? 

(b) Now let A clap in such a way that his claps occur at equal time 
intervals At in B’s frame. At what time intervals do the claps 
occur in A’s frame? (Be careful on this one. The sum of all 
the time intervals must equal the increase in A’s age, which is 
greater than the increase in B’s age, as we know from the usual 
twin paradox.) 


Acceleration and redshift xxx 


Use a Minkowski diagram to solve the following problem: Two people 
stand a distance d apart. They simultaneously start accelerating in the 
same direction (along the line between them) with the same proper 
acceleration a. At the instant they start to move, how fast does each 
person’s clock tick in the (changing) frame of the other person? 


Break or not break? xxx 


Two spaceships float in space and are at rest relative to each other. They 
are connected by a string (see Fig. 11.46). The string is strong, but it can- 
not withstand an arbitrary amount of stretching. At a given instant, the 
spaceships simultaneously (with respect to their initial inertial frame) 
start accelerating in the same direction (along the line between them) 
with the same constant proper acceleration. In other words, assume 
they bought identical engines from the same store, and they put them 
on the same setting. Will the string eventually break? 


Section 11.9: Rapidity 


11.27. 


11.28. 


Successive Lorentz transformations 


The Lorentz transformation in Eq. (11.58) may be written in matrix 


form as 
x cosh@ sinh¢ x! 
= ; 11. 
( ct ) ( sinhd cosh ¢ ) ( ct’ ae) 
Show that if you apply an L.T. with vı = tanh ¢, and then another 
one with v2 = tanh q2, the result is an L.T. with v = tanh(¢; + @¢2). 


Accelerator’s time x» 


A spaceship is initially at rest in the lab frame. At a given instant, it 
starts to accelerate. Let this happen when the lab clock reads t = 0 
and the spaceship clock reads t/ = 0. The proper acceleration is a. 
(That is, at time ¢’ + dt’, the spaceship is moving at speed adt' relative 


Fig. 11.46 
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to the frame it was in at time ¢’.) Later on, a person in the lab measures 
t and ¢’. What is the relation between them? 


Exercises 


Section 11.3: The fundamental effects 


11.29. 


11.30. 


11.31. 


11.32. 


11.33. 


Effectively speed c x 


A rocket flies between two planets that are one light-year apart. What 
should the rocket’s speed be so that the time elapsed on the captain’s 
watch is one year? 


A passing train x 
A train of length 15 cs moves at speed 3c/5.°” How much time does 
it take to pass a person standing on the ground (as measured by that 


person)? Solve this by working in the frame of the person, and then 
again by working in the frame of the train. 


Overtaking a train « 


Train A has length L. Train B moves past A (on a parallel track, facing 
the same direction) with relative speed 4c/5. The length of B is such 
that A says that the fronts of the trains coincide at exactly the same time 
as the backs coincide. What is the time difference between the fronts 
coinciding and the backs coinciding, as measured by B? 


Walking on a train x 


A train of proper length L and speed 3c/5 approaches a tunnel of 
length L. At the moment the front of the train enters the tunnel, a per- 
son leaves the front of the train and walks (briskly) toward the back. 
She arrives at the back of the train right when it (the back) leaves the 
tunnel. 


(a) How much time does this take in the ground frame? 
(b) What is the person’s speed with respect to the ground? 
(c) How much time elapses on the person’s watch? 


Simultaneous waves x 


Alice flies past Bob at speed v. Right when she passes, they both set 
their watches to zero. When Alice’s watch shows a time 7, she waves 
to Bob. Bob then waves to Alice simultaneously (as measured by him) 
with Alice’s wave (so this is before he actually sees her wave). Alice 
then waves to Bob simultaneously (as measured by her) with Bob’s 
wave. Bob then waves to Alice simultaneously (as measured by him) 


39 1 cs is one “light-second.” It equals (1)(3 - 108 m/s)(1s) =3- 108 m. 
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11.35. 


11.36. 


11.37. 


11.38. 
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with Alice’s second wave. And so on. What are the readings on Alice’s 
watch for all the times she waves? And likewise for Bob? 


Here and there « 


A train of proper length L travels past you at speed v. A person on the 
train stands at the front, next to a clock that reads zero. At this moment 
in time (as measured by you), a clock at the back of the train reads 
Lv/c”. How would you respond to the following statement: 

“The person at the front of the train can leave the front right after 
the clock there reads zero, and then run to the back and get there right 
before the clock there reads Lv/c”. You (on the ground) will therefore 
see the person simultaneously at both the front and the back of the train 
when the clocks there read zero and Lv/c”, respectively.” 


Photon on a train x 


A train of proper length L has clocks at the front and back. A photon is 
fired from the back of the train to the front. Working in the train frame, 
we can easily say that if the photon leaves the back of the train when 
the clock there reads zero, then it arrives at the front when the clock 
there reads L/c. 

Now consider this setup in the ground frame, where the train travels 
by at speed v. Rederive the above result (that the difference in the 
readings of the two clocks is L/c) by working only in the ground frame. 


Triplets x 


Triplet A stays on the earth. Triplet B travels at speed 4c/5 to a planet 
(a distance L away) and back. Triplet C travels out to the planet at speed 
3c/4, and then returns at the necessary speed to arrive back exactly 
when B does. How much does each triplet age during this process? 
Who is youngest? 


Seeing the light +» 


A and B leave from a common point (with their clocks both reading 
zero) and travel in opposite directions with relative speed v. When B’s 
clock reads T, he (B) sends out a light signal. When 4 receives the 
signal, what time does his (A’s) clock read? Answer this question by 
doing the calculation entirely in (a) A’s frame, and then (b) B’s frame. 
(This problem is basically a derivation of the longitudinal Doppler 
effect, discussed in Section 11.8.1.) 


Two trains and a tree xx 


Two trains of proper length L move toward each other in opposite 
directions on parallel tracks. They both move at speed v with respect 
to the ground. Both trains have clocks at the front and back, and these 
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11.40. 


11.41. 


clocks are synchronized as usual in the frame of the train they are in. 
A tree is located on the ground at the place where the fronts of the 
trains pass each other. The clocks at the fronts of the trains both read 
zero when they pass. Find the reading on the clocks at the backs of the 
trains when they (the backs) pass each other at the tree. Do this in three 
different ways: 


(a) Imagine that you stand next to the tree on the ground, and you 
observe what one of the rear clocks is doing. 

(b) Imagine that you are on one of the trains, and you observe what 
your own rear clock is doing during the time the tree travels the 
relevant distance. 

(c) Imagine that you are on one of the trains, and you observe what 
the other train’s rear clock is doing during the time the tree travels 
the relevant distance. (You'll need to use velocity addition.) 


Twice simultaneous «« 


A train of proper length L moves at speed v with respect to the ground. 
When the front of the train passes a tree on the ground, a ball is simul- 
taneously (as measured in the ground frame) thrown from the back of 
the train toward the front, with speed u with respect to the train. What 
should u be so that the ball hits the front simultaneously (as measured 
in the train frame) with the tree passing the back of the train? Show that 
in order for a solution for u to exist, we must have v/c < (/5 — 1) /2, 
which happens to be the inverse of the golden ratio. 


People clapping +» 


Two people stand a distance L apart along an east-west road. They both 
clap their hands at precisely noon in the ground frame. You are driving 
eastward down this road at speed 4c/5. You notice that you are next 
to the western person at the same instant (as measured in your frame) 
that the eastern person claps. Later on, you notice that you are next to 
a tree at the same instant (as measured in your frame) that the western 
person claps. Where is the tree along the road? (Describe its location 
in the ground frame.) 


Photon, tree, and house +x 


(a) A train of proper length L moves at speed v with respect to the 
ground. At the instant the back of the train passes a certain tree, 
someone at the back of the train shines a photon toward the front. 
The photon happens to hit the front of the train at the instant the 
front passes a certain house. As measured in the ground frame, 
how far apart are the tree and the house? Solve this by working 
in the ground frame. 
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(b) Now look at the setup from the point of view of the train frame. 
Using your result for the tree—house distance from part (a), verify 
that the house meets the front of the train at the same instant the 
photon meets it. 


Tunnel fraction +» 


A person runs with speed v toward a tunnel of length L. A light source 
is located at the far end of the tunnel. At the instant the person enters 
the tunnel, the light source simultaneously (as measured in the tunnel 
frame) emits a photon that travels down the tunnel toward the person. 
When the person and the photon eventually meet, the person’s location 
is a fraction f along the tunnel. What is f? Solve this by working in the 
tunnel frame, and then again by working in the person’s frame. 


Overlapping trains +» 


An observer on the ground sees two trains, A and B, both of proper 
length L, move in opposite directions at speed v with respect to the 
ground. She notices that when the trains “overlap,” clocks at the front 
of A and rear of B both read zero. From the “rear clock ahead” effect, 
she therefore also notices that clocks at the rear of A and front of B 
read Lu/c? and —Lv/c’, respectively (as shown in Fig. 11.47). Now 
imagine riding along on A. When the rear of B passes the front of your 
train (A), clocks at both of these places read zero (as stated above). 
Explain, by working only in the frame of A, why clocks at the back of 
A and the front of B read Lv/c? and —Lv/c’, respectively, when these 
points coincide. (You’ll need to use velocity addition.) 


Bouncing stick +x 


A stick, oriented horizontally, falls and bounces off the ground. Qual- 
itatively, what does this look like in the frame of someone running by 
at speed v? 


Through the hole? xx 


A stick of proper length L moves at speed v in the direction of its length. 
It passes over an infinitesimally thin sheet that has a hole of diameter L 
cut in it. As the stick passes over the hole, the sheet is raised so that the 
stick passes through the hole and ends up underneath the sheet. Well, 
maybe... 

In the lab frame, the stick’s length is contracted to L/y, so it appears 
to easily make it through the hole. But in the stick frame, the hole is 
contracted to L/y, so it appears that the stick does not make it through 
the hole (or rather, the hole doesn’t make it around the stick, since the 
hole is what is moving in the stick frame). So the question is: Does the 
stick end up on the other side of the sheet or not? 
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Short train in a tunnel +» 


Consider the scenario in Problem 11.6, with the only change being that 
the train now has length rL, where r is some numerical factor. What 
is the largest value of r, in terms of v, for which it is possible for the 
bomb not to explode? Verify that you obtain the same answer working 
in the train frame and working in the tunnel frame. 


Section 11.4: The Lorentz transformations 


11.47. 


11.48. 


Successive L.T.’s +x 


Show that the combination of an L.T. (with speed v1) and an L.T. (with 
speed v2) yields an L.T. with speed u = (vı + v2)/(1 + v1 v2/C*). 


Loss of simultaneity «« 


A train moves at speed v with respect to the ground. Two events occur 
simultaneously, a distance L apart, in the train frame. What are the time 
and space separations in the ground frame? Solve this by: 
(a) Using the Lorentz transformations. 
(b) Using only the results in Section 11.3. Do this by working in the 
ground frame, and then again by working in the train frame. 


Section 11.5: Velocity addition 


11.49. 


11.50. 


11.51. 


11.52. 


Some y’s « 


Show that the relativistic addition (or subtraction) of the velocities u 
and v has a y factor given by y = pyyy(1 + uv). 


Slanted time dilation « 


A clock moves vertically with speed u in a given frame, and you run 
horizontally with speed v with respect to this frame. Show that you see 
the clock run slow by the nice simple factor, y,yy. 


Pythagorean triples x 


Let (a,b,h) be a Pythagorean triple. (We’ll use A to denote the 
hypotenuse, instead of c, for obvious reasons.) Consider the relativis- 
tic addition or subtraction of the two speeds, 6; = a/h and b2 = b/h. 
Show that the numerator and denominator of the result are the leg and 
hypotenuse of another Pythagorean triple, and find the other leg. What 
is the associated y factor? 


Running away « 
A and B both start at the origin and simultaneously head off in opposite 


directions at speed 3c/5 with respect to the ground. A moves to the 
right, and B moves to the left. Consider a mark on the ground at x = L. 
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As viewed in the ground frame, A and B are a distance 2L apart when A 
passes this mark. As viewed by A, how far away is B when A coincides 
with the mark? 


Angled photon « 


A photon moves at an angle 6 with respect to the x’ axis in frame S’. 
Frame S’ moves at speed v with respect to frame S (along the x’ axis). 
Calculate the components of the photon’s velocity in S, and verify that 
the speed is c. 


Running on a train +x 


A train of proper length L moves at speed vı with respect to the ground. 
A passenger runs from the back of the train to the front at speed v2 
with respect to the train. How much time does this take, as viewed by 
someone on the ground? Solve this in two different ways: 


(a) Find the relative speed of the passenger and the train (as viewed 
by someone on the ground), and then find the time it takes for 
the passenger to erase the initial “head start” that the front of the 
train had. 

(b) Find the time elapsed on the passenger’s clock (by working in 
whatever frame you want), and then use time dilation to get the 
time elapsed on a ground clock. 


Velocity addition +» 


The fact that the previous exercise can be solved in two different ways 
suggests a method of deriving the velocity-addition formula: A train of 
proper length L moves at speed vı with respect to the ground. A ball is 
thrown from the back of the train to the front at speed v2 with respect 
to the train. Let the speed of the ball with respect to the ground be V. 
Calculate the time of the ball’s journey, as measured by an observer 
on the ground, in the following two different ways, and then set them 
equal to solve for V in terms of vı and v2. (This gets a bit messy. And 
yes, you have to solve a quadratic.) 


(a) First way: Find the relative speed of the ball and the train (as 
viewed by someone on the ground), and then find the time it 
takes for the ball to erase the initial “head start” that the front of 
the train had. 

(b) Second way: Find the time elapsed on the ball’s clock (by work- 
ing in whatever frame you want), and then use time dilation to 
get the time elapsed on a ground clock. 


Velocity addition again ++ 


A train of proper length L moves at speed v with respect to the ground. 
A ball is thrown from the back of the train to the front at speed u with 
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respect to the train. When finding the time of this process in the ground 
frame, a common error is to use time dilation to go from the train frame 
to the ground frame, which gives an incorrect answer of yy (L/u). This 
is incorrect because time dilation is valid only if the two relevant events 
occur at the same place in one of the frames; otherwise simultaneity 
becomes an issue. 


(a) Find the total time in the ground frame correctly by looking 
at how much a clock at rest in the train frame advances (for 
example, a clock at the front of the train), and by applying time 
dilation to this clock. 

(b) Find the total time in the ground frame by applying time dilation 
to the ball’s clock. Your answer will contain the unknown speed 
V of the ball with respect to the ground. 

(c) Equate your results from parts (a) and (b) to show that yy = 
YuYv(1 + uv/c*). Then solve for V to produce the velocity- 
addition formula. 


11.57. Bullets on a train +» 


A train moves at speed v. Bullets are successively fired at speed u 
(relative to the train) from the back of the train to the front. A new 
bullet is fired at the instant (as measured in the train frame) the previous 
bullet hits the front. In the frame of the ground, what fraction of the 
way along the train is a given bullet, at the instant (as measured in the 
ground frame) the next bullet is fired? What is the maximum number 
of bullets that are in flight at a given instant, in the ground frame? 


11.58. Time dilation and Ly/c? xx 


A person walks very slowly at speed u from the back of a train of 
proper length Z to the front. The time-dilation effect in the train frame 
can be made arbitrarily small by picking u to be sufficiently small 
(because the effect is second order in u). Therefore, if the person’s 
watch agrees with a clock at the back of the train when he starts, then 
it also (essentially) agrees with a clock at the front when he finishes. 

Now consider this setup in the ground frame, where the train moves 
at speed v. The rear clock reads Lv/c* more than the front, so in view of 
the preceding paragraph, the time gained by the person’s watch during 
the process must be Lu/c? less than the time gained by the front clock. 
By working in the ground frame, explain why this is the case. Assume 
u<v. 


40 Tf you line up a collection of these train systems around the circumference of a rotating platform, 
then the above result implies the following fact. Let person A be at rest on the platform, and let 
person B walk arbitrarily slowly around the circumference. Then when B returns to A, B’s clock 
will read less than A’s. This is true because the above reasoning shows (as you will figure out) 


11.12 Exercises 


Section 11.6: The invariant interval 


11.59. 


11.60. 


11.61. 


Passing a train +» 


Person A stands on the ground, train B with proper length L moves to 
the right at speed 3c/5, and person C runs to the right at speed 4c/5. 
C starts behind the train and eventually passes it. Let event £; be “C 
coincides with the back of the train,” and let event E> be “C coincides 
with the front of the train.” Find the At and Ax between the events £1 
and E> in the frames of A, B, and C, and show that c? At? — Ax? is the 
same in all three frames. 


Passing trains +x 


Train A with proper length L moves eastward at speed v, while train 
B with proper length 2L moves westward also at speed v. How much 
time does it take for the trains to pass each other (defined as the time 
between the fronts coinciding and the backs coinciding): 


(a) In A’s frame? 

(b) In B’s frame? 

(c) In the ground frame? 

(d) Verify that the invariant interval is the same in all three frames. 


Throwing on a train +x 


A train with proper length L moves at speed 3c/5 with respect to the 
ground. A ball is thrown from the back to the front, at speed c/2 with 
respect to the train. How much time does this take, and what distance 
does the ball cover, in: 


(a) The train frame? 

(b) The ground frame? Solve this by 
i. Using a velocity-addition argument. 
ii. Using the Lorentz transformations to go from the train frame 

to the ground frame. 

(c) The ball frame? 

(d) Verify that the invariant interval is indeed the same in all three 
frames. 

(e) Show that the times in the ball frame and ground frame are 
related by the relevant y factor. 


that an inertial observer sees B’s clock running slower than A’s. This result, that you can walk 


arbitrarily slowly in a particular reference frame and have your clock lose synchronization with 


other clocks, is a consequence of the fact that in some accelerating reference frames it is impossible 


to produce a consistent method (that is, one without a discontinuity) of clock synchronization. See 


Cranor et al. (2000) for more details. 
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(f) Ditto for the ball frame and train frame. 
(g) Show that the times in the train frame and ground frame are not 
related by the relevant y factor. Why not? 


Section 11.7: Minkowski diagrams 


11.62. 


11.63. 


11.64. 


11.65. 


Time dilation via Minkowski « 


In the spirit of the example in Section 11.7, use a Minkowski diagram 
to derive the time-dilation result between frames S and S’ (in both 
directions, as in the example). 


Ly/c? via Minkowski « 

In the spirit of the example in Section 11.7, use a Minkowski diagram 
to derive the Lu/c* rear-clock-ahead result for frames S and S’ (in 
both directions, as in the example). 

Simultaneous waves again «x 


Solve Exercise 11.33 by using a Minkowski diagram from the point 
of view of someone who sees Alice and Bob moving with equal and 
opposite speeds. 


Short train in a tunnel again xxx 


Solve Exercise 11.46 by using a Minkowski diagram from the point of 
view of the train, and also of the tunnel. 


Section 11.8: The Doppler effect 


11.66. 


11.67. 


Transverse Doppler +x 


As mentioned in Remark 2 of Section 11.8.2, it is possible to solve the 
transverse Doppler effect for Case 1 by working in the frame of the 
observer, provided that you account for the longitudinal component 
of the source’s motion. Solve the problem this way and reproduce 
Eq. (11.52). 


Twin paradox via Doppler +x 


Twin A stays on the earth, and twin B flies at speed v to a distant star 
and back. The star is a distance L from the earth in the earth—star frame. 
Use the Doppler effect to show that B is younger by a factor y when 
she returns (don’t use any time dilation or length contraction results). 
Do this in the following two ways; both are doable by working in either 
A’s frame or B’s frame(s), so take your pick. 


(a) A sends out flashes at intervals of £ seconds (as measured 


in his frame). By considering the numbers of redshifted and 
blueshifted flashes that B receives, show that Tg = T4/y. 


11.13 Solutions 


(b) B sends out flashes at intervals of ¢ seconds (as measured 
in her frame). By considering the numbers of redshifted and 
blueshifted flashes that A receives, show that Tg = T4/y. 


Section 11.9: Rapidity 


11.68. Time of travel +x 


Consider the setup in Problem 11.28 (and feel free to use the results 
from that problem in this exercise). Let the spaceship travel to a planet 
a distance L from the earth. 


(a) By working in the frame of the earth, find the time of the journey, 
as measured by the earth. Check the large and small Z (compared 
with c?/a) limits. 

(b) By working in the (changing) frame of the spaceship, find the 
time of the journey, as measured by the spaceship (an implicit 
equation is fine). Check the small L limit. How does the time 
behave for large L? 


11.13 Solutions 


11.1. 


11.2. 


No transverse length contraction 


Assume that the paint brushes are capable of leaving marks on stick B if B is long 
enough, or if A is short enough. The key fact we need here is the second postulate of 
relativity, which says that the frames of the sticks are equivalent. That is, if A sees 
B shorter than (or longer than, or equal to) itself, then B also sees A shorter than (or 
longer than, or equal to) itself. The contraction factor must be the same when going 
each way between the frames. 

Assume (in search of a contradiction) that A sees B shortened. Then B won’t extend 
out to the ends of A, so there will be no marks on B. But in this case, B must also 
see A shortened, so there will be marks on B (see Fig. 11.48). This is a contradiction. 
Likewise, if we assume that A sees B lengthened, we also reach a contradiction. We 
are therefore left with only the third possibility, namely that each stick sees the other 
stick as one meter long. 


Explaining time dilation 

The resolution to the apparent paradox is the “head start” that B’s clock has over A’s 
clock, as seen in the spaceship frame. From Eq. (11.6), we know that in the spaceship 
frame, B’s clock reads Lv/c? more than A’s. (The two planets may be considered to 
be at the ends of the train in the example in Section 11.3.1.) 

Therefore, what a person on the spaceship says is: “My clock advances by L/yv 
during the whole trip. I see B’s clock running slow by a factor y, so I see B’s clock 
advance by only (L/yv)/y = L/y?v. However, B’s clock started not at zero but at 
Lv/c?. Therefore, the final reading on B’s clock when I get there is 


Lv | L E v 1 E v 1 v? L (11.77) 
C2 yw vie’ y v\e c22 v’ 


as we wanted to show.” 


<— 


A 


A 


Fig. 11.48 


B 


B 
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11.3. Explaining length contraction 


The resolution to the apparent paradox is that the explosions do not occur simulta- 
neously in the train frame. As the platform rushes past the train, the “rear” bomb 
explodes before the “front” bomb explodes.*! The front bomb then gets to travel 
farther by the time it explodes and leaves its mark. The distance between the marks 
is therefore larger than you might naively expect. Let’s be quantitative about this. 

Let the two bombs contain clocks that read zero when they explode (they are 
synchronized in the platform frame). Then in the train frame, the front bomb’s clock 
reads only —Lv/c? when the rear bomb explodes when showing a time of zero. 
(This is the “rear clock ahead” result from Eq. (11.6).) The front bomb’s clock must 
therefore advance by a time of Lv/c* before it explodes. But the train sees the bombs’ 
clocks running slow by a factor y, so in the train frame the front bomb explodes a 
time y Lv/c? after the rear bomb explodes. During this time of y Lv/ c?, the platform 
moves a distance (yLv/c*)v relative to the train. 

Therefore, what a person on the train says is: “Due to length contraction, the 
distance between the bombs is L/y. The front bomb is therefore a distance L/y ahead 
of the rear bomb when the latter explodes. The front bomb then travels an additional 
distance of y Lv? /c? by the time it explodes, at which point it is a distance of 


L y? 1 vw v v? 
a zZ = yL zz } a =yL\{(1l a t 5 =yL (11.78) 


ahead of the rear bomb’s mark, as we wanted to show.” 


11.4. A passing stick 


(a) The stick has length L/y in your frame, and it moves with speed v. Therefore, 
the time taken in your frame to cover the distance L/y is L/yv. 

(b) The stick sees you fly by at speed v. The stick has length L in its own frame, so 
the time elapsed in the stick frame is L/v. During this time, the stick sees the 
watch on your wrist run slow by a factor y. Therefore, a time of L/yv elapses 
on your watch, in agreement with part (a). 

Logically, the two solutions in (a) and (b) differ in that one uses length 
contraction and the other uses time dilation. Mathematically, they differ simply 
in the order in which the divisions by y and v occur. 

(c) You see the rear clock on the stick showing a time of Lu/ c? more than the front 
clock. In addition to this head start, more time will of course elapse on the rear 
clock by the time it reaches you. The time in your frame is L/yv (because the 
stick has length L/y in your frame). But the stick’s clocks run slow, so a time 
of only L/y?v will elapse on the rear clock by the time it reaches you. The total 
additional time (compared with the front clock’s reading when it passed you) 
that the rear clock shows is therefore 


lv | L L v 1 L v 1 v? L (11.79) 
e yw vie’ yp) vie! eJ) v’ : 


in agreement with the quick calculation that follows in part (d). 
(d) The stick sees you fly by at speed v. The stick has length L in its own frame, 
so the time elapsed in the stick frame is L/v. 


4l Since we’ll be working in the train frame here, we’ll use the words “rear” and “front” in the way 
that someone on the train uses them as she watches the platform rush by. That is, if the train is 
heading east with respect to the platform, then from the point of view of the train, the platform 
is heading west, so the eastern bomb on the platform is the rear one, and the western bomb is 
the front one. So they have the opposite orientation compared with the way that someone on the 
platform labels the rear and front of the train. Using the same orientation would entail writing the 
phrase “front clock ahead” below, which would make me cringe. 


11.13 Solutions 


11.5. Rotated square 


Figure 11.49 shows a top view of the square at the instant (in your frame) when it is 
closest to you. Its length is contracted along the direction of motion, so it takes the 
shape of a rectangle with sides L and L/y. This is what the shape is in your frame 
(where is-ness is defined by where all the points of an object are at simultaneous 
times). But what does the square look like to you? That is, what is the nature of the 
photons hitting your eye at a given instant?“ 

Photons from the far side of the square have to travel an extra distance L to get to 
your eye, compared with photons from the near side. So they need an extra time L/c 
of flight. During this time L/c, the square moves a distance Lu/c = LB sideways. 
Therefore, referring to Fig. 11.50, a photon emitted at point A reaches your eye at 
the same time as a photon emitted from point B. This means that the trailing side of 
the square spans a distance Lf across your field of vision, while the near side spans 
a distance L/y = Ly 1 — $? across your field of vision. But this is exactly what a 
rotated square of side L looks like, as shown in Fig. 11.51, where the angle of rotation 
satisfies sin @ = £. For the case ofa circle instead of a square, see Hollenbach (1976). 


11.6. Train in a tunnel 


Yes, the bomb explodes. This is clear in the frame of the train (see Fig. 11.52). In 
this frame, the train has length L, and the tunnel speeds past it. The tunnel is length- 
contracted down to L/y. Therefore, the far end of the tunnel passes the front of the 
train before the near end passes the back, so the bomb explodes. 

We can, however, also look at things in the frame of the tunnel (see Fig. 11.53). Here 
the tunnel has length L, and the train is length-contracted down to L/y. Therefore, 
the deactivation device gets triggered before the front of the train passes the far end 
of the tunnel, so you might think that the bomb does not explode. We appear to have 
a paradox. 

The resolution to this paradox is that the deactivation device cannot instantaneously 
tell the bomb to deactivate itself. It takes a finite time for the signal to travel the 
length of the train from the sensor to the bomb. And it turns out that this transmission 
time makes it impossible for the deactivation signal to get to the bomb before the 
bomb gets to the far end of the tunnel, no matter how fast the train is moving. Let’s 
show this. 

The signal has the best chance of winning this “race” if it has speed c, so let’s 
assume this is the case. Now, the signal gets to the bomb before the bomb gets to 
the far end of the tunnel if and only if a light pulse emitted from the near end of the 
tunnel (at the instant the back of the train goes by) reaches the far end of the tunnel 
before the front of the train does. The former takes a time L/c. The latter takes a time 
L(1 — 1/y)/v, because the front of the train is already a distance L/y through the 
tunnel. So if the bomb is not to explode, we must have 


L/e < LQ —1/y)/v 
= p<1-vi--f 
v1- <1-£ 

< Jl+B <VJ1-8B. (11.80) 


This is never true. Therefore, the signal always arrives too late, and the bomb always 
explodes. 


42 In relativity problems, we virtually always subtract off the time it takes light to travel from the 
object to your eye (that is, we find out what really is). But along with the Doppler effect discussed 
in Section 11.8, this problem is one of the few exceptions where we actually want to determine 
what your eye registers. 
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11.7. Seeing behind the stick 


The first reasoning is correct. You will be able to see a mark on the ruler that is less 
than L units from the wall. You will actually be able to see a mark even closer to the 
wall than L/y, as we'll show below. 

The error in the second reasoning (in the stick’s frame) is that the second picture in 
Fig. 11.35 is not what you see. This second picture shows where things are at simul- 
taneous times in the stick’s frame, which are not simultaneous times in your frame. 
Alternatively, the error is the implicit assumption that signals travel instantaneously. 
But in fact the back of the stick cannot know that the front of the stick has been hit 
by the wall until a finite time has passed. During this time, the ruler (and the wall 
and you) travels farther to the left, allowing you to see more of the ruler. Let’s be 
quantitative about this and calculate (in both frames) the closest mark to the wall that 
you can see. 

Consider your reference frame. The stick has length L/y. Therefore, when the stick 
hits the wall, you can see a mark a distance L/y from the wall. You will, however, 
be able to see a mark even closer to the wall, because the back end of the stick will 
keep moving forward, since it doesn’t know yet that the front end has hit the wall. 
The stopping signal (shock wave, etc.) takes time to travel. 

Let’s assume that the stopping signal travels along the stick at speed c. (We could 
instead work with a general speed u, but the speed c is simpler, and it yields an upper 
bound on the closest mark you can see.) Where will the signal reach the back end? 
Starting from the time the stick hits the wall, the signal travels backward from the wall 
at speed c, while the back end of the stick travels forward at speed v (from a point L/y 
away from the wall). So the relative speed (as viewed by you) of the signal and the 
back end is c + v. Therefore, the signal hits the back end after a time (L/y)/(c + v). 
During this time, the signal has traveled a distance c(L/y)/(c + v) from the wall. 
The closest point to the wall that you can see on the ruler is therefore the mark with 


the value 
OREN e (11.81) 
yd +B) TB 


Now consider the stick’s reference frame. The wall is moving to the left toward the 
stick at speed v. After the wall hits the right end of the stick, the signal moves to 
the left with speed c, while the wall keeps moving to the left with speed v. Where 
is the wall when the signal reaches the left end? The wall travels v/c as fast as the 
signal, so it travels a distance Lu/c in the time that the signal travels the distance L. 
This means that the wall is L(1 — v/c) away from the left end of the stick. In the 
stick’s frame, this corresponds to a distance yL(1 — v/c) on the ruler, because the 
ruler is length contracted. So the left end of the stick is at the mark with the value 


= 
Ly(l— p) = L | — 11.82 
ya= B) TF3’ ( ) 


in agreement with Eq. (11.81). 


. Cookie cutter 


Let the diameter of the cookie cutter be L, and consider the two following reasonings. 


e In the lab frame, the dough is length contracted, so the diameter L corresponds 
to a distance larger than L (namely yL) in the dough’s frame. Therefore, when 
you buy a cookie, it is stretched out by a factor y in the direction of the belt.*? 


43 The shape is an ellipse, since that’s what a stretched-out circle is. The eccentricity of an ellipse is 


the focal distance divided by the semi-major axis length. As an exercise, you can show that this 


equals B = v/c here. 


11.13 Solutions 


e Inthe frame of the dough, the cookie cutter is length contracted down to L/y in 
the direction of motion. So in the frame of the dough, the cookies have a length 
of only L/y. Therefore, when you buy a cookie, it is squashed by a factor y in 
the direction of the belt. 


Which reasoning is correct? The first one is. The cookies are stretched out. The fallacy 
in the second reasoning is that the various parts of the cookie cutter do not strike the 
dough simultaneously in the dough frame. What the dough sees is this: Assuming 
that the cutter moves to the left, the right side of the cutter stamps the dough, then 
nearby parts of the cutter stamp it, and so on, until finally the left side of the cutter 
stamps the dough. But by this time the front (that is, the left) of the cutter has moved 
farther to the left. So the cookie turns out to be longer than L. It takes a little work to 
demonstrate (by working in the dough frame) that the length is actually yL, but let’s 
do that now. 

Consider the moment when the rightmost point of the cutter strikes the dough. In 
the dough frame, a clock at the rear (the right side) of the cutter reads Lu /c? more than 
a clock at the front (the left side). The front clock must therefore advance by Lv/c? 
by the time it strikes the dough. (This is true because all points on the cutter strike 
the dough simultaneously in the cutter frame. Hence, all cutter clocks read the same 
when they strike.) But due to time dilation, this takes a time y (Lv/c?) in the dough 
frame. During this time, the cutter travels a distance u(y Lu /c?). Since the front of 
the cutter was initially a distance L/y (due to length contraction) ahead of the back, 
the total length of the cookie in the dough frame is 


Ly yLv 1 x v wv 
ae a ele as we ei cs | Geary paar Boa 


as we wanted to show. Ifthe dough is then slowly decelerated, the shape of the cookies 
won't change. So this is the shape you see in the store. 


11.9. Getting shorter 


(a) In the frame in which the balls are initially at rest, the beginning picture (when 
the left person catches the left ball when his clock reads zero) is shown in 
Fig. 11.54. The balls are a distance yL apart, and the people’s separation is 
length contracted down to L/y. The front person’s clock is Lv/ c? behind the 
back person’s clock, so it reads —Lv/c’. 

The ending picture (when the right person catches the right ball when his 
clock reads zero) is shown in Fig. 11.55. By the time the right person catches 
the ball, the left person has moved to the right while holding the left ball. The 
left person’s clock is ahead, so it reads Lu/c?. 

(b) By looking at the distances in the figures, we see that the people travel a distance 
yL—L/y. 

Let’s now use the clock readings to get the distance. The total time for the 
process is y(Lv/c*) because each person’s clock advances by Lv/c?, but these 
clocks run slow in the frame we’re working in. Since the speed of the people is 
v, the distance they travel is v(yLv/c*). This had better be equal to yL — L/y. 
And it is, because 


L 1 vw yL? 
yL sii s)=rt(1 (1 =) =. (11.83) 
y y c c 
If we then shift to the frame in which everything is at rest, we see that the proper 
distance between the balls is L, as we wanted to show. 

(c) To sum up, the proper distance between the balls decreases because in the frame 
in which the balls are initially at rest, the left person catches the left ball first 
and then drags it closer to the right ball by the time the right person catches 
that one. So it all comes down to the loss of simultaneity. 
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11.10. A bunch of L.T.’s 


Using the results from the “Passing trains” examples in Sections 11.5.1 and 11.6, the 
relative speeds and the associated y factors for the six pairs of frames are 


AB AC AD BC BD CD 


v 5c/13 4c/5 c/5 3c/5 c/5 5c/7 
y 13/12 5/3 5/2/6 5/4 5/2/6 71/246 


From the example in Section 11.6, the separations between the two events in the four 
frames are 


A B C D 


Ax -L L 5L 0 
At 5L/e 5L/e 7L/e 2J6L/c 


The Lorentz transformations are 


Ax = y (Ax + v At’), 
(11.84) 
At = y (Af +v Ax /c?). 


For each of the six pairs, we’ll transform from the faster frame to the slower frame. 
This means that the coordinates of the faster frame will be on the right-hand side of 
the L.T.’s. The sign on the right-hand side of the L.T.’s will therefore always be a 
“+.” In the AB case, for example, we’ll write “Frames B and A,” in that order, to 
signify that the B coordinates are on the left-hand side and the A coordinates are on 
the right-hand side. We’ll simply list the L.T.’s for the six cases, and you can check 


that they do indeed all work out. 
5c 5L 
13/\ cjJ)’ 


5L 13 (= sce) 


13 
Frames B and A : L= ( L 


c c2 


Frames C and A : 5L (= (55) 
3 5 c 


mM 5(5L CD 
c 3\ c c2 f 


Frames D and A : 0 z ( L 816) 
i ~ 2/6 5 c > 
2/6L 5 (5L §(-L) 
Cc! l c c2 ) 


5 3c 5L 
Frames C and B : ILS AC } E )( )). (11.85) 
c 


11.13 Solutions 


Frames B and D : L zl ; (<) (““)). 
5L 5 (2/6L $0) 
c ~ 2/6 È i c2 i 
Frames C and D : sL= i 04 (=) 2/6L 
2/6 T c 


IL 7 {we FO 
c 2/6 ne 
11.11. Equal speeds 
FIRST SOLUTION: Let C move at speed v with respect to the ground, and let the relative 
speed of C and both A and B be u (as viewed by C). Then two different expressions 
for u are the relativistic subtraction of v from 4c/5, and the relativistic subtraction of 
3c/5 from v. Therefore (dropping the c’s), 
4 3 
4_y y—3 
a os (11.86) 
1—30 l- šv 


This gives 0 = 35v? —74v+35 = (5v—7)(Tv—5). Since the v = 7/5 root represents 
a speed larger than c, we must have 
v= Ža (11.87) 
7 


Plugging this back into Eq. (11.86) gives u = c/5. 


SECOND SOLUTION: With v and u defined as above, two different expressions for v 
are the relativistic subtraction of u from 4c/5, and the relativistic addition of u to 
3c/5. Therefore, 


4 3 
57u ztu 


=v= : 
1— fu 1+ du 


(11.88) 


This gives 0 = 5u? — 26u + 5 = (5u — 1)(u — 5). Since the u = 5 root represents a 
speed larger than c, we must have 
=. 11.89 
u 5 ( ) 
Plugging this back into Eq. (11.88) gives v = 5c/7. 


THIRD SOLUTION: The relative speed of A and B is 
4_ 3 5 


5 6 
4,3 43° 
1-3-3 13 
From C’s point of view, this 5/13 is the result of relativistically adding u with another 


u. Therefore, 


(11.90) 


5 2u 
13 1+2 
as in the second solution. 


5u? — 26u + 5 = 0, (11.91) 


11.12. More equal speeds 
Let u be the speed at which C sees A and B approaching her. So u is the desired speed 
of C with respect to B, that is, the ground. From C’s point of view, the given speed v 
is the result of relativistically adding u with another u. Therefore (dropping the c’s), 
2u 1-VJ1l—v? 


= = ‘ 11.92 
ie 1 +u? j v ( ) 
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11.13. 


11.14. 


The quadratic equation for u also has a solution with a plus sign in front of the square 
root, but this solution cannot be correct, because it is greater than 1 (and in fact goes 
to infinity as v goes to zero). The above solution for u has the proper limit as v goes 
to zero, namely u — v/2, which can be obtained by using the Taylor expansion for 
the square root. 

The ratio of the distances CB and AC in the lab frame is the same as the ratio of the 
differences in the velocities in the lab frame (because both 4 and C run into B at the 
same time, so you could imagine running the scenario backwards in time). Therefore, 


CB Vc-Vs Ee =p 
AC” V4—Vo 7 y— y7 
Vv 


1—41- vx? 
v1 — v2- (1—9?) 
1 
= —— =y. (11.93) 
VI=v2 
We see that C is y times as far from B as she is from A, as measured in the lab frame. 
Note that for nonrelativistic speeds, we have y © 1, so C is midway between A and B, 
as expected. An intuitive reason for the simple factor of y is the following. Imagine 
that A and B are carrying identical jousting sticks as they run toward C. Consider 
what the situation looks like when the tips of the sticks reach C. In the lab frame (in 
which B is at rest), B’s stick is uncontracted, but A’s stick is length contracted by a 
factor y. Therefore, in the lab frame, A is closer to C than B is, by a factor y. 


y= 


Equal transverse speeds 

From your point of view, the lab frame is moving with speed v in the negative x 
direction. The transverse velocity-addition formula, Eq. (11.38), therefore gives the 
y speed in your frame as uy/y (1 — uxv). Demanding that this equals uy gives 


2 
yA- uv) =1 Jier =O ui vest: (11.94) 
u: 


X 


or v = 0, of course. This v is simply the relativistic addition of uy with itself. This 
makes sense, because it means that both your frame and the original lab frame move 
with speed uy (but in opposite directions) relative to the frame in which the object has 
no speed in the x direction. By symmetry, therefore, the y speed of the object must 
be the same in your frame and in the lab frame. 


Relative speed 
Consider the frame S’ that travels along with the point P midway between the 
particles. S’ moves at speed v cos 8, so the y factor relating it to the lab frame is 
1 
y=. 
V1 — v? cos?0 


Let’s find the vertical speeds of the particles in $”. Since the particles have ui, = 0, 
the transverse velocity-addition formula, Eq. (11.38), gives vsind = w,/y. 


(11.95) 


Therefore, in S’ each particle moves along the vertical axis away from P with speed 
ul, = yvsinð. (11.96) 


The speed ofone particle as viewed by the other can now be found via the longitudinal 
velocity-addition formula, 


, 2v sind $ 
2u, Jinv? costo 2v sin OV 1 — v2 cos26 


= A 12 v2 sin?ð 1 — v? cos 20 
LEG LeTo 


(11.97) 
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11.13 Solutions 


If desired, this can be written as (for future reference in Chapter 13) 


Fadi dew 11.98 
E (1 — v? cos 20)? ` (11.98) 


Remark: If20 = 180°, then V =2v/(1 + v2), as it should. And if @ = 0°, then V = 
0, as it should. If @ is very small, then the result reduces to V ~% 2v sin 6/V1 —v?, 
which is simply the nonrelativistic addition of (essentially) the speed in Eq. (11.96) 
with itself, as it shouldbe. & 


Another relative speed 


Let the velocity of A point in the x direction, as shown in Fig. 11.56. Let S’ be the lab 
frame, and let S be 4’s frame (so frame S’ moves at velocity —u with respect to S). The 
x and y speeds of B in frame S’ are v cos @ and v sin 0. Therefore, the longitudinal and 
transverse velocity-addition formulas, Eqs. (11.31) and (11.38), give the components 
of B’s velocity in S as 


_ vcosd —u 
~~ 1—uveos@’ (11.99) 
r vsin ð ~ 1— 4 vsing i 
Y p(l—uvcosð)  l—uvcosð ` 
The total speed of B in frame S (that is, from A’s point of view) is therefore 
= [i+ = (e 2 aay 
G P. 1 — uv cos 0 
Vu? + v2 — 2uv cos 8 — u2v? sin20 
= (11.100) 


1 — uv cos 0 


If desired, this can be written as 


— 42 ie 
r=! eee (11.101) 


(1 — uv cos 6)? 


The reason why this can be written in such an organized form will become clear in 
Chapter 13. 


Remark: Ifu = v, this reduces to the result of the previous problem (if we replace 
0 by 20). If 6 = 180°, then V = (u + v)/(1 + uv), as it should. And if 0 = 0°, then 
V = |w — u|/(1 — uv), as it should. & 


Transverse velocity addition 
Assume that a clock on the particle shows a time T between successive passes of the 
dotted lines. In frame S’, the speed of the particle is u’, so the time-dilation factor is 
y’ = 1/1 — u’?. The time between successive passes of the dotted lines is therefore 
T S = y'T š 

In frame S, the speed of the particle is v/v? + u?. (Yes, the Pythagorean theo- 
rem holds for these speeds, because both speeds are measured with respect to the 
same frame.) Hence, the time-dilation factor is y = 1/v1 — v? — u?. The time 
between successive passes of the dotted lines is therefore T; = yT. Equating our 
two expressions for Ts/Ts gives 


u! Ts ~ 1 — u? 


u Ty yi>- 


(11.102) 


u 
S'e 


get s 


Fig. 11.56 
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11.17. 


Solving for u gives the desired result, 


f 
=u] -P =. (11.103) 


Yv 


REMARK: A slightly quicker method is the following. Imagine a clock at rest in S’, 
with the same x’ value as the particle. Let this clock tick simultaneously (as seen 
in S’) with the particle’s crossings of the dotted lines. Then the clock also ticks 
simultaneously with the particle’s crossings of the dotted lines in S, because the 
clock and the particle have the same x’ values. But the clock ticks slower in S by a 
factor v1 — v?. Therefore, the y speed is smaller in S by this factor, as we wanted 
to show. & 


Many velocity additions 
Let’s first check the formula for N = 1 and N = 2. When N = 1, the formula gives 


PE Py _ +6) - 0-61) _ 
PI +P U+6i)+U- fi) 

as it should. And when N = 2, the formula gives 

Py =P) — d+ ANd + 62)-d- 6)d — 2) _ Bith 


P +P (1+ 61)U + 62) + 1 — Bi) — Bo) 1 + Bi fa’ 
(11.105) 


Bi, (11.104) 


Bay = 


in agreement with the velocity-addition formula. 

Let’s now prove the formula for general N. We will use induction. That is, we will 
assume that the result holds for N and then show that it holds for N + 1. To find the 
speed, Biv +1), of the object with respect to Sy+1, we can relativistically add the speed 
of the object with respect to Sy (which is (y) ) with the speed of Sy with respect to 
Sw+1 (which is By+1). This gives 


ea oa (11.106) 
Under the assumption that our formula holds for N, this becomes 
P-P 
j Pns + pips pnya (Pt + Py) + (Pt — Py) 
MDT Te Ba te CHP Bre Py) 
Py (i+ n+) — Py — Bv+1) 
PAO + Bni) + PCO — B41) 
ga, (11.107) 
Phi HPN 


as we wanted to show. We have therefore shown that if the result holds for N, then it 
also holds for N + 1. Since we know that the result does indeed hold for N = 1, it 
therefore holds for all N. 

The expression for £y) has some expected properties. It is symmetric in the fj. 
And if the given object is a photon with 6; = 1, then Py = 0, which gives By) = 1 
as it should. And if the given object is a photon with 6; = —1, then Px = 0, which 
gives Bij) = —1 as it should. 


REMARK: Wecanuse the result of this problem to derive the v(t) given in Eq. (11.64). 
First, note that if all the 6; here are equal, and if their common value is sufficiently 
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11.13 Solutions 


small, then 

(1+ py¥ —(—By% _ ePN —e BN 
FANHA- ~ eBN + e-BN 
Let £ equal adt/c, which is the relative speed of two frames at nearby times in the 
spaceship scenario leading up to Eq. (11.64). If we let N = t/dt be the number of 
frames (and if we take the limit dt — 0), then we have reproduced the spaceship 
scenario. Therefore, the By) in Eq. (11.108) should equal the u(t) in Eq. (11.64). 
And indeed, with £ = adt/c and N = t/dt, Eq. (11.108) gives £w) = tanh(at/c), 
as desired. & 


Boy = = tanh(AN). (11.108) 


Velocity addition from scratch 


As stated in the problem, we will use the fact that the meeting of the photon and 
the ball occurs at the same fraction of the way along the train, independent of the 
frame. This is true because although distances may change depending on the frame, 
fractions remain the same, because length contraction doesn’t depend on position. 
We’ll compute the desired fraction in the train frame S’, and then in the ground 
frame S. 


TRAIN FRAME: Let the train have length Z’ in the train frame. Let’s first find the time 
at which the photon meets the ball (see Fig. 11.57). From the figure, we see that the 
sum of the distances traveled by the ball and the photon, which is vı? + ct’, must 
equal twice the length of the train, which is 2L’. The time of the meeting is therefore 
J 2L! 
t= ; (11.109) 
c+v 
The distance the ball has traveled is then vı? = 2v,L'/(c + vı), and the desired 
fraction F” is 


j 2v1 
F = : (11.110) 
c+, 

GROUND FRAME: Let the speed of the ball with respect to the ground be v, and let the 
train have length L in the ground frame (L equals L’/y, but we’re not going to use 
this). Again, let’s first find the time at which the photon meets the ball (see Fig. 11.58). 
Light takes a time L/(c — v2) to reach the mirror, because the mirror is receding at 
speed v2. At this time, the light has traveled a distance cL /(c — v2). From the figure, 
we see that we can use the same reasoning as in the train-frame case, but now with the 
sum of the distances traveled by the ball and the photon, which is ut + ct, equaling 
2cL/(c — v2). The time of the meeting is therefore 


p= 2cL 
— (e= w)(e+v) 
The relative speed of the ball and the back of the train (as viewed in the ground frame) 


is v — v2, so the distance between the ball and the back of the train at this time is 
2(v — v2)cL/[(c — v2)(c + v)]. The desired fraction F is therefore 


_ 2(v — v2)e 
~ (c=w)(e+v)~ 


We can now equate the above expressions for F’ and F. For convenience, define 
B = v/c, Bi = v/c, and By = v2/c. Then F’ = F yields 


Bi b= po 


(11.111) 


(11.112) 


= ; (11.113) 
1+8 (1—f2)0+8) 
Solving for £ in terms of 6; and f2 gives 
Bi + Bo 
= 11.114 
Po T+ BPs oe 


as desired. This problem is solved in Mermin (1983). 
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11.19. Modified twin paradox 


(a) In 4’s reference frame, the worldlines of A, B, and C are shown in Fig. 11.59. 


B’s clock runs slow by a factor 1/y. Therefore, if A’s clock reads t when B 
meets C, then B’s clock reads t/y when he meets C. So the time he gives to C 
is t/y. 

In A’s reference frame, the time between this event and the event where C 
meets A is again t, because B and C travel at the same speed. But A sees C’s 
clock run slow by a factor 1/y, so A sees C’s clock increase by t/y. Therefore, 
when A and C meet, A’s clock reads 2t, and C’s clock reads 2¢/y. In other 
words, Te = T,/y. 


(b) Let’s now look at things in B’s frame. The worldlines of A, B, and C are shown 


in Fig. 11.60. From B’s point of view, there are two competing effects that lead 
to the relation Te = T,/y. The first is that B sees A’s clock run slow, so the 
time he hands off to C is /arger than the time A’s clock reads at this moment. 
The second effect is that from this point on, B sees C’s clock run slower than 
A’s (because the relative speed of C and B is greater than the relative speed of 
A and B). It turns out that this slowness wins out over the head start that C’s 
clock had over A’s. So in the end, C’s clock reads a smaller time than A’s. Let’s 
be quantitative about this. 

Let B’s clock read tg when he meets C. Then when B hands off this time to 
C, A’s clock reads only t/y. We’ll find all relevant times below in terms of 
tz. We must determine how much additional time elapses on A’s clock and C’s 
clock, by the time they meet. From the velocity-addition formula, B sees C 
move to the left at speed 2v/(1 + v). He also sees A move to the left at speed 
v. But A had a head start of vtz in front of C, so if ¢ is the time (as viewed from 
B) between the meeting of B and C and the meeting of A and C, then 


2ut l+v 
E Sat tita t (1) (11.115) 


During this time, B sees A’s and C’s clocks increase by ¢ divided by the relevant 
time-dilation factor. For A, this factor is y = 1/y 1 — v? /c?. For C, it is 


1 1+% 
2 1—2" 
2v 
= (2) 


Therefore, the total time shown on A’s clock when A and C meet is 


t 140? 
T, ae ty 1 — v? = ty 1 — v2 4 s( Er) 1—2 
¥ 


l-—v 


(11.116) 


2t 
eee (11.117) 


V1 — vw" 


And the total time shown on C’s clock when A and C meet is 


l-v? ; l+? l-v? 
To =tg+t (ae = tg + tg T Ipa =2tg. (11.118) 


Therefore, Te = Tav 1 — v? = T,/y. 


(c) Let’s now work in C’s frame. The worldlines of A, B, and C are shown in 


Fig. 11.61. As in part (b), the relative speed of B and C is 2u/(1 + v?), and the 
time-dilation factor between B and C is (1 + v?) /A- v). Also, as in part (b), 
let B and C meet when B’s clock reads tg. So this is the time that B hands off 
to C. We’ll find all relevant times below in terms of tz. 


11.13 Solutions 


C sees B’s clock running slow, so from C’s point of view, B travels for a time 
t(l + v2)/(1 — v?) after his meeting with A. In this time, B covers a distance 
in C’s frame equal to 


1+? 2v 2vutg 
d=t = ` 11.119 
1E 5) 1 +v? 1—v? ( ) 


A must travel this same distance (from where he met B) to meet up with C. We 
can now find 7,. The time (as viewed by C) that it takes A to travel the distance 
d to reach C is d/v = 2tg/(1 — v?). But since C sees A’s clock running slow 
by a factor v 1 — v?, A’s clock will read only 

pa-s (11.120) 

s VI= u2 : 

Now let’s find Te. To find Te, we must take tẹ and add to it the extra time it 
takes A to reach C, compared with the time it takes B to reach C. From above, 
this extra time is 2t,/(1 — v2) —t,(1 + v”)/(1 — v?) = tp. Therefore, C’s clock 
reads 


To = 2p. (11.121) 


Hence, Te = TV1 — v? = T,/y. 
11.20. Throwing on a train 


(a) In the train frame, the distance is simply d = L. And the time is 
t = L/(c/3) = 3L/c. 


(b) i. The velocity of the ball with respect to the ground is (with u = c/3 and 
v =c/2) 


= z, (11.122) 


The length of the train in the ground frame is L/y1/2 = V/3L/2. Therefore, at 
time ¢ the position of the front of the train is V3L/2 + vt. And the position of 
the ball is Vgt. These two positions are equal when 


V3L ME aD 
z z 

2 Fig Vac 
Equivalently, this time is obtained by noting that the ball closes the initial 


head start of V3 L /2 that the front of the train had, at a relative speed of Vg — v. 
The distance the ball travels is d = Vgt = (Sc/7) (7L/V3c) = 5L/V3. 

ii. In the train frame, the space and time intervals are x’ = Land t’ = 3L/c, from 
part (a). The y factor between the frames is y1/2 = 2/ /3, so the Lorentz 
transformations give the coordinates in the ground frame as 


1) 2 (« c ( 3L ) ) 5L 
v + = ; 
By Okey) 
2 (3L 5%) 7L 
t= {+x /c?) = ( p J= A 
y( ne) B\e zZ Je 
in agreement with the above results. 

(c) In the ball frame, the train has length, L/1/3 = V8L/ 3. Therefore, the time it 
takes the train to fly past the ball at speed c/3 ist = (/8L/3)/(c/3) = 2V2L/c. 
And the distance is d = 0, of course, because the ball doesn’t move in the ball 
frame. 


(11.123) 


x=yx 
(11.124) 
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(d) The values of ct? — x? in the three frames are: 
Train frame: CP —x? = e BLY —L? = BL’. 
Ground frame: £ — x? = eL) = GL/ V3} = 8L. 
Ball frame: CPx =e (2/2L/c)? — (0)? = 8L?. 


These are all equal, as they should be. 
(e) The relative speed of the ball frame and the ground frame is 5c/7. Therefore, 
¥5/7 = 7 /24/6, and the times are indeed related by 


7L 7 [22L 
3e 2/6 


(£) The relative speed of the ball frame and the train frame is c/3. Therefore, 
v1/3 =3 /2V2, and the times are indeed related by 


3L 3 [NOL 

cC 2/2 

(g) The relative speed of the train frame and the ground frame is c/2. Therefore, 
viy2 = 2/ 3, and the times are not related by a simple time-dilation factor, 


because 
7L 2 (3L 
t ho <4 — ` 11.127 
gryt WE WA ( F ) ( ) 


We don’t obtain an equality because time dilation is legal to use only if the 
two events happen at the same place in one of the frames. Mathematically, the 
Lorentz transformation At = y (Ať + (v/c*) Ax! ) leads to At = y A?’ only 
if Ax’ = 0. In this problem, the “ball leaving the back” and “ball hitting the 
front” events happen at the same place in the ball frame, but in neither the 
train frame nor the ground frame. Equivalently, neither the train frame nor the 
ground frame is any more special than the other, as far as these two events go. 
So if someone insisted on trying to use time dilation, he would have a hard time 
deciding which side of the equation the y should go on. 


b=yh < ) which is true. (11.125) 


Cc 


t=yħ < ) , which is true. (11.126) 


c 


11.21. A new frame 


FIRST SOLUTION: Consider the Minkowski diagram in Fig. 11.62. In frame S, Event 
1 is at the origin, and Event 2 is at the point (2, 1). Consider now the frame S’ whose 
x’ axis passes through the point (2, 1). Since all points on the x’ axis are simultaneous 
in the frame S’ (they all have ? = 0), we see that S’ is the desired frame. From 
Eq. (11.47), the slope of the x’ axis equals B = v/c. Since the slope is 1/2, we have 
v = c/2. Note that by looking at our Minkowski diagram, it is clear that if the relative 
speed of S and S” is greater than c/2, then Event 2 occurs before Event | in S’. And 
if it is less than c/2, then Event 2 occurs after Event 1 in S’. 


Fig. 11.62 
SECOND SOLUTION: Let the original frame be S, and let the desired frame be S’. Let 
S’ move at speed v (in the positive direction) with respect to S. Our goal is to find v. 
The Lorentz transformations from S to S’ are 


Ax’ =y(Ax—vAt), At =y(At—vAx/c’). (11.128) 


<n iia We want to make Az’ equal to zero, so the second of these equations yields At — 
v Ax/c? = 0, or v = C7 At/Ax. We are given Ax = 2 and At = 1/c, so the desired 
x=0 x=2 v is c/2. 


i THIRD SOLUTION: Consider the setup in Fig. 11.63, which explicitly constructs the 
Fig. 11.63 two given events. Receivers are located at x = 0 and x = 2, and a light source is 


11.22. 


11.23. 


11.13 Solutions 


located at x = 1/2. The source emits a flash, and when the light hits a receiver we 
will say an event has occurred. So the left event happens at x = 0, ct = 1/2. And 
the right event happens at x = 2, ct = 3/2. If we want, we can shift our clocks by 
—1/(2c) in order to make the events happen at ct = 0 and ct = 1, but this shift is 
irrelevant because all we are concerned with is differences in time. 

Now consider an observer moving to the right at speed v. She sees the apparatus 
moving to the left at speed v (see Fig. 11.64). Our goal is to find the v for which the 
photons hit the receivers at the same time in her frame. Consider the photons moving 
to the left. She sees them moving at speed c, but the left-hand receiver is receding at 
speed v. So the relative speed (as measured by her) of the photons and the left-hand 
receiver is c — v. By similar reasoning, the relative speed of the photons and the 
right-hand receiver is c+ v. The light source is three times as far from the right-hand 
receiver as it is from the left-hand receiver. Therefore, if the light is to reach the two 
receivers at the same time, we must have c + v = 3(c — v). This gives v = c/2. 


Minkowski diagram units 


All points on the cf’ axis have the property that x’ = 0. All points on the hyperbola 
have the property that c?t’? — x? = 1, due to the invariance of s?. So the ct’ value at 
the intersection point A equals 1. Therefore, we simply have to determine the distance 
on the paper from A to the origin (see Fig. 11.65). We'll do this by finding the (x, ct) 
coordinates of A. We know that tan@ = £. But tan 0 = x/ct. Therefore, x = B(ct) 
(which is just the statement that x = vt). Plugging this into the given information, 


eP — x? = 1, we find ct = 1/,/1 — f?. The distance from A to the origin is then 


VER +x = ct/1 +p = +5. (11.129) 


This quantity is therefore the ratio of the unit sizes on the ct’ and ct axes, in agreement 
with Eq. (11.46). Exactly the same analysis holds for the x-axis unit size ratio. 


Velocity addition via Minkowski 

Pick a point P on the object’s worldline. Let the coordinates of P in frame S be (x, ct). 
Our goal is to find the speed u = x/t. Throughout this problem, it will be easier to 
work with the quantities 6 = v/c, so our goal is then to find B, = x/(ct). 

The coordinates of P in S’, namely (x’, ct’), are indicated by the parallelogram in 
Fig. 11.66. For convenience, let ct’ have length a on the paper. Then from the given 
information, we have x’ = vt’ = ßı(ct') = Bia. This is the distance from A to P 
on the paper. In terms of a, we can now determine the coordinates (x, ct) of P. The 
coordinates of point A are 


(x, ct)4 = (asin 0,a cos). (11.130) 
The coordinates of P, relative to A, are 
(x, ct)p_4 = (Bia cos 0, B,asin@). (11.131) 
Adding these two sets of coordinates gives the coordinates of point P as 
(x, ct)p = (asin@ + B,acos@,acos@ + Basing). (11.132) 
The ratio of x to ct at the point P is therefore 


x  sind+f,cos@  tand+f, — Bo+ fi 


bu a oso t pismo 1+ fytand 1+ ABs’ 


(11.133) 


where we have used tan 0 = v2/c = B, because S’ moves at speed v2 with respect 
to S. Ifwe change from the £’s back to the v’s, the result isu = (v2+v1)/(1+u1 v/e), 
as expected. 
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11.24. Clapping both ways 


ct 


(a) From the usual time-dilation result, A sees B’s clock run slow, so B must clap 
his hands at time intervals of At/y in order for the intervals to be At in A’s 
frame. The relevant Minkowski diagram is shown in Fig. 11.67. Let B clap at 
the spacetime locations where the horizontal lines (lines of simultaneity in A’s 
frame) intersect B’s worldline. From the given information, the vertical spacing 
between the lines is c At. Therefore, the spacing along B’s tilted worldline 
(which has a slope of +1/£) is y1 + B2 c At. But we know from Eq. (11.46) 
that the unit size of B’s ct axis on the paper is y (1 + 62)/(1 — 8?) times the 


unit size of A’s ct axis. Therefore, the time interval between claps in B’s frame is 


CERBE a See (11.134) 
vV +823) — B?) 4 


as above. 

(b) From the usual time-dilation result, B sees A’s clock run slow, so A must 
clap his hands at time intervals of At/y in order for the intervals to be Aż in 
B’s frame. However, B can invoke this standard time-dilation reasoning only 
during the parts of the trip where he is in an inertial frame. That is, he cannot 
invoke it during the turnaround period. We therefore have the situation shown 
in Fig. 11.68. Let A clap at the spacetime locations where the tilted lines (lines of 
simultaneity in B’s frame) intersect A’s worldline. From the given information, 
the tilted spacing between the lines along B’s worldline is c At time units, which 
has a length /(1 + B)/(1 — 8°) c At on the paper. But then by the same type 
of geometry reasoning that led to Eq. (11.49), the vertical spacing between the 
lines along A’s worldline is (as you should verify) y 1 — £? c At, which gives 
a time interval of At/y, as above. 

But the critical point here is that since the slope of the tilted lines abruptly 

Fig. 11.68 changes when B turns around, there is a large interval of time in the middle 

of A’s worldline where the tilted lines don’t hit it. The result is that A claps 

frequently for a while, then doesn’t clap at all for a while, then claps frequently 
again. The overall result, as we will now show, is that more time elapses in 

A’s frame (it will turn out to be 2L/v, of course, where L is the distance in 

A’s frame to the star) than in B’s frame (2L/yv, from the usual time-dilation 

result). 

Since the time between claps is shorter in A’s frame than in B’s frame (except 

in the middle region), the time elapsed on A’s clock while he is clapping is 1/y 

times the total time elapsed on B’s clock, which gives (2L/yv)/y = 2L/y7v. 

But as shown in Fig. 11.68, the length of the region on A’s ct axis where there 

is no clapping is 28L, which corresponds to a time of 2vL/c?. The total time 

elapsed on A’s clock during the entire process is therefore 


2L 2L 2L 1 wv 2E i v? vw 2L 
yu e vily' e) v e) e) v’ 


(11.135) 


as expected. 


11.25. Acceleration and redshift 


There are various ways to solve this problem, for example, by sending photons 
between the people, or by invoking the Equivalence Principle in General Relativity. 
We’ll do it here by using a Minkowski diagram, to demonstrate that this redshift result 
can be derived perfectly fine using only basic Special Relativity. 

Draw the world lines of the two people, A and B, as seen by an observer, C, in the 
Fig. 11.69 frame where they were both initially at rest. We have the situation shown in Fig. 11.69. 
Consider an infinitesimal time Aż, as measured by C. At this time (in C’s frame), 
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A and B are both moving at speed a At. The axes of the A frame are shown in Fig. 11.70. 
Both A and B have moved a distance a(At)?/2, which can be neglected because At 
is small (it will turn out that the leading-order terms in the result are of order At, 
so any (At)? terms can be ignored.) Also, the special-relativistic time-dilation factor 
between any of the A, B, C frames can be neglected, because the relative speeds are 
at most v = aAt, so the time-dilation factors differ from 1 by order ( At). Let A 
make a little explosion (call this event £1) at time At in C’s frame. Then Aż is also 
the time of the explosion as measured by A, up to an error of order (Ad)?. 

Let’s figure out where A’s x axis (that is, the “now” axis in 4’s frame) meets B’s 
worldline. The slope of A’s x axis in the figure is v/c = a At/c. So the axis starts at 
a height c At, and then climbs up by the amount ad At/c, over the distance d (the 
distance is indeed d, up to corrections of order (At)*). Therefore, the axis meets B’s 
worldline at a height c At +ad At/c as viewed by C, that is, at a time At+ ad At/c?, 
as viewed by C. But C’s time is the same as B’s time (up to order (Af)*), so B’s clock Fig. 11.70 
reads At(1 + ad/c?). Let’s say that B makes a little explosion (event £2) at this time. 

Events £; and Ez both occur at the same time in A’s frame, because they both lie 
along a line of constant time in A’s frame. This means that in A’s frame, B’s clock 
reads At(1 + ad/c*) when A’s clock reads At. Therefore, in A’s (changing) frame, 

B’s clock is sped up by a factor, 


—=Hl+ >. (11.136) 


We can perform the same procedure to see how A’s clock behaves in B’s frame. 
Drawing B’s x axis at time At, we quickly find that in B’s (changing) frame, A’s clock 
is slowed down by a factor, 

Atı ad 


— =] - —. 11.137 
Ats c? ( ) 


We’ll see much more of these results in Chapter 14. 


REMARKS: 

1. In the usual Special Relativity situation where two observers fly past each other 
with relative speed v, they both see the other person’s time slowed down by 
the same factor. This had better be the case, because the situation is symmetric 
between the observers. But in this problem, A sees B’s clock sped up, and B sees 
A’s clock slowed down. This difference is possible because the situation is not 
symmetric between A and B. The acceleration vector determines a direction in 
space, and one person (namely B) is farther along this direction than the other 
person. 

2. Another derivation of this ad/c? result is the following. Consider the setup a 
short time after the start. An outside observer sees A’s and B’s clocks showing 
the same time. Therefore, by the usual vd /c? rear-clock-ahead result in Special 
Relativity, B’s clock must read vd/c? more than A’s, in the moving frame. The 
increase per unit time, as viewed by A, is therefore (vd/ c?)/t = ad /¢?. Note that 
any special-relativistic time-dilation or length-contraction effects are of second 
order in v/c, and hence negligible since v is small here. At any later time, we 
can repeat (roughly) this derivation in the instantaneous rest frame of A. & 


11.26. Break or not break? 
There are two possible reasonings, so we seem to have a paradox: 


e To an observer in the original rest frame, the spaceships stay the same distance d 
apart. Therefore, in the frame of the spaceships, the distance between them, d’, 
must equal yd. This is true because d’ is the distance that gets length-contracted 
down to d. After a long enough time, y will differ appreciably from 1, so the 
string will be stretched by a large factor. Therefore, it will break. 
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e Let A be the rear spaceship, and let B be the front spaceship. From A’s point of 
view, it looks like B is doing exactly what he is doing (and vice versa). A says 
that B has the same acceleration that he has. So B should stay the same distance 
ahead of him. Therefore, the string shouldn’t break. 


The first reasoning is correct (or mostly correct; see the first remark below). The 
string will break. So that’s the answer to our problem. But as with any good relativ- 
ity paradox, we shouldn’t feel at ease until we’ve explained what’s wrong with the 
incorrect reasoning. 

The problem with the second reasoning is that A does not see B doing exactly what 
he is doing. Rather, we know from Problem 11.25 that A sees B’s clock running fast 
(and B sees A’s clock running slow). A therefore sees B’s engine running faster, and 
so B pulls away from A. Therefore, the string eventually breaks.“ 

Things become more clear if we draw a Minkowski diagram. Figure 11.71 shows 
the x’ and ct’ axes of A’s frame. The x’ axis is tilted up, so it meets B’s worldline 
farther to the right than you might think. The distance PQ along the x’ axis is the 
distance that A measures the string to be. Although it isn’t obvious that this distance 
in A’s frame is larger than d (because the unit size on the x’ axis is larger than that 
in the original frame), we can demonstrate this as follows. In A’s frame, the distance 
PQ is greater than the distance PQ’. But PQ’ is simply the length of something in A’s 
frame that has length d in the original frame. So PQ’ is yd in A’s frame. And since 
PQ > yd > d in A’s frame, the string breaks. 


REMARKS: 

1. There is one slight (inconsequential) flaw in the first reasoning above. 
There isn’t one “frame of the spaceships.” Their frames differ, because 
they measure a relative speed between themselves. Therefore, it isn’t clear 
exactly what is meant by the “length” of the string, because it isn’t clear 
what frame the measurement should take place in. This ambiguity, how- 
ever, does not change the fact that A and B observe their separation to be 
(roughly) yd. 

If we want there to eventually be a well-defined “frame of the spaceships,” 
we can modify the problem by stating that after a while the spaceships stop 
accelerating simultaneously, as measured by someone in the original inertial 
frame. Equivalently, A and B turn off their engines after equal proper times. 
What A sees is the following. B pulls away from A. B then turns off his engine. 
The gap continues to widen. But A continues to fire his engine until be reaches 
B’s speed. They then sail onward, in a common frame, keeping a constant 
separation (which is greater than the original separation, by a factor y). 

2. The main issue in this problem is that it depends on exactly how we choose to 
accelerate an extended object. If we accelerate a stick by pushing on the back 
end (or by pulling on the front end), its length will remain essentially the same 
in its own frame, and it will become shorter in the original frame. But if we 
arrange for each end (or perhaps a number of points on the stick) to speed up in 
such a way that they always move at the same speed with respect to the original 
frame, then the stick will be torn apart. 

3. This problem gives the key to the classic problem of the relativistic wagon 
wheel, which can be stated as follows (you may want to cover up the following 
paragraph, so you can solve the problem on your own). A wheel is spun faster 


44 This also follows from the Equivalence Principle and the general-relativistic time-dilation effect 
which we’ll discuss in Chapter 14. Since A and B are accelerating, they may be considered (by 
the Equivalence Principle) to be in a gravitational field, with B “higher” in the field. But high 
clocks run fast in a gravitational field. Therefore, A sees B’s clock running fast (and B sees A’s 
clock running slow). 
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and faster, until the points on the rim move at a relativistic speed. In the lab 
frame, the circumference is length contracted, but the spokes aren’t (because 
they always lie perpendicular to the direction of motion). So ifthe rim has length 
2zr in the lab frame, then it has length 27 yr in the wheel frame. Therefore, in 
the wheel frame, the ratio of the circumference to the diameter is larger than 7. 
So the question is: Is this really true? And if so, how does the circumference 
become longer in the wheel frame? 

I’m putting this sentence here just in case you happened to see the first sentence 
of this paragraph as you were trying to cover it up and solve the problem on 
your own; it should probably even be a little longer, maybe like ... this. The 
answer is that it is indeed true. If we imagine little rocket engines placed around 
the rim, and if we have them all accelerate with the same proper acceleration, 
then from the above results, the separation between the engines will gradually 
increase, thereby increasing the length of the circumference. Assuming that 
the material in the rim can’t stretch indefinitely, the rim will eventually break 
between each engine. In the rotating (and hence accelerating) wheel frame, 
the ratio of the circumference to the diameter is indeed larger than x. In other 
words, space is curved in the wheel frame. This is consistent with the facts that 
(1) the Equivalence Principle (discussed in Chapter 14) states that acceleration 
is equivalent to gravity, and (2) gravitational fields in General Relativity are 
associated with curved space. & 


11.27. Successive Lorentz transformations 


It isn’t necessary, of course, to use matrices in this problem, but things look nicer if 
we do. The desired composite L.T. is obtained by multiplying the matrices for the 
individual L.T.’s. So we have 


cosh? sinh d2 cosh; sinh ¢ġı 
sinh¢@2 cosh 2 sinh¢@,; cosh ¢ı 


_ (cosh | cosh ¢2 + sinh ġı sinh ġ2 sinh ġı cosh ¢2 + cosh ġı sinh ¢2 
— \ cosh ġ; sinh ¢2 + sinh ġı coshø2 sinh dj sinh $2 + cosh ¢ cosh ¢2 


_ (cosh(ġı + ¢2) sinh(ġı + ¢2) 
~ \sinh(¢; + ¢2) cosh(ġi + 2) J` 


This is an L.T. with v = tanh(¢) + ¢2), as desired. Except for a few minus signs, 
this proof is just like the one for successive rotations in a plane. 


(11.138) 


11.28. Accelerator’s time 
Equation (11.64) gives the speed as a function of the spaceship’s time (which we are 
denoting by ¢’ here) as 


n vt’) ; 
(t) = — = tanh (ať /c). (11.139) 
c 


The person in the lab frame sees the spaceship’s clock slowed down by a factor 


1/y = y1 — B2, which means that dt = dt’ /4y/ 1 — B2. So we have 


cosh(at'/c) dt’ = £ sinh(at'/c). (11.140) 
a 


t P dt ř 
t= dt = = 
Í Í J1— pY Í 


For small a or t' (more precisely, for at’/c < 1), we obtain t ~ ¢’, as we should. For 
very large times, we essentially have 


Cc Pie É 
ta Fi [e or t = —In(2at/c). (11.141) 
a a 


The lab frame will see the astronaut read all of Moby Dick, but it will take an 
exponentially long time (not that it doesn’t already). 
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In the previous chapter, we dealt only with abstract particles flying through 
space and time. We didn’t concern ourselves with the nature of the particles, 
how they got to be moving, or what would happen if various particles interacted. 
In this chapter we will deal with these issues. That is, we will discuss masses, 
forces, energy, momentum, etc. The two main results of this chapter are that the 
momentum and energy of a particle are given by 


p=ymv, and E=ymce’, (12.1) 


where y = 1/,/1—v2/c?, and m is the mass of the particle! When v <c, the 
expression for p reduces to p= mv, as it should for a nonrelativistic particle. 


When v = 0, the expression for E reduces to the well-known E = me. 


12.1 Energy and momentum 


In this section, we’ll give some justification for Eqs. (12.1). The reasoning 
here should convince you of their truth. An alternative, and perhaps more con- 
vincing, motivation comes from the 4-vector formalism in Chapter 13. In the 
end, however, the justification for Eqs. (12.1) is obtained through experiments. 
And indeed, experiments in high-energy accelerators are continually verifying 
the truth of these expressions. (More precisely, they are verifying that these 
energy and momenta are conserved in any type of collision.) We therefore con- 
clude, with reasonable certainty, that Eqs. (12.1) give the correct expressions 
for energy and momentum. But actual experiments aside, let’s consider a few 
thought-experiments that motivate the above expressions. 


12.1.1 Momentum 


Consider the following system. In the lab frame, identical particles A and B move 
as shown in Fig. 12.1. They move with equal and opposite small speeds in the 


' There are various ways that people use the word “mass” in relativity. In particular, some 
people talk about “rest mass” and “relativistic mass.” But we won’t use these terms. We’ll use 
“mass” to refer only to what other people would call “rest mass.” See the discussion on page 590 
for more on this. 
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x direction, and with equal and opposite large speeds in the y direction. Their 
paths are arranged so that they glance off each other and reverse their motion in 
the x direction. Imagine a series of equally spaced vertical lines for reference. 
Assume that A and B have identical clocks that tick every time they cross one of 
these lines. 

Consider now the reference frame that moves in the y direction with the same 
vy as A. In this frame, the situation is shown in Fig. 12.2. The collision simply 
changes the sign of the x velocities of the particles. Therefore, the x momenta 
of the two particles must be the same. This is true because if, say, A’s py were 
larger than B’s py, then the total py would point to the right before the collision, 
and to the left after the collision. Since momentum is something we want to be 
conserved, this cannot be the case. 

However, the x speeds of the two particles are not the same in this frame. 
A is essentially at rest in this frame, and B is moving with a very large speed, 
v. Therefore, B’s clock is running slower than A’s, by a factor essentially equal 
to 1/y = y1 — v?/c?. And since B’s clock ticks once for every vertical line it 
crosses (this fact is independent of the frame), B must therefore be moving slower 
in the x direction, by a factor of 1/y. Therefore, the Newtonian expression, 
Px = muy, cannot be the correct one for momentum, because B’s momentum 
would be smaller than A’s (by a factor of 1/y), due to their different v,’s. But 
the y factor in 


mx 
Px = ymv; = ——— 
” i Jl -— v?/c? 


precisely takes care of this problem, because y ~ 1 for A, andy = 1/y1 — v?/c? 
for B, which cancels the effect of B’s smaller vy. 

To obtain the three-dimensional form of p, we can use the fact that the vector 
p must point in the same direction as the vector v, because any other direction 
for p would violate rotation invariance. If someone claims that p points in the 
direction shown in Fig. 12.3, then he would be hard-pressed to explain why it 
doesn’t instead point along the direction p’ shown. In short, the direction of v 
is the only preferred direction in space. Therefore, Eq. (12.2) implies that the 
momentum vector must be 


(12.2) 


mV 
y1- v/e ? 
in agreement with Eqs. (12.1). Note that all the components of p have the same 
denominator, which involves the whole speed, v? = v2 + v? + vŽ. The denom- 
inator of, say, py is not \/1 — v2/c*. This would yield a p that doesn’t point 
along v. 

The above setup is only one specific type of collision among an infinite 
number of possible types of collisions. What we’ve shown with this setup is that 
the only possible vector of the form f (v)mv (where f is some function) that has 


p=ymv= (12.3) 


> 


Fig. 12.2 


Fig. 12.3 


See eee 
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any chance at being conserved in all collisions is ymv (or some constant multiple 
of this). We haven’t proved that it actually is conserved in all collisions. This is 
where the gathering of data from experiments comes in. But we’ve shown above 
that it would be a waste of time to consider, for example, the vector y> mv. 


12.1.2 Energy 


Having given some justification for the momentum expression, p = ymy, let’s 
now try to justify the energy expression, 


E = yme. (12.4) 


More precisely, we’ll show that the above form of the momentum implies that 
ymc? is conserved in interactions (or at least in the specific interaction below). 
There are various ways to do this. The best way, perhaps, is to use the 4-vector 
formalism in Chapter 13. But we’ll study one simple setup here that should 
do the job. 

Consider the following system. Two identical particles of mass m head toward 
each other, both with speed u, as shown in Fig. 12.4. They stick together and 
form a particle of mass M. M is at rest, due to the symmetry of the situation. At 
the moment we can’t assume anything about the size of M, but we’ll find below 
that it does not equal the naive value of 2m. This setup is a fairly uninteresting 
one (conservation of momentum gives 0 = 0), so let’s instead consider the less 
trivial view of it in the frame moving to the left at speed u. This situation is 
shown in Fig. 12.5. The right mass 1s at rest, the left mass moves to the right at 
speed v = 2u/(1+ u) from the velocity-addition formula,” and the final mass M 
moves to the right at speed u. Note that the y factor associated with the speed v is 


1 1 
Yu = — z: 


-er z NB 
1- (24) 


Conservation of momentum in this collision then gives 


1+ 2u 
ymv+0=yMu => m 


(12.5) 


7 Mu 
1 — u? 142) sI 
2m 


Klar - 


Conservation of momentum therefore tells us that M does not equal 2m. But if u 
is very small, then M is approximately equal to 2m, as we know from everyday 


= M= (12.6) 


experience. 


> We're going to set c = 1 for a little while here, because this calculation would get a bit messy if 
we kept in the c’s. We’ll discuss the issue of setting c = 1 in more detail at the end of this section. 
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Using the value of M from Eq. (12.6), let’s now check that our candidate for 
energy, E = ymc’, is conserved in this collision. There is no freedom left in any 
of the parameters, so ymc* is either conserved or it isn’t. In the original frame 
where M is at rest, E is conserved if 


2m 1 
2 2 
yoMc? = Uyum?) = ——— =2 (=) m, (12.7 
' Ta eF i 
which is indeed true. Let’s also check that E is conserved in the frame where 
the right mass is at rest. Æ is conserved if 


yame? + yomc? = yuMe? 


(5) P M 
1-w V1 — uw 


2m =( 2m ) 1 (12.8) 
1-2 V1 — u2 VI- u l 


which is indeed true. So Æ is also conserved in this frame. This example should 
convince you that ymc? is at least a believable expression for the energy of a 
particle. But just as in the case of momentum, we haven’t proved that ymc? 
actually is conserved in all collisions. This is the duty of experiments. But 
we’ve shown that it would be a waste of time to consider, for example, the 
quantity y4mc?. 

One thing that we certainly need to check is that if E and p are conserved in 
one reference frame, then they are conserved in any other. We’ll demonstrate this 
in Section 12.2. A conservation law shouldn’t depend on what frame you’re in, 
after all. 


REMARKS: 


1. As mentioned above, we’re technically not trying to justify Eqs. (12.1) here. These two 
equations by themselves are devoid of any meaning. All they do is define the letters p and 
E. Our goal is to make a meaningful physical statement, not just a definition. 

The meaningful physical statement that we want to make is that the quantities ymv 
and ymc? are conserved in an interaction among particles (and this is what we tried to 
justify above). This fact then makes these quantities worthy of special attention, because 
conserved quantities are very helpful in understanding what is happening in a given 
physical situation. And anything worthy of special attention certainly deserves a label, 
so we may then attach the names “momentum” and “energy” to ymv and ymc?. Any 
other names would work just as well, of course, but we choose these because in the limit 
of small speeds, ymv and ymc? reduce (as we will soon show) to some other nicely 
conserved quantities, which someone already tagged with the labels “momentum” and 
“energy” long ago. 

2. As we’ve noted, the fact of the matter is that we can’t prove that ymv and ymc? are 
conserved. In Newtonian physics, conservation of p = mv is basically postulated by 
Newton’s third law, and we’re not going to be able to do any better than that here. All we 
can hope to do as physicists is provide some motivation for considering ymv and ymc?, 
then show that it is consistent for ymv and ymc* to be conserved during an interaction, 
and then gather a large amount of experimental evidence, all of which is consistent with 
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ymy and ymc~ being conserved. As far as the experimental evidence goes, suffice it to 
say that high-energy accelerators, cosmological observations, and many other forums are 
continually verifying everything that we think is true about relativistic dynamics. If the 
theory isn’t correct, then we know that it must be the limiting case of a more correct 
theory.’ But all this experimental induction has to count for something ... 


“To three, five, and seven, assign 
Aname,” the prof said, “We’ll define.” 
But he botched the instruction 

With woeful induction 

And told us the next prime was nine. 


3. Conservation of energy in relativistic mechanics is actually a much simpler con- 
cept than in nonrelativistic mechanics, because E = ym is conserved, period. We 
don’t have to worry about the generation of internal energy (heat or internal poten- 
tial energy), which ruins conservation of the nonrelativistic E = mv?/2. The internal 
energy is simply built into the total energy. In the above example, the two m’s col- 
lide and generate internal energy (heat) in the resulting mass M. This internal energy 
shows up as an increase in mass, which makes M larger than 2m. The energy that 
corresponds to the increase in mass comes from the initial kinetic energy of the 
two m’s. 

4. Problem 12.1 gives an alternate derivation of the energy and momentum expressions in 
Eq. (12.1). This derivation uses additional facts, namely that the energy and momentum 
of a photon are given by E = hv and p = hv/c, where v is the frequency of the light, and 
his Planck’s constant. & 


Any multiple of ymc? is also conserved, of course. Why did we pick ymc? to 
label as “E” instead of, say, 5ymc?? Consider the approximate form that y mc? 
takes in the Newtonian limit, that is, in the limit v < c. We have, using the 
Taylor series expansion for (1 — x)~!/?, 


m e 


aaa 


2 v 3wt 


E = ym’ = 


2 1 2 
mc“ + z” +e. (12.9) 


The dots represent higher-order terms in v? /c?, which may be neglected ifv < c. 
In an elastic collision in Newtonian physics, no heat is generated, so mass is 
conserved. That is, the quantity mc? has a fixed value. We therefore see that 
conservation of E = ymc? reduces to the familiar conservation of Newtonian 
kinetic energy, mv? /2, for elastic collisions in the limit of slow speeds. This is an 
example of the correspondence principle, which says that relativistic formulas 
must reduce to the familiar nonrelativistic ones in the nonrelativistic limit. Like- 
wise, for the momentum we picked p = ymv instead of, say, 6ymc*v, because 


3 And in fact it isn’t correct, because it doesn’t incorporate quantum mechanics. The more complete 
theory that includes both relativity and quantum mechanics is quantum field theory. 
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the former reduces to the familiar Newtonian momentum, mv, in the limit of slow 
speeds. 


Whether abstract, profound, or just mystic, 
Or boring, or somewhat simplistic, 

A theory must lead 

To results that we need 


In limits, nonrelativistic. 


Whenever we use the term “energy,” we will mean the total energy, ymc’. If 
we use the term “kinetic energy,” we will mean a particle’s excess energy over 
the energy it has when it is motionless. In other words, the kinetic energy is 
ymc* — mc?. Kinetic energy is not necessarily conserved in a collision, because 
mass is not necessarily conserved, as we saw in Eq. (12.6) in the above example. 
In the CM frame, there was kinetic energy before the collision, but none after. 
Kinetic energy is a rather artificial concept in relativity. You virtually always 
want to use the total energy, ymc*, when solving a problem. 


Note the following important relation, 


2 er eS eee: 
E% — |p|“ = = 


= mc, (12.10) 


This is a primary ingredient in solving relativistic collision problems, as we’ll 
soon see. It replaces the K = p”/2m relation between kinetic energy and momen- 
tum in Newtonian physics. It can be derived in more profound ways, as we’ ll see 
in Chapter 13. It’s so important that I like to call it the Very Important Relation. 
Let’s put it in a box: 


P=pe+m |. (12.11) 


Whenever you know two of the three quantities £, p, and m, this equation gives 
you the third. In the case where m = 0 (as with photons), Eq. (12.11) says that 


E =pc (for photons). (12.12) 


This is the key equation for massless objects. For photons, the two equations, 
p = ymy and E = ymc’, don’t tell us much, because m = 0 and y = oo, 
so their product is undetermined. But E? — |p|?c* = m?c* still holds, and we 
conclude that E = pc. Note that any massless particle must have y = oo. That 
is, it must travel at speed c. If this weren’t the case, then E = ymc? would equal 
zero, in which case the particle wouldn’t be much of a particle. We’d have a hard 
time observing something with no energy. 
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Another nice relation, which follows from Eqs. (12.1) and holds for particles 
of any mass, is 


p y 
T= 12.13 
pZ ( ) 
Given p and E, this is definitely the quickest way to get v. 
The general size of mc? 
29 4 2 


What is the general size of mc If we let m = 1kg, then we have mc* = 
(1kg)(3 - 108 m/s)? ~ 10!7J. How big is this? A typical household electric 
bill might be around $50 per month, or $600 per year. At about 10 cents per 
kilowatt-hour, this translates to 6000 kilowatt-hours per year. Since there are 
3600 seconds in an hour, this converts to (6000)(10°)(3600) ~ 2 - 10'° watt- 
seconds. That is, 2 - 10!° joules per year. We therefore see that if one kilogram 
were converted completely into usable energy (that is, kinetic energy, which can 
then be used to drive a turbine), it would be enough to provide electricity to about 
10!7/(2- 10!°), or 5 million, homes for a year. That’s a lot. 

In a nuclear reactor, only a small fraction of the mass is converted into usable 
energy (heat). Most of the mass remains in the final products, which doesn’t help 
in lighting your home. If a particle were to combine with its antiparticle, then it 
would be possible for all of the mass energy to be converted into usable energy. 
But we’re a while away from being able to do this productively. However, even a 
small fraction of the very large quantity, E = mc?, can be large, as evidenced by 
the use of nuclear power and nuclear weapons. Any quantity with a few factors 
of c is bound to change the face of the world. 


Mass 

Some treatments of relativity use the term “rest mass,” mo, to refer to the mass of 
a motionless particle, and “relativistic mass,” mye, to refer to the quantity y mo of 
a moving particle. We won’t use this terminology here. The only thing we’ll call 
“mass” is what the above treatments call “rest mass.” (For example, the mass of 
an electron is 9.11 - 107°?! kg, and the mass of a liter of water is 1 kg, independent 
of their speed.) And since we’ll refer to only one type of mass, there is no need 
to use the qualifier “rest” or the subscript “0.” We’ll therefore simply use the 
notation “m.” 

Of course, you can define the quantity ym to be anything you want. There’s 
nothing wrong with calling it “relativistic mass.” But the point is that ym already 
goes by another name. It’s just the energy, up to factors of c. The use of the word 
“mass” for this quantity, although quite permissible, is certainly not needed. 

Furthermore, the word “mass” is used to describe what is on the right-hand 
side of the equation, E? — |p|*c? = m?. The m? here is an invariant, that is, 


4 See Fadner (1988) for a history of the mass-energy relation. 


12.1 Energy and momentum 


it is something that is independent of the frame of reference. Although E and 
p depend on the frame because they involve v, this v dependence cancels out 
when taking the difference of their squares, yielding a frame-independent m?. If 
“mass” is to be used in this definite way to describe an invariant, then it doesn’t 
make sense to also use it to describe the quantity ym, which is frame-dependent. 
The prefixes “rest” and “relativistic” are introduced to avoid this problem, but 
the result of this is just a watering down of the very important invariance quality 
of “mass.” 

However, since there is in fact nothing actually wrong with using “relativis- 
tic mass,” it mainly comes down to personal preference whether or not you 
like the term. Certainly one motivation for labeling ym as some kind of mass 
is that the expression for momentum looks like p = ymov = mrev, which mim- 
ics the Newtonian expression. And the expression for energy takes the nice form, 
E = ymoc* = mec?. But considering that Newtonian physics is only a limiting 
case of relativistic physics, it is questionable what is gained by molding a theory 
that is more correct into one that is less correct. At any rate, my view is that 
these nice formulas don’t outweigh the usefulness of having the word “mass” 
mean a very specific invariant quantity, with the word “energy” referring to the 
frame-dependent quantity ym. Invariant quantities have a certain sacred place 
in physics, so they should be given a name that doesn’t have any noninvariant 
connotations. 


Lagrangian method 

Another way to motivate the above expressions for E and p is to use the 
Lagrangian method. To take this route, we must of course figure out what the 
relativistic Lagrangian is. This might seem like a daunting task, considering that 
the Lagrangians we dealt with in Chapter 6 involved the kinetic energy, and we 
have no idea yet what the energy is here (that’s one of our goals, after all). 

It turns out, however, that it’s fairly easy to write down the relativistic 
Lagrangian, provided that we think outside the “7 — V” box and concentrate 
instead on the all-important fact that the action is something that is stationary 
for the classical path. Combining this fact with the relativity postulate, which 
states that there is no preferred reference frame, we see that the action must be 
stationary in all frames. A sensible property to require of the action is therefore 
that it be independent of the frame. This way, if it is stationary in one frame, then 
it is stationary in any other.” 

What invariant quantities do we know of? Well, there are certainly m and c. 
But the more interesting one that we’re familiar with is the invariant interval (the 
proper time), cdt = Vc? dt? — dx?. This is pretty much all we have to work 


> I suppose you could imagine a quantity that depends on the frame, but that is still stationary in 
any frame. But clearly the simplest thing to try (which will in fact work) is something that is 
independent of the frame. 
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with, so let’s try an action of the form (we’ll just deal with a free particle), 
S= -me f dt = -me | y c? dt? — dx? = -me f vc? — x? dt. (12.14) 


We have put the mc out front to give S the usual units of energy times time. And 
the minus sign (which doesn’t affect the stationary property) is included so that 
we will obtain the correct signs for the energy and momentum. Our Lagrangian 
is therefore 


L = -mcy c2 — x?. (12.15) 


The Euler-Lagrange equation is thus 


d (ƏL ðL d mcx 
—({—)ja— as —(——_)=0. 12.16 
“ (5) Ox = ( =) ( ) 


The quantity in the parentheses can be written as 


i = ym, (12.17) 
V1 =v? /c? 
This is simply the momentum we defined earlier, which, in view of Eq. (12.16), 
is conserved, as it should be. What about the energy? The energy associated with 
our Lagrangian is given by (using Eq. (6.52)) 


aL ý 
B=32-L=3/ Ha ) ( mey — i? ) 
Ox c2 — x2 


P 


mc 3 MC g 


= = 2 
a Jane = Ji we yme’, (12.18) 


which agrees with our earlier expression for Æ. We can also calculate the angular 
momentum. In polar coordinates, the Lagrangian can be written as 


L = —mc;/ c2 — (72 +726). (12.19) 


The Euler-Lagrange equation obtained from varying 8 is therefore 


d (aL\ ƏL d mer? 
coy (peas) ee ee : =0. (12.20) 
dt \ 30 00 dt \ Se? — (72 + 7262) 
The quantity in the parentheses, which we see is conserved, can be written 
as ymr?6. Because it is associated with the angle 0, we call it the angular 


momentum. And as expected, it reduces properly to the nonrelativistic result 
when y = 1. 


12.1 Energy and momentum 


Of course, this Lagrangian justification for E and p might seem a little silly, 
because we’re just dealing with a free particle, and any quantity involving v is 
conserved for a free particle. But the fact that the above E and p can be derived 
from a Lagrangian involving an invariant quantity (which is the only nontrivial 
invariant quantity we know of) gives them a reasonable amount of credence. 

If we want to go beyond a free particle and incorporate a potential energy into 
the system, things get more complicated; see Brehme (1971). The first thought 
that comes to mind is to simply subtract off a potential V (x) in the Lagrangian, 
as we did in the nonrelativistic case. We then have L = —mceV c? — x? — V(x), 
and the Euler-Lagrange equation becomes 


d (aL ðL d mcă OV dp F(x) 
=s — = — => = => — = Afe 
dt \ ax Ox dt C — x2 Ox dt 


(12.21) 


This is the correct F = dp/dt statement, so it seems that we picked the right 
Lagrangian. However, there is something amiss here; the action now involves the 
quantity f V (x) dt which isn’t invariant, because the time in one frame doesn’t 
equal the time in another. Although the above Lagrangian does indeed yield 
the correct equation of motion in the particular reference frame in which the 
function V (x) is defined, if we switch to another frame there is no guarantee that 
it will do so again. In order to maintain an invariant action, we must construct 
it from 4-vectors (discussed in Chapter 13), or other tensors, multiplied in an 
appropriate way. The classic system involving 4-vectors is a charged particle 
in an electromagnetic field. Other systems involving more complicated tensors 
occur in General Relativity. But we’re not going to get into any of that here, 
because our goal was only to motivate the expressions for E and p. ° 


Setting c = 1 

For the remainder of our treatment of relativity, we will invariably work in units 
where c = 1. For example, instead of one meter being the unit of distance, we 
can make 3 - 108 meters equal to one unit. Or, we can keep the meter as is, and 
make 1/(3 - 108) seconds the unit of time. In such units, our various expressions 
become 


p=ymv, E=ym, E =p +m, r =v. (12.22) 
Said in another way, you can simply ignore all the c’s in your calculations (which 
will generally save you a lot of strife), and then put them back into your final 


answer to make the units correct. For example, let’s say the goal of a certain 


é In multiparticle systems, it turns out that in general it is impossible to create a consistent relativistic 
Lagrangian formalism. The only way to proceed is to work in terms of fields instead of particles. 
But this is well beyond the scope of what we’re doing here. See Goldstein et al. (2002). 
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problem is to find the time of some event. If your answer comes out to be £, 
where £ is a given length, then you know that the correct answer (in terms of the 
usual mks units) has to be £/c, because this has units of time. In order for this 
procedure to work, there must be only one way to put the c’s back in at the end. 
This is always the case, because if there were two ways, then we would have 
c^ = c?, for some numbers a Æ b. But this is impossible, because c has units. 


12.2 Transformations of E and p 


Consider the following one-dimensional situation, where all the motion is along 
the x axis. A particle has energy E£’ and momentum p’ in frame S’. Frame S’ moves 
at speed v with respect to frame S, in the positive x direction (see Fig. 12.6). What 
are E and p in S? 
Let u’ be the particle’s speed in S”. From the velocity-addition formula, the 
particle’s speed in S is (dropping the factors of c) 
w +v 


= : 12.23 
j l+u'v ( ) 


This is all we need to know, because a particle’s velocity determines its energy 
and momentum. But we’ll need to go through a little algebra to make things look 
nice and pretty. The y factor associated with the speed u is 


1 lt+u'v 


Ji (438) ~ V0= wd) 


I+u/v 


Yu = = yyyyl+tu'v). (12.24) 


The energy and momentum in S’ are 
E’=yym, and p = yymu', (12.25) 
while the energy and momentum in S are, using Eq. (12.24), 


E = yım = Yw Yo(l + u'v)m, 


/ (12.26) 


+ uv 


u 
P = Yumu = Yy Youd + u'v)m ( ) = yw Yo (u + v)m. 


u'u 
Using the Z’ and p’ from Eqs. (12.25), we can rewrite E and p as (with y = yy) 
E = y(E' + vp’), 

(12.27) 
p= yp + vE’). 


These are transformations for E and p between frames. If you want to put the 
factors of c back in, then the vE” term becomes vE"/ c’, to make the units correct. 
These transformations are easy to remember, because they look exactly like the 


12.2 Transformations of £ and p 


Lorentz transformations for the coordinates ź and x in Eq. (11.21). More precisely 
with the c’s included, E and pc transform like ct and x, respectively.’ This is no 
coincidence, as we will see in Chapter 13. As a check on Eqs. (12.27), if u = 0 
(so that p' = 0 and E’ = m), then E = ym and p = ymv, as expected. Also, if 
u’ = —v (so that p' = —ymv and E’ = ym), then E = mand p = 0, as expected. 

Because the transformations in Eqs. (12.27) are linear, they also hold if £ 
and p represent the total energy and momentum of a collection of particles. 
That is, 


we=r(Le tee), 
Xr=r ($r +0 E). 


In fact, any (corresponding) linear combinations of the energies and momenta 
are valid here, in place of the sums. For example, we can use the combinations 
(E? +35 — TE2) and (p? + 3p5 — Tp) in Eqs. (12.27), where the subscripts 
indicate which particle, and the superscripts indicate before or after a colli- 
sion. You can verify this by simply taking the appropriate linear combination of 
Eqs. (12.27) for the various particles. This consequence of linearity is a very 
important and useful result, as will become clear in the remarks below. 
You can use Eqs. (12.27) to show that 


(12.28) 


Pap = E? — p°, (12.29) 


just as we showed that £? —x? = t? —x’? in Eq. (11.40). The proof there was based 
on the fact that ¢ and x transform under the Lorentz transformations, so exactly 
the same proof works here with £ and p, in view of the Lorentz transformations 
in Eqs. (12.27). The £’s and p’s in Eq. (12.29) can represent any (corresponding) 
linear combinations of the E’s and p’s of the various particles (for example, the 
total E and p of the particles), due to the linearity of Eqs. (12.27). For one particle, 
we already know that Eq. (12.29) is true, because both sides are equal to m?, from 
Eq. (12.10). For many particles, the invariant quantity Eci — Pi equals the 
square ofthe total energy in the CM frame (which reduces to m? for one particle), 


because Ptotal = 0 in the CM frame, by definition. 


REMARKS: 


1. In the previous section, we said that we needed to show that if E and p are conserved in one 
reference frame during a collision, then they are conserved in any other frame (because 
a conservation law shouldn’t depend on what frame you’re in). This can be shown as 
follows. The total AZ is a linear combination of the initial and final £’s, and likewise for 


7 Since the Lorentz transformations are symmetric in ct and x, and likewise in £ and pc, it isn’t clear 
which coordinate in one transformation corresponds to which coordinate in the other. But since 
x and p are the components of vectors, while ¢ and E aren’t, the correct correspondence must be 
ct <—> Eandx <—> pe. 
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Ap. Therefore, since Eq. (12.27) is a linear equation in the £’s and p’s, it also holds for 
the AE’s and Ap’s. That is, 


AE = y(AE’ +vAp’), and Ap=y(Ap'+vAE’). (12.30) 


So if the total AE’ and Ap’ in S’ are zero, then the total AE and Ap in S must also be zero. 

2. Equation (12.30) makes it clear that if you accept the fact that p = ymv is conserved in 
all frames, then you must also accept the fact that E = ym is conserved in all frames 
(and vice versa). This is true because the second of Eqs. (12.30) says that if Ap and 
Ap’ are both zero, then AE’ must also be zero. E and p have no choice but to go hand 
inhand. & 


Equation (12.27) applies to the x component of the momentum. How do the 
transverse components, p, and pz, transform? Just as with the y and z coordinates 
in the Lorentz transformations, p, and pz don’t change between frames. The 
analysis in Chapter 13 makes this obvious, so for now we’ll simply state that if 
the relative velocity between the frames is in the x direction, then 


Py=py, and p, =p. (12.31) 


If you really want to show explicitly that the transverse components don’t change 
between frames, or if you’re worried that a nonzero speed in the y direction will 
mess up the relationship between p, and E that we calculated in Eq. (12.27), then 
Exercise 12.23 is for you. But it’s a bit tedious, so feel free to settle for the much 
cleaner reasoning in Chapter 13. 


12.3 Collisions and decays 


The strategy for studying relativistic collisions is the same as that for studying 
nonrelativistic ones. You just have to write down all the conservation of energy 
and momentum equations, and then solve for whatever variables you want to 
solve for. The conservation principles are the same as they’ve always been. The 
only difference is that now the energy and momentum take the new forms in 
Eqs. (12.1). In writing down the conservation of energy and momentum 
equations, it proves extremely useful to put Æ and p together into one 
four-component vector, 


P = (E, p) = (E, Px, Py, Pz). (12.32) 


This is called the energy-momentum 4-vector, or the 4-momentum, for short. If we 
were keeping in the factors of c, then the first term would be E'/c, although some 
people instead multiply the p by c; either convention is fine. Our notation in this 
chapter will be to use an uppercase P to denote a 4-momentum and a lowercase 
p or p to denote a spatial momentum. The components of a 4-momentum are 
usually indexed from 0 to 3, so that Py) = E, and (P1, P2,P3) = p. For one 
particle, we have 


P=(ym,ymvy, ymvy, ymvz). (12.33) 


12.3 Collisions and decays 


The 4-momentum for a collection of particles consists of the total Æ and total 
p of all the particles. There are deep reasons for considering the 4-momentum, 
as we'll see in Chapter 13, but for now we’ll just view it as a matter of conve- 
nience. If nothing else, it helps with the bookkeeping. Conservation of energy 
and momentum in a collision reduce to the concise statement, 


Poefore = Patter; (12.34) 


where these are the total 4-momenta of all the particles. 
If we define the inner product between two 4-momenta, 4 = (Ao, 41,42,43) 
and B = (Bo, B1, B2, B3), to be 


A- B = AọBo — A,B, — A2B2 — A3B3, (12.35) 


then the Very Important Relation in Eq. (12.11), E? — p? = m?, which is true for 
one particle, may be concisely written as 


P-P=m, or P?=m’, (12.36) 


where P? = P - P. In other words, the square of a particle’s 4-momentum equals 
the square of its mass. This relation will prove to be very useful in collision 
problems. Note that it is frame-independent, as we saw in Eq. (12.29). 

This inner product is different from the one we’re used to in three-dimensional 
space. It has one positive sign and three negative signs, in contrast with the usual 
three positive signs. But we are free to define it however we wish, and we 
did indeed pick a good definition, because our inner product is invariant under 
Lorentz transformations, just as the usual 3-D inner product is invariant under 
rotations. For the inner product of a 4-momentum with itself (which could be 
any linear combination of 4-momenta of various particles), this invariance is 
the statement in Eq. (12.29). For the inner product of two different 4-momenta, 
we’ll prove the invariance in Section 13.3. 


Example (Relativistic billiards): A particle with mass m and energy E 
approaches an identical particle at rest. They collide elastically® in such a way that 
they both scatter at an angle @ relative to the incident direction (see Fig. 12.7). What 
is ô in terms of E and m? What is 8 in the relativistic and nonrelativistic limits? 


Solution: The first thing we should do is write down the 4-momenta. The 
4-momenta before the collision are 


Pı = (E,p,0,0), P2 = (m,0,0,0), (12.37) 


8 An elastic collision in nonrelativistic physics is defined as one in which no heat is generated. In 
relativity, heat shows up as mass. So an elastic collision in relativistic physics is defined as one in 
which none of the masses change. 


Fig. 12.7 
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where p = y E? — m2. The 4-momenta after the collision are (primes now denote 
“after’’) 


P| = (E',p' cos0,p'sin6,0), Ps = (E’,p’ cos6, —p’ sin 0,0), (12.38) 


where p' = y E’/2 — m2. Conservation of energy gives E’ = (E + m)/2, and conser- 
vation of py gives p’ cos 0 = p/2. Therefore, the 4-momenta after the collision are 


E 
Bye (=. P, +? tan9, 0). (12.39) 


From Eq. (12.36), the squares of these 4-momenta must be m°. Therefore, 


2 
m = (= =) = (2) (1+ tan?0) 


2 2 
2 — m2 
=> An? = (E+ m)? — emi 
cos40 
E? — m? E 
=— cos? = 2 L A (12.40) 


~ E2 +2Em — 3m2 E+3m 


The relativistic limit is E >> m, which yields cos 6 © 1. This means that both particles 
scatter almost directly forward. You can convince yourself that 0 should be small by 
looking at the collision in the CM frame and then shifting back to the lab frame. The 
transverse speeds decrease during this shift of frames. 

The nonrelativistic limit is E ~ m (it’s not E © 0), which yields cos @ © 1//2. So 
0 ~ 45°, and the particles scatter with a 90° angle between them. This agrees with 
the result from the “Billiards” example in Section 5.7.2, a result which pool players 
are very familiar with. 


Note that we never wrote down any v’s in the above solution. That is, we 
never used the relations E = ymc* and p = ymv. There was no need to find 
the velocities; doing so would have essentially involved going in circles. Using 
E = ymc? and p = ymv certainly provides a valid way of solving the problem, 
but in many situations like this one, where velocities aren’t given or explicitly 
asked for, it leads to a very messy solution. A far cleaner method is to use the 
E? — p? =m’ relation. 


If ymv yields frustration, 
And the similar £, irritation, 
Just ditch all the v’s, 

And use (won’t you please) 
The Very Important Relation! 


Let’s now look at a decay. Decays are basically the same as collisions. All you 
have to do is conserve energy and momentum, as the following example shows. 


12.3 Collisions and decays 


Example (Decay at an angle): A particle with mass M and energy E decays 
into two identical particles. In the lab frame, one of them is emitted at a 90° angle, 
as shown in Fig. 12.8. What are the energies of the created particles? We’ ll give two 
solutions. The second one shows how 4-momenta can be used in a very clever and 
time-saving way. 


First solution: The 4-momentum before the decay is 
P = (E,p,0,0), (12.41) 


where p = y E? — M?. Let the created particles have mass m, and let the second 
particle make an angle 0 with the x axis. The 4-momenta after the decay are 


Pi a (E1,0,p1,0), Po = (E2,p2 cos 6, —p2 sin 6,0). (12.42) 


Conservation of py immediately gives p2cos@=p, which then implies that 
p2 sin 0 =p tan 6. Conservation of py says that the final p,’s are opposites. Therefore, 
the 4-momenta after the decay are 


P| = (£},0,ptan6,0), P = (E2,p, —p tan 0,0). (12.43) 


Conservation of energy gives E = E + E>. Writing E] and £> in terms of the 
momenta and masses, this becomes 


E= JP tan20 + m2 + JP + tan20) + m2. (12.44) 


Putting the first radical on the left side, squaring, and solving for that radical (which 
is £1) gives 


E2 -p° M2 
Ei = = —. (12.45) 
2E 2E 
In a similar manner, we find that £7 equals 
E2 2 2E2 3 M2 
pee a E l (12.46) 


2E 2E 


These add up to £, as they should. 


Second solution: With the 4-momenta defined as in Eqs. (12.41) and (12.42), 
conservation of energy and momentum can be combined into the statement, 
P = P| + P2. Therefore, 


P= Pi = Pz, 
=> (P-P): (P-P) = P2- Po, 
= P-2P.P +P? =P, 

= M- 2EE, + m = m, 
M2 


=> E = —., 12.47 
1= > ( ) 


Fig. 12.8 
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And then Ey = E — E] = QE? — M?) /2E. This solution should convince you that 
4-momenta can save you a lot of work. What happened here was that the expression 
for Pz was fairly messy, but we arranged things so that it appeared only in the form 
of P which is simply m2. 4-momenta provide a remarkably organized method for 
sweeping unwanted garbage under the rug. 


12.4 Particle-physics units 


A branch of physics that uses relativity as one of its main ingredients is 
elementary-particle physics, which is the study of the building blocks of mat- 
ter (electrons, quarks, neutrinos, etc.). It’s unfortunately the case that most 
of the elementary particles we want to study don’t exist naturally in the 
world. We therefore have to create them in particle accelerators by collid- 
ing other particles together at very high energies. The high speeds involved 
require the use of relativistic dynamics. Newtonian physics is essentially 
useless. 

What is a typical size of a rest energy, mc”, of an elementary particle? The 
rest energy of a proton (which isn’t really elementary; it’s made up of quarks, but 
never mind) is 


Ep = mpe = (1.67 - 107?" kg)(3 - 108 m/s)? = 1.5-107!° joules. (12.48) 


This is very small, of course. So a joule is probably not the best unit to work 
with. We would get very tired of writing the negative exponents over and 
over. We could perhaps work with “nanojoules,” but particle physicists like to 
work instead with the “eV,” the e/ectron-volt. This is the change in energy of an 
electron when it passes through a potential of one volt. The electron charge is 
(negative) e = 1.602 - 107!°C, and a volt is defined as 1V = 1J/C. So the 
conversion from eV to joules is’ 


1 eV = (1.602 - 107? C)(1J/C) = 1.602- 107" J. (12.49) 


Therefore, in terms of eV, the rest energy of a proton is 938 - 10° eV. We now 
have the opposite problem of having a large exponent hanging around. But this 
is easily remedied by the prefix “M,” which stands for “mega” or “million.” So 
we finally have a proton rest energy of 


Ey = 938 MeV. (12.50) 


? This is getting a little picky, but “eV” should technically be written as “eV,” because when people 
write “eV,” they actually mean that two things are being multiplied together (in contrast with, for 
example, the “kg” symbol for “kilogram”). One of these things is the electron charge, which is 
usually denoted by e. 


12.5 Force 


You can work out for yourself that an electron has a rest energy of 
Es = 0.511 MeV. The rest energies of various particles are listed in the table 
below. The ones preceded by a “~” are the averages of differently charged parti- 
cles, whose energies differ by a few MeV. These (and the many other) elementary 
particles have specific properties (spin, charge, etc.), but for the present purposes 
they need only be thought of as point objects having a definite mass. 


Particle Rest energy (MeV) 


electron (e) 0.511 
muon (u) 105.7 
tau (1) 1784 
proton (p) 938.3 
neutron (n) 939.6 
lambda (A) 1115.6 
sigma (X) 1193 
delta (A) 1232 
pion (7) x137 
kaon (K) x496 


For higher energies, the prefixes “G” (for “giga,” meaning 10°) and “T” (for 
“tera,” meaning 101°) are used. 

We now come to a slight abuse of language. When particle physicists talk 
about masses, they say things like, “The mass of a proton is 938 MeV.” This, 
of course, makes no sense, because the units are wrong; a mass can’t equal an 
energy. But what they mean is that if you take this energy and divide it by c’, 
then you get the mass. It would truly be a pain to keep saying, “The mass is such- 
and-such an energy, divided by c*.” For a quick conversion back to kilograms, 
you can show that 


1 MeV/c? = 1.783 - 107°? kg. (12.51) 


12.5 Force 
12.5.1 Force in one dimension 


In nonrelativistic physics, Newton’s second law is F = dp/dt, which reduces 
to F = ma if the mass is constant. We’ll carry this law over to relativity and 
continue to write (we’ll just deal with one-dimensional motion for now) 


LW 


F=—., 
(12.52) 
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However, in relativity we have p = ymv, and y can change with time. This 
complicates things, and it turns out that F does not equal ma. But if dp/dt 
and ma are different, why does F equal dp/dt instead of ma? Perhaps the best 
reason arises from the 4-vector formalism in Chapter 13. But another reason is 
that the F in Eq. (12.52) leads to a familiar work—energy theorem, as we’ll see 
in Eq. (12.57). 

To see what form the F in Eq. (12.52) takes in terms of the acceleration 
a = ù, let’s first calculate dy /dt: 


dy df 1 \. w _, aes 
dt da\/joy) Gd—vse” * ene) 


Assuming that m is constant, we therefore have 


Fe d(ymv) 
dt 


= m(ýv + yb) = may (y? + 1) = y3ma. (12.54) 


This doesn’t look as nice as F = ma, but that’s the way it goes. However, F 
correctly reduces to ma in the limit of small speeds (where y œ 1), as it must. 


They said, “F is ma, bar none.” 

What they meant wasn’t quite as much fun. 
It’s dp by dt, 

Which just happens to be 

Good ol’ “ma” when y is 1. 


Consider now the quantity dE/dx, where E is the energy, E = ym. We have 


dE d(1/ VI 02 d 
„eo _ O ymt., (12.55) 
dx dx dx dx 


But v(dv/dx) = (dx/dt)(dv/dx) = dv/dt = a. Therefore, dE/dx = y?ma. 
Combining this with Eq. (12.54) gives 


dE 


F (12.56) 
Note that Eqs. (12.52) and (12.56) take exactly the same form as in nonrelativistic 
physics. The only new thing in relativity is that the expressions for p and E are 
modified. 


REMARK: The result in Eq. (12.56) suggests another way to motivate the E = ym expression. 
The reasoning is exactly the same as in the derivation of nonrelativistic energy conservation in 
Section 5.1. Define F, as we have done, through Eq. (12.52). Then integrate Eq. (12.54) from 
xı to x2 to obtain 


X2 x2 x2 du v2 v2 
f Fax= / (y3ma) dx = f (vom) dx = f ymv dv=ym| , (12.57) 
x] x] x] dx vi vI 
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where we have used the dy = y?v dv relation from Eq. (12.55). We see that if we define the 
energy as E = ym, then the work—energy theorem, f F dx = AE, holds in relativity just as it 
does in Newtonian physics. The only difference is that Æ is ym instead of mv? /2.!0 & 


12.5.2 Force in two dimensions 


In two dimensions, the concept of force becomes a little strange. In particular, 
as we'll see, the acceleration of an object need not point in the same direction as 
the force. We start with 


dp 
F = —. 12.58 
P7 (12.58) 
This is a vector equation. Without loss of generality, we’ll deal with only two 
spatial dimensions. Consider a particle moving in the x direction, and let us apply 


a force, F = (Fy, Fy). The particle’s momentum is 


pS ee (12.59) 
l-v- vy 


Taking the derivative of this, and using the fact that v, is initially zero, we obtain 


_ 4p 
dt |y o 
= ( Ox + Ux (Ux Vx + UyVy) Ùy i Vy (UxÙx + 2) 
VIZ WIZ 7 VT WIE I <0 
. 2 . 
Ux VU Uy 
= = jj - 
„(= ( +) 2) 
(Goa ve) 
=m 7 
(J1—v2)3) VI- 
= m(yrax, yay). (12.60) 


This is not proportional to (ax, ay). The first component agrees with Eq. (12.54), 
but the second component has only one factor of y . The difference comes from the 
fact that y has a first-order change if v, changes, but not if v, changes, assuming 
that v, is initially zero. The particle therefore responds differently to forces in the 
x and y directions. It is easier to accelerate something in the transverse direction. 


10 Actually, this reasoning suggests only that Æ is given by ym up to an additive constant. For all 
we know, E might take the form, E = ym — m, which would make the energy of a motionless 
particle equal to zero. An argument along the lines of Section 12.1.2 is required to show that the 
additive constant is zero. 
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12.5.3 Transformation of forces 


Let a force act on a particle. How are the components of the force in the particle’s 
frame, S’, related to the components of the force in another frame, S?!! Let the 
relative motion be along the x and x’ axes, as shown in Fig. 12.9. In frame S, 
Eq. (12.60) says 


(Fx, Fy) = m(y7ax, yay). (12.61) 


And in frame S’, the y factor for the particle equals 1, so Eq. (12.60) reduces to 
the usual expression, 


(Fi, %) = m(@,,4,). (12.62) 


Let’s now relate these two forces, by writing the primed accelerations on the 
right-hand side of Eq. (12.62) in terms of the unprimed accelerations. 

First, we have a, = yay. This is true because transverse distances are the 
same in the two frames, but times are shorter in S’ by a factor y. That is, dt’ = 
dt/y. We have indeed put the y in the right place here, because the particle 
is essentially at rest in S’, so the usual time dilation holds. Therefore, a, = 
d?y' Jdt? = d?y/(dt/yy’ = y? ay. l 

Second, we have a’. = y? ax. In short, this is true because time dilation brings 
in two factors of y (as in the a, case), and length contraction brings in one. In 
more detail: Let the particle move from one point to another in frame S’, as it 
accelerates from rest in S”. Mark these two points, which are essentially a distance 
a'.(dt')?/2 apart, in S’. As S’ flies past S, the distance between the two marks 
is length contracted by a factor y, as viewed by S. This distance (which is the 
excess distance the particle travels over what it would have traveled if there were 
no acceleration) is what S calls ay (dt)? /2. Therefore, 


l 1/1 dt \* 
5h dt? = 7 (54 a?) => d =ya (5) = yay. (12.63) 
Equation (12.62) may now be written as 
(FF) = m(y?ax, yay). (12.64) 
Comparing Eqs. (12.61) and (12.64) then gives 
F! 
F, =F, ad F=. (12.65) 
i Y 


We see that the longitudinal force is the same in the two frames, but the transverse 
force is larger by a factor of y in the particle’s frame. 


11 To be more precise, S' ’ is the instantaneous inertial frame of the particle. Once the force is applied, 
the particle will accelerate and will therefore no longer be at rest in S”. But for a very small elapsed 
time, the particle will still essentially be in S’. 
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REMARKS: 


1. What if someone comes along and switches the labels of the primed and unprimed frames 
in Eq. (12.65), and concludes that the transverse force is smaller in the particle’s frame? 
He certainly can’t be correct, given that Eq. (12.65) is true, but where is the error? The 
error lies in the fact that we (correctly) used dt’ = dt/y above, because this is the relevant 
expression concerning two events along the particle’s worldline. We are interested in two 
such events, because we want to see how the particle moves. The inverted expression, 
dt = dt'/y, deals with two events located at the same position in S, and therefore has 
nothing to do with the situation at hand. Similar reasoning holds for the relation between 
dx and dx'. Because we are dealing with a given particle, there is indeed one frame that 
is special among all possible frames, namely the particle’s instantaneous inertial frame. 

2. If you want to compare forces in two frames, neither of which is the particle’s rest frame, 
then just use Eq. (12.65) twice and relate each of the forces to the rest-frame forces. It 
quickly follows that for another frame $”, we have F? = Fy, and y”F}? = yF,, where 
the y’s are measured relative to the rest frame S’. & f 


Example (Bead on a rod): A spring with a tension has one end attached to the 
end ofa rod, and the other end attached to a bead that is constrained to move along the 
rod. The rod makes an angle 6’ with respect to the x’ axis, and is fixed at rest in the S” 
frame (see Fig. 12.10). The bead is released and is pulled along the rod by the spring. 
Right after the bead is released, what does the situation look like in the frame, S, of 


someone moving to the left at speed v? In answering this, draw the directions of 


(a) the rod, 
(b) the acceleration of the bead, 
(c) the force on the bead. 


In frame S, does the wire exert a force of constraint? Fig. 12.10 
Solution: In frame S: 


(a) The horizontal span of the rod is decreased by a factor y, due to length con- tang = Ttan 0’ 
traction, and the vertical span is unchanged, so we have tan@ = y tan 6’, as 
shown in Fig. 12.11. 

(b) The acceleration must point along the rod, because the bead always lies on 
the rod, and because the rod moves at constant speed in S. Quantitatively, the 
position of the bead in S takes the form of (x,y) = (vt — ayt? /2, —ayt? /2), by 
the definition of acceleration. The position relative to the starting point on the 
rod, which has coordinates (vt, 0), is then (Ax, Ay) = (—axt? /2, —ayt? /2). 
The condition for the bead to stay on the rod is that the ratio of these coordinates 


equals the slope of the rod in S. Therefore, ay/ax = tan 0, and the acceleration Fig. 12.11 
points along the rod. 
(c) The y component of the force on the bead is decreased by a factor y, by Eq. 
(12.65). Therefore, since the force points at an angle 6’ in S’ (because it points 
along the rod in S’), the angle in S is given by tan ọ = (1/y) tan 6’, as shown 
in the figure. 
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As a double-check that a does indeed point along the rod, we can use Eq. (12.60) 
to write ay/ay = y?Fy/Fx. Then Eq. (12.65) gives ay/ay = yFy/Fy = y tan 6! = 
tan 8, which is the direction of the wire. 

The wire does not exert a force of constraint. The bead doesn’t need to touch the 
wire in S’, so it doesn’t need to touch it in S. There is no need to have an extra force 
to combine with F to make the result point along a, because F simply doesn’t have 
to be collinear with a. 


12.6 Rocket motion 


Up to this point, we have dealt with situations where the masses of our particles 
are constant, or where they change abruptly (as in a decay, where the sum of 
the masses of the products is less than the mass of the initial particle). But in 
many setups, the mass of an object changes continuously. A rocket is the classic 
example of this, so we’ll use the term “rocket motion” to describe the general 
class of problems where the mass changes continuously. 

The relativistic rocket itself encompasses all of the important ideas, so we’ ll 
study that example here. Many more examples are left for the problems. We’ ll 
present three solutions to the rocket problem, the last of which is rather slick. In 
the end, the solutions are all basically the same, but it should be helpful to see 
the various ways of looking at things. 


Example (Relativistic rocket): Assume that a rocket propels itself by con- 
tinually converting mass into photons and firing them out the back. Let m be the 
instantaneous mass of the rocket, and let v be the instantaneous speed with respect 
to the ground. Show that 


=0. 12.66 
m 1—v2 ( ) 


If the initial mass is M, and the initial v is zero, integrate Eq. (12.66) to obtain 


l-v 
l+v- 


m=M (12.67) 
First solution: The strategy of this solution will be to use conservation of momen- 
tum in the ground frame. Consider the effect of a small mass being converted into 
photons. The mass of the rocket goes from m to m + dm (where dm is negative). So in 
the frame of the rocket, photons with total energy Er = —dm (which is positive) are 
fired out the back. In the frame of the rocket, these photons have momentum py = dm 
(which is negative). We’ll drop the c’s here. 

Let the rocket move at speed v with respect to the ground. Then the momentum of 
the photons in the ground frame, pg, can be found via the Lorentz transformation, 


Pg = Y (pr + vEr) = y (dm + v(—dm)) = y(1—v)dm. (12.68) 


This is still negative, of course. 
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REMARK: A common error is to say that the converted mass (—dm) takes the form 
of photons of energy (—dm) in the ground frame. This is incorrect, because although 
the photons have energy (—dm) in the rocket frame, they are redshifted (due to the 
Doppler effect) in the ground frame. From Eq. (11.51), we see that the frequency (and 
hence the energy) of the photons decreases by a factor of /(1 — v)/(1 + v) when going 
from the rocket frame to the ground frame. This factor equals the y(1 — v) factor 
in Eq. (12.68). & 


We can now use conservation of momentum in the ground frame to say that 


(my vVold = YU — v)dm + (myv)new => yl —v)dm+d(myv) = 0. 


(12.69) 
The d(myv) term may be expanded to give 
d(myv) = (dm)yv + m(dy)u + my (dv) 
= yvdm+m(yrudvyu + my dv 
= yvdm + my (y7v> + 1)dv 
= yv dm + my? dv. (12.70) 
Therefore, Eq. (12.69) gives 
0 = y(1 — v) dm + yv dm + my?dv 
= y dm + my? dv. (12.71) 
Hence, 
Bg AO iy (12.72) 


in agreement with Eq. (12.66). We must now integrate this. With the given initial 


values, we have 
m dm v dv 
— + =0. (12.73) 
mm o l-v 


We can look up the dv integral in a table, but let’s instead do it from scratch. z Writing 
1/a- v?) as the sum of two fractions gives 


f dv sf 1 1 
= + dv 
o l-vw 2/Jo l+v 1l-v 


Vv 
= 5(Inci +») -ma —v)) f 
1 l+v 
= e =), (12.74) 


12 Tables often list the integral of 1/(1 — v?) as tanh—!(v). You can show that this is equivalent to 
the result in Eq. (12.74). 
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Equation (12.73) therefore gives 


l () t l+v = M l—v (12.75) 
n = n = ae ; 
M 2° (Izv ON Thee? 


in agreement with Eq. (12.67). This result is independent of the rate at which the 


mass is converted into photons. It is also independent of the frequency of the emitted 
photons. Only the total mass expelled matters. Note that Eq. (12.75) quickly tells us 
that the energy of the rocket, as a function of velocity, is 


l-v M 


k= =yM = ‘ 
KET l+v l+v 


(12.76) 


This has the interesting property of approaching M /2 as v — c. In other words, 
half of the initial energy remains with the rocket, and half ends up as photons (see 
Exercise 12.38). 


REMARK: From Eq. (12.68), or from the previous remark, we see that the ratio of the 
energy of the photons in the ground frame to that in the rocket frame is /(1 — v)/(1 + v). 
This factor is the same as the factor in Eq. (12.75). In other words, the photons’ energy in 
the ground frame decreases in exactly the same manner as the mass of the rocket (assuming 
that the photons are ejected with the same frequency in the rocket frame throughout the 
process). Therefore, in the ground frame, the ratio of the photons’ energy to the mass of 
the rocket doesn’t change with time. There must be a nice intuitive explanation for this, 
but it eludes me. & 


Second solution: The strategy of this solution will be to use F = dp/dt in 
the ground frame. Let t denote the time in the rocket frame. Then in the rocket 
frame, dm/dt is the rate at which the mass of the rocket decreases and is converted 
into photons (dm is negative). The photons therefore acquire momentum at the rate 
dp/dt = dm/dt in the rocket frame. Since force is the rate of change in momentum, 
we see that a force of dm/dt pushes the photons backward, and so an equal and 
opposite force of F = —dm/dt pushes the rocket forward in the rocket frame. 

Now go to the ground frame. We know from Eq. (12.65) that the longitudinal force 
is the same in both frames, so F = —dm/dT is also the force on the rocket in the 
ground frame. And since dt = ydt, where t is the time on the ground (the photon 
emissions occur at the same place in the rocket frame, so we have indeed put the 
time-dilation factor of y in the right place), we have 


dm 
Y` 


(12:77) 


REMARK: We can also calculate the force on the rocket by working entirely in the ground 
frame. Consider a mass (—dm) that is converted into photons. Initially, this mass is traveling 
along with the rocket, so it has momentum (—dm)yv. After it is converted into photons, it 
has momentum y(1 — v) dm (from the first solution above). The change in momentum is 
therefore y (1 — v) dm—(—dm)yv = y dm. Since force is the rate of change in momentum, 
a force of y dm/dt pushes the photons backward, and so an equal and opposite force of 
F = —y dm/dt pushes the rocket forward. &æ 


Now things get a little tricky. It is tempting to write down F = dp/dt = d(myv)/dt, 
which gives F = (dm/dt)yv + md(yv)/dt. This, however, is incorrect, because the 
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dm/dt term isn’t relevant here. When the force is applied to the rocket at an instant 
when the rocket has mass m, the only thing the force cares about is that the mass of 
the rocket at the given instant is m. It doesn’t care that m is changing.!° Therefore, 
the correct expression we want is 


d(yv) 
mMm 


12.78 
i (12.78) 


As in the first solution above, or in Eq. (12.54), we have d(yv)/dt = y? dv/dt. Using 
the F from Eq. (12.77), we arrive at 
dm 3 dv 


-y— =m 


am 12.79 
d "Y d oe 


which is equivalent to Eq. (12.71). The solution proceeds as above. 


Third solution: The strategy of this solution will be to use conservation of energy 
and momentum in the ground frame, ina slick way. Consider a clump of photons fired 
out the back. The energy and momentum of these photons are equal in magnitude 
and opposite in sign (with the convention that the photons are fired in the negative 
direction). By conservation of energy and momentum, the same statement must be 
true about the changes in energy and momentum of the rocket. That is, 


d(ym) =—d(ymv) => d(ym+ymv) =0. (12.80) 


Therefore, ym(1 + v) is a constant. We are given that m = M when v = 0. Hence, 
the constant equals M. Therefore, 


l-v 
1+ 


ymi+v)=M => m=M (12.81) 


Now, that’s a quick solution, if there ever was one! 


12.7 Relativistic strings 


Consider a “massless” string with a tension that is constant, that is, independent 


of length.'* We call such objects relativistic strings, and we will study them for 


two reasons. First, these strings, or reasonable approximations thereof, actually 
do occur in nature. For example, the gluon force which holds quarks together is 
approximately constant over distance. And second, they open the door to a whole 
new class of setups we can study, such as the two below. Relativistic strings might 


'3 Said in a different way, the momentum associated with the ejected mass still exists. It’s just that 


it’s not part of the rocket anymore; it’s in the photons. This issue is expanded on in Appendix C. 
14 By “massless,” we mean that the string has no mass in its unstretched (that is, zero-length) state. 
Once it is stretched, it will have energy in its rest frame, and hence mass. 
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seem a bit strange, but any one-dimensional problem involving them basically 
comes down to the two equations, 


dp dE 
F=2 P=, 12.82 
o dx 282) 


Example (Mass connected to a wall): A mass m is connected to a wall by a 
relativistic string with tension 7. The mass starts next to the wall and has an initial 
speed v away from it (see Fig. 12.12). How far from the wall does the mass get? How 
much time does it take to reach this point? 


Solution: Let £ be the maximum distance from the wall. The initial energy of 
the mass is E = ym. The final energy at x = £ is simply m, because the mass is 
instantaneously at rest there. Integrating F = dE/dx, and using the fact that the force 
always equals —T, gives 


my — 1) 


FAx=AE = (-T)l=m-ym = l= 7 


(12.83) 
Let ż be the time it takes to reach this point. The initial momentum of the mass is 
p = ymv. Integrating F = dp/dt, and using the fact that the force always equals 
—T, gives 


ymv 


FAt=Ap => (-T)t=0-ymv => t=} (12.84) 


Note that we cannot use F = ma to do this problem. F does not equal ma. It equals 
dp/dt (and also dE/dx). 


Example (Where the masses meet): A relativistic string of length £ and ten- 
sion T connects a mass m and a mass M (see Fig. 12.13). The masses are released 
from rest. Where do they meet? 


Solution: Let the masses meet at a distance x from the initial position of m. At this 
meeting point, F = dE/dx tells us that the energy of m is m+ Tx, and the energy of M 
is M + T(€ — x). Using p = y E2 — m? we see that the magnitudes of the momenta 
at the meeting point are 


Pm = (m+ Tx)2—m2 and py = jm +7(€—x))2—M2. (12.85) 


But F = dp/dt tells us that these must be equal, because the same force (in magnitude, 


but opposite in direction) acts on the two masses for the same time. Equating the above 
p’s yields 


ew MF TE) | (12.86) 
M+m4+Te 
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This result is reassuring, because it is the location of the initial center of mass, with 
the string being treated (quite correctly) like a stick of length £ and mass T£ (divided 
by c2). 


Remark: Let’s check a few limits. In the limit of large T or £ (more precisely, in the 
limit T£ >> Mc? and T£ >> mc?), we have x = £/2. This makes sense, because in this 
case the masses are negligible and therefore both move at essentially speed c, and hence 
meet in the middle. In the limit of small T or £ (more precisely, in the limit Tl « Mc? 
and T£ < mc”), we have x = M£/(M + m), which is simply the Newtonian result for an 
everyday-strength massless spring. & 


12.8 Problems 


Section 12.1: Energy and momentum 


12.1. Deriving E and p «x 


Accepting the facts that the energy and momentum ofa photon are E = hv 
and p = hv/c (where v is the frequency of the light wave, and A is Planck’s 
constant), derive the relativistic formulas for the energy and momentum 
of a massive particle, E = ymc? and p = ymv. Hint: Consider a mass m 
that decays into two photons. Look at this decay in the rest frame of the 
mass, and then in a frame where the mass has speed v. You’! need to use 
the Doppler effect. 


Section 12.3: Collisions and decays 


12.2. Colliding photons « 
Two photons each have energy E. They collide at an angle 6 and create 
a particle of mass M. What is M? 

12.3. Increase in mass x 


A large mass M, moving at speed V, collides and sticks to a small mass 
m, initially at rest. What is the mass of the resulting object? Work in the 
approximation where M > m. 


12.4. Two-body decay « 
A stationary mass M4 decays into masses Mg and Mc. What are the 
energies of Mg and Mc? What are their momenta? 

12.5. Threshold energy x» 


A particle of mass m and energy E collides with an identical stationary 
particle. What is the threshold energy for a final state containing N par- 
ticles of mass m? (“Threshold energy” is the minimum energy for which 
the process can occur.) 
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12.6. 


Section 


12.8. 


12.9, 


Head-on collision +» 


A ball of mass M and energy E collides head-on elastically with 
a stationary ball of mass m. Show that the final energy of mass 
M is 


= 2mM2 + E(m + M?) 
~ 2Em+m +M? ` 


(12.87) 


Hint: This problem is a little messy, but you can save yourself a lot of 
trouble by noting that £’ = E must be a root of the equation you get for 
E'. (Why?) As a reward for trudging through the mess, there are lots of 
interesting limits you can take. 


. Compton scattering ++ 


A photon collides with a stationary electron. If the photon scatters at an 
angle 6 (see Fig. 12.14), show that the resulting wavelength, A’, is given 
in terms of the original wavelength, A, by 


l 
ee eee ee (12.88) 
mc 


where m is the mass of the electron. Note: The energy of a photon is 
E = wv =he/2. 


12.5: Force 


System of masses «x 


Consider a dumbbell made of two equal masses, m. The dumbbell spins 
around, with its center pivoted at the end of a stick (see Fig. 12.15). Ifthe 
speed of the masses is v, then the energy of the system is 2ym. Treated 
as a whole, the system is at rest. Therefore, the mass of the system must 
be 2ym. (Imagine enclosing it in a box, so that you can’t see what’s 
going on inside.) Convince yourself that the system does indeed behave 
like a mass of M = 2ym, by pushing on the stick (when the dumbbell 
is in the “transverse” position shown in the figure) and showing that 
F = dp/dt = Ma. 


Relativistic harmonic oscillator x» 


A particle of mass m moves along the x axis under a force F = —mwx. 


The amplitude is b. Show that the period is given by 


2: 
dx, where y= 1+ S(b? x). (12.89) 
2c? 


pen 4 
cJo yy?—1 


12.8 Problems 


Section 12.6: Rocket motion 


12.10. 


12.11. 


12.12. 


12.13. 


12.14. 


12.15. 


Relativistic rocket xx 


Consider the relativistic rocket in Section 12.6. Let mass be converted 
into photons at a rate o in the rest frame of the rocket. Find the time 
t in the ground frame as a function of v. (Alas, it isn’t possible to 
invert this, to obtain v as a function of t.) You’ll need to evaluate a 
slightly tricky integral. Pick your favorite method — pencil, book, or 
computer. 


Relativistic dustpan I « 


A dustpan of mass M is given an initial relativistic speed. It gathers up 
dust with mass density À per unit length on the floor (as measured in the 
lab frame). At the instant the speed is v, find the rate (as measured in 
the lab frame) at which the mass of the dustpan-plus-dust-inside system 
is increasing. 


Relativistic dustpan II x» 


Consider the setup in Problem 12.11. If the initial speed of the dustpan 
is V, find v(x), v(t), and x(t). All quantities here are measured with 
respect to the lab frame. 


Relativistic dustpan III xx 


Consider the setup in Problem 12.11. Calculate, in both the dustpan 
frame and the lab frame, the force on the dustpan-plus-dust-inside sys- 
tem (due to the newly acquired dust particles smashing into it) as a 
function of v, and show that the results are equal. 


Relativistic cart I +++» 


A long cart moves at relativistic speed v. Sand is dropped into the cart 
at a rate dm/dt = o in the ground frame. Assume that you stand on the 
ground next to where the sand falls in, and you push on the cart to keep 
it moving at constant speed v. What is the force between your feet and 
the ground? Calculate this force in both the ground frame (your frame) 
and the cart frame, and show that the results are equal. 


Relativistic cart II xxx 


A long cart moves at relativistic speed v. Sand is dropped into the cart at 
arate dm/dt = o in the ground frame. Assume that you grab the front 
of the cart and pull on it to keep it moving at constant speed v (while 
running with it). What force does your hand apply to the cart? (Assume 
that the cart is made of the most rigid material possible.) Calculate this 
force in both the ground frame and the cart frame (your frame), and 
show that the results are equal. 
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Section 12.7: Relativistic strings 


12.16. 


12.17. 


12.18. 


12.19. 


Different frames «« 


(a) Two masses m are connected by a string of length £ and constant 
tension T. The masses are released simultaneously, and they 
collide and stick together. What is the mass, M, of the resulting 
blob? 

(b) Consider this scenario from the point of view of a frame moving 
to the left at speed v (see Fig. 12.16). The energy of the resulting 
blob must be yMc?, from part (a). Show that you obtain this 
same result by computing the work done on the two masses. 


Splitting mass xx 

A massless string with constant tension T has one end attached to a wall 
and the other end attached to a mass M. The initial length of the string 
is £ (see Fig. 12.17). The mass is released. Halfway to the wall, the 
back half of the mass breaks away from the front half (with zero initial 
relative speed). What is the total time it takes the front half to reach 
the wall? 


Relativistic leaky bucket xxx 


Let the mass M in Problem 12.17 be replaced by a massless bucket 
containing an initial mass M of sand (see Fig. 12.18). On the way 
to the wall, the bucket leaks sand at a rate dm/dx = M/€, where 
m denotes the mass at later positions. Note that dm and dx are both 
negative here. 


(a) What is the energy of the bucket, as a function of distance from 
the wall? What is its maximum value? What is the maximum 
value of the kinetic energy? 

(b) What is the momentum of the bucket, as a function of distance 
from the wall? Where is it maximum? 


Relativistic bucket xxx 


(a) A massless string with constant tension T has one end attached to 
a wall and the other end attached to a mass m. The initial length 
of the string is £ (see Fig. 12.19). The mass is released. How long 
does it take to reach the wall? 

Let the string now have length 22, with a mass m on the end. 
Let another mass m be positioned next to the £ mark on the 
string, but not touching it (see Fig. 12.20). The right mass is 
released. It heads toward the wall (while the left mass remains 
motionless) and then sticks to the left mass to make one large 


(b 


~ 


12.9 Exercises 


blob, which then heads toward the wall.'!° How much time does 
this whole process take? Hint: You can solve this in various 
ways, but one method that generalizes nicely for part (c) is to 
show that the change in p? from the start to a point right before 
the wall is A(p*) = (Eż — E?) + (Ei — E2), where the energies 
of the moving object (that is, the initial m or the resulting blob) 
are: E; right at the start, E> just before the collision, £3 just 
after the collision, and £4 right before the wall. Note that this 
method doesn’t require knowledge of the mass of the blob (which 
is not 2m). 

(c) Let there now be N masses and a string of length N£ (see 
Fig. 12.21). How much time does this whole process take? 

(d) Consider now a massless bucket at the end of the string (of 
length L) that gathers up a continuous stream of sand (of total 
mass M) as it gets pulled to the wall (see Fig. 12.22). How much 
time does this whole process take? What are the mass and speed 
of the bucket right before it hits the wall? 


12.9 Exercises 


Section 12.2: Transformations of E and p 


12.20. 


12.21. 


12,22. 


Energy of two masses « 


Two masses M move at speed V, one to the east and one to the west. 
What is the total energy of the system? Now consider the setup as 
viewed in a frame moving to the west at speed u. Find the energy of 
each mass in this frame. Is the total energy larger or smaller than the 
total energy in the lab frame? 


System of particles x 


Given Ptotal and Etota] for a system of particles, use a Lorentz transfor- 
mation to find the velocity of the CM. More precisely, find the speed 
of the frame in which the total momentum is zero. 


CM frame +» 
A mass m travels at speed 3c/5, and another mass m sits at rest. 


(a) Find the energy and momentum of the two particles in the lab 
frame. 

(b) Find the speed of the CM of the system, by using a velocity- 
addition argument. 


15 The left mass could actually be attached to the string, and we would still have the same situation. 


The mass wouldn’t move during the first part of the process, because there would be equal tensions 
T on both sides of it. 
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(c) Find the energy and momentum of the two particles in the CM 
frame, without using the Lorentz transformations. 
(d) Verify that the Z’s and p’s are related by the relevant Lorentz 
transformations. 
(e) Verify that E? — p*c* for each mass is the same in both frames. 
Likewise for EŽ ial = Porac? 
12.23. Transformations for 2-D motion +x 


A particle has velocity (u4, u,) in frame S’, which travels at speed v in 


the x direction relative to frame S. Use the velocity-addition formulas 
in Section 11.5.2 (Eqs. (11.36) and (11.38)) to show that 


Yu = Yu Yo (l + ulv), where u = ,/u2 + u? and u = ,/u2 + u? 
(12.90) 


are the speeds in the two frames. Then verify that E and py transform 
according to Eqs. (12.27), and also that p, = Py- 


Section 12.3: Collisions and decays 


12.24. Photon and mass collision « 
A photon with energy E collides with a stationary mass m. They combine 
to form one particle. What is the mass of this particle? What is its 
speed? 

12.25. A decay x 


m y 120° A stationary mass M decays into a particle and a photon. If the speed 
120° of the particle is v, what is its mass? What is the energy of the photon? 


12.26. Three photons « 


A mass m moves at speed v. It decays into three photons, one of which 


Fig. 12.23 
i travels in the forward direction, and the other two of which move at 
angles of 120° (in the lab frame) as shown in Fig. 12.23. What are the 
E M energies of these three photons? 
C a e@ 
J 12.27. Perpendicular photon « 
A photon with energy E collides with a mass M. The mass M scatters 
at an angle. If the resulting photon moves perpendicular to the incident 
photon’s direction, as shown in Fig. 12.24, what is its energy? 
ss, 12.28. Another perpendicular photon « 
ou A mass m moving at speed 4c/5 collides with another mass m at rest. 
Fig. 12.24 The collision produces a photon with energy Æ traveling perpendicular 


to the original direction, and a mass M traveling in another direction, 


12.29. 


12.30. 


12.31. 


12.9 Exercises 


as shown in Fig. 12.25. In terms of E and m, what is M? What is the 
largest value of Æ (in terms of m) for which this setup is possible? 


Decay into photons « 


Amass m moving at speed v decays into two photons. One photon moves 
perpendicular to the original direction, and the other photon moves off 
at an angle 0, as shown in Fig. 12.26. Show that if tan@ = 1/2, then 
v/c = (V5 — 1)/2, which just happens to be the inverse of the golden 
ratio. 


Maximum mass « 


A photon and a mass m move directly toward each other. They collide 
head-on and create a new particle. If the total energy of the system is 
E, how should it be divided between the photon and the mass m so that 
the mass of the resulting particle is as large as possible? 


Equal angles « 


A photon with energy E collides with a stationary mass m. If the mass m 
and the resulting photon (with unknown energy) scatter at equal angles 
0 with respect to the initial photon direction, as shown in Fig. 12.27, 
what is 6 in terms of E and m? What is 6 in the limit E < mc?? 


Section 12.4: Particle-physics units 


12.32. 


12.33. 


12.34. 


Pion—muon race « 


A pion and a muon have a 100m race. If they both have an energy of 
10 GeV, by how much distance does the muon win? 


Higgs production x 


The Higgs boson is a proposed elementary particle that should be exper- 
imentally detected in a few years, assuming it exists. One strategy 
for producing it in a high-energy particle accelerator is to collide pro- 
tons and antiprotons together. Taking the rest energy of a proton (and 
antiproton) to be about 1 GeV, and assuming that the rest energy of the 
Higgs is about 100 GeV, how much energy is required to produce the 
Higgs if: 

(a) A moving proton collides with a stationary antiproton? 

(b) A proton and antiproton have equal and opposite momenta? 


Maximum energy +» 


(a) A particle of mass M decays into a number of particles, some of 
which may be photons. If one of the particles has mass m, and 


Fig. 12.26 
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if the sum of the masses of all the other products is u, what is 
the maximum possible energy that m can have? Hint: Write the 
conservation of energy and momentum statements as Py — Pm = 
P, where P, is the total 4-momentum of the other products, and 
then square. The technique of Problem 12.5 may be useful. 

(b) In beta decay, a neutron decays into a proton, an electron, and a 
neutrino (which is essentially a photon, for the present purposes). 
The rest energies are En = 939.6 MeV, Ep = 938.3 MeV, Ee = 
0.5 MeV, and Ep ~ 0. What is the maximum energy that the 
electron can have? The neutrino? Interpret your results. 


Section 12.5: Force 


12.35. 


12.36. 


12.37. 


Force and a collision x 


Two identical masses m are initially at rest, a distance x apart. Aconstant 
force F accelerates one of them toward the other until they collide and 
stick together. What is the mass of the resulting particle? 


Pushing on a mass «x 


(a) A mass m starts at rest. You push on it with a constant force F. 
How much time ¢ does it take for the mass to move a distance x? 
(Both ¢ and x here are measured in the lab frame.) 

(b) After a very long time, m’s speed will approach c. It turns out that 
it approaches c sufficiently fast so that after a very long time, m 
will remain (approximately) a constant distance (as measured in 
the lab frame) behind a photon that was emitted at t = 0 from 
the starting position of m. What is this distance? 


Momentum paradox «*«« 


Two equal masses are connected by a massless string with tension T. 
The masses are constrained to move with speed v along parallel lines, as 
shown in Fig. 12.28. The constraints are then removed, and the masses 
are drawn together. They collide and make one blob which continues to 
move to the right. Is the following reasoning correct? If your answer is 
“no,” state what is invalid about whichever of the four sentences is/are 
invalid. 

“The forces on the masses point in the y direction. Therefore, there 
is no change in the momentum of the masses in the x direction. But the 
mass of the resulting blob is greater than the sum of the initial masses 
(because they collide with some relative speed). Therefore, the speed 
of the resulting blob must be less than v (to keep p, constant), so the 
whole apparatus slows down in the x direction.” 
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Section 12.6: Rocket motion 


12.38. Rocket energy x« 


As mentioned near the end of the first solution to the rocket problem 
in Section 12.6, the energy of the rocket in the ground frame equals 
M/(1 + v). Derive this result again, by integrating up the amount of 
energy that the photons have in the ground frame. 


Section 12.7: Relativistic strings 


12.39. Two masses « 


A mass m is placed right in front of an identical one. They are connected 
by arelativistic string with tension 7. The front mass suddenly acquires 
a speed 3c/5. How far from the starting point will the masses collide 
with each other? 


12.40. Relativistic bucket xx 


One of the results in part (d) of Problem 12.19 is that the bucket moves 
toward the wall at constant speed ./7/(T + p). Derive this again, 
without using the technique of taking the N — oo limit of many 
masses. 


12.10 Solutions 


12.1. Deriving E and p 


Let’s derive the energy formula, E = ymc?, first. Let the given mass decay into 
two photons, and let Eo be the energy of the mass in its rest frame. Then each of the 
resulting photons has energy Eo/2 in this frame. 

Now look at the decay in a frame where the mass moves at speed v. From 
Eq. (11.51), the frequencies of the photons are Doppler-shifted by the factors 
Jd + v)/0 = v) and /(1 = v)/(1 + v). Since the energies of the photons are given 
by E = hv, they are shifted by the same Doppler factors, relative to the Eo/2 value 
in the original frame. The total energy of the photons in the frame where the mass 
moves at speed v is therefore 


E aj , E Eo = VE (12.91) 
~OVien ZViGe woe ° 


By conservation of energy, this is the energy of the mass m moving at speed v. So we 
see that a moving mass has an energy that is y times its rest energy. 

We can now use the correspondence principle (which says that relativistic formulas 
must reduce to the familiar nonrelativistic ones in the nonrelativistic limit) to find 
Eo in terms of m and c. We just found that the difference between the energies of a 
moving mass and a stationary mass is y Eo — Eo. This must reduce to the familiar 
kinetic energy, mv? /2, in the limit v «< c. In other words, 


mv? Eo 


x panl -0-22 (12.92) 
2 ~ JS Ae 00A T 20 =e] 2’ : 


where we have used the Taylor series, 1/./1 = € © 1 + €/2. Therefore Ey = mc’, 
and so E = ymc*. 
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12.2. 


12.3. 


We can derive the momentum formula, p = ymv, in a similar way. Let the mag- 
nitude of the photons’ (equal and opposite) momenta in the particle’s rest frame be 
po/2.'° Since the photons’ momenta are given by E = hv/c, we can use the Doppler- 
shifted frequencies as we did above to say that the total momentum of the photons in 
the frame where the mass moves at speed v is 


Po /li+v po /l—-v 
aot Spas, 12.93 
P= aVi-w 2Viev P” (12:23) 


Putting the c’s back in, we have p = ypov/c. By conservation of momentum, this is 
the momentum of the mass m moving at speed v. 

We can now use the correspondence principle to find po in terms of m and c. If 
p = y (po/c)v is to reduce to the familiar p = mv result in the limit v < c, then we 
must have pọ = mc. Therefore, p = ymv. 


Colliding photons 
The 4-momenta of the photons are (see Fig. 12.29) 


Py, =(E,E,0,0), and P, = (E, E cos90, E sin 0,0). (12.94) 


Energy and momentum are conserved, so the 4-momentum of the final particle is 
Py = (2E,E + E cos9, E sin 0,0). Therefore, 


M? = Py - Py = QEY — (E + E cos 0} — (E sin 0)?, (12.95) 
which gives 
M = E,/2(1 — cos 0). (12.96) 


If0 = 180° then M = 2E, as it should (none of the final energy is kinetic). And if 
0 = 0° then M = 0, as it should (all of the final energy is kinetic; we simply have a 
photon with twice the energy). 

Increase in mass 


In the lab frame, the energy of the resulting object is yM + m, and the momentum is 
yMV. The mass of the object is therefore 


M' = [YM +m)? — (MV)? = JM? +2yMm+ m2. (12.97) 
The m? term is negligible compared with the other two terms, so we may approximate 
M' as 
_ 2ym 


Mx MJ1+ ~M(I a) =M +ym, (12.98) 


where we have used the Taylor series, V1 + € œ~ 1 + €/2. Therefore, the increase 
in mass is y times the mass of the stationary object. This increase is greater than the 
nonrelativistic answer of “m,” because heat is generated during the collision, and this 
heat shows up as mass in the final object. 


REMARK: The ym result is clear if we work in the frame where M is initially at rest. 
In this frame, the mass m comes flying in with energy ym, and then essentially all 
of this energy shows up as mass in the final object. That is, essentially none of it 
shows up as overall kinetic energy of the object. This negligible-kinetic-energy result 
is a general one whenever a small object hits a stationary large one. It follows from 
the fact that the speed of the large object is proportional to m/M, by momentum 


'6 With the given information that a photon has E = Av and p = hv/c, we can use the preceding 


Eo = mc? result to quickly conclude that pọ = mc. But let’s pretend that we haven’t found Eo 


yet. This will give us an excuse to use the correspondence principle again. 


12.4. 


12.5. 


12.10 Solutions 


conservation (there’s a factor of y if things are relativistic), so the kinetic energy 
goes like Mv? x M(m/M)* ~ 0, if M > m. In other words, the smallness of v 
wins out over the largeness of M. When a snowball hits a tree, all (essentially) of 
the initial energy goes into heat. None of it goes into changing the kinetic energy of 
the earth, & 


Two-body decay 
B and C have equal and opposite momenta. Therefore, 


EŻ — Mg = pP’ = E? — MŽ. (12.99) 
Also, conservation of energy gives 
Eg + Ec = M4. (12.100) 


Solving the two previous equations for Eg and Ec gives (using the shorthand 
a = M4, etc.) 


2 2 2 2 2 2 
b-c —b 
ppa TEOT ae pes EEC, (12.101) 
2a 2a 
Equation (12.99) then gives the momentum of the particles as 
1 
r=; Vat + bt + c4 — 2a2b? — 2a2c2 — 26202, (12.102) 
a 
REMARK: It turns out that the quantity under the radical can be factored into 
(a+tb+c\(atb—c)\(a—b+c)(a—b—c). (12.103) 


This makes it clear that ifa = b+ c, then p = 0, because there is no leftover energy for the 
particles to be able to move. Interestingly, the form of p in Eq. (12.102) looks very much 
like the area of a triangle with sides a, b, c, which is given by Heron’s formula as 


1 
A= gV H 2a2c2 +2b2c2 —at — 4 —c4. & (12.104) 


Threshold energy 


The initial 4-momenta in the lab frame are 


(E,p,0,0), and (m,0,0,0), (12.105) 


where p = y E2 — m2. Therefore, the total 4-momentum of the final particles in 
the lab frame is (E +m, p, 0, 0). The quantity E al = Prt a] 1S frame independent, 
and it equals the square of the energy in the CM frame (where p = 0). So we 


have, with the subscript “f” for “final,” 
(E + m}? — (VE? = m2)* = (ERM)? 
= 2Em+2m = (EMY. (12.106) 


We see that minimizing £ is equivalent to minimizing Ee, But EM is clearly 
minimized when all the final particles are at rest in the CM frame, so that there 
is no kinetic energy added to the rest energy. So the energy in the CM frame at 
threshold is simply the sum of the rest energies. In other words, Ee = Nm. We 
therefore have 


N2 
2Em+2m? = (Nm? => E= (> = ) m. (12.107) 
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12.6. 


Since there is no relative motion among the final particles in the CM frame 
at threshold there is also no relative motion in any other frame. This means 
that at threshold the N masses travel together as a blob in the lab frame. The 
threshold Æ is larger than the naive answer of (N —1)m, because the final state has 
inevitable “wasted” energy in the form of kinetic energy (which is necessitated 
by conservation of momentum) in the lab frame. Note that E œ N? for large N. 


Head-on collision 
The 4-momenta before the collision are 


Pu = (E,p, 0, 0), Pm = (m, 0, 0, 0), (12.108) 


where p = y E2 — M2. The 4-momenta after the collision are 


Py = (E’,p’, 0,0), Pii = (we won’t need this), (12.109) 


where p' = yE'2 — M2. Conservation of energy and momentum give Py + 
Pm = Piy + Pin. Therefore, 


P2 = (Py + Pm — Py)? (12.110) 
=> PO = Pi, + Po + Piz + 2Pm- (Pu — Ply) — 2Pu Ply 
=> m =M? +m? +M? + 2m(E — E') — 2(EE’ — pp’) 


=> —pp' = M? — EE! +m(E-E’) 


2: 2 
= 0=(W? -EE +mE-£)) — (VE? - M?VE? — M?) 
=> 0 = M?(E? —2EE' +87) +2(M? — EE')m(E — E’) +m (E-E? 


As claimed, E’ = E is a root of this equation, as it must be, because E’ = E and 
p’ = p certainly satisfy conservation of energy and momentum with the initial 
conditions, by definition. Dividing through by (E — E’) gives 


M?(E — E') + 2m(M? — EE’) + m?(E — E’ = 0. (12.111) 


Solving for E’ gives the desired result, 


jiz 2mM? + E(m? + M?) 


2Em + m? + M? a 


REMARK: Let’s look at some limits: 

1. E ~ M (barely moving): then E’ ~ M, because M is still barely moving. 

2. M = m: then E’ = M, because M stops, and m picks up all the energy that 
M had. 

3. m > E (> M) (brick wall): then E’ ~ E, because the heavy mass m picks up 
essentially no energy. 

4. (E >) M > mand M? >» Em: then E’ ~ E, because it’s essentially like m 
isn’t there. 

5. (E >) M > m but Em > M?: then E’ ~ M?/2m. This isn’t obvious, but it’s 
interesting that it doesn’t depend on E. This means that no matter how fast you 
throw a big object at a small one (head-on), the big one will always (as long 
as you throw it hard enough) end up with the same energy, M?/2m. And since 
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we’re assuming M? < Em, this resulting energy is much less than Æ. So most 
of the (very large) initial energy ends up in m. 

6. E >> m` M: then E’ ~ m/2. This isn’t obvious, but it’s similar to an analogous 
limit in the Compton scattering in Problem 12.7. As above, no matter how fast 
you throw a small object at a big one (head-on), the small one will always (as 


long as you throw it hard enough) end up with the same energy, m/2. & X, m 
a i B 
12.7. Compton scattering 
The 4-momenta before the collision are (see Fig. 12.30) y % 
he he pA 
Py = 57 708 » Pm = (mc*,0,0,0). (12.113) M 


The 4-momenta after the collision are 


, he he he i / , 7 em 
Py = are aan edn , P = (we won't need this). (12.114) 


Fig. 12.30 


If we wanted to, we could write P’, in terms of its momentum and scattering 
angle. But we’re not concerned with these quantities, and the nice thing about 
the following method is that we don’t need to introduce them. Conservation of 
energy and momentum give Py + Pm = P, + Pp: Therefore, 


PR = (Py + Pm — Py? 
=> Ph =P. +P? +P? + 2Pm- (Py — Py) — 2Py Pi, 


he he he he 
2 4 2 4 2 
mc +m c +0+ 2mc (5 ) y 


(1 — cos 0). 
(12.115) 


Canceling the mc* terms and multiplying through by AA’/ (2hmc?) gives the 
desired result, 


h 
A =à + — (1 —cos8). (12.116) 
mc 


The nice thing about this solution is that all the unknown garbage in P% 
disappeared when we squared it. 


Remark: Let’s look at some limits: 


1. If 6 © 0 (that is, not much scattering), then 4’ ~ À, as expected. 

2. If = x (that is, backward scattering), then A’ = A + 2h/mc. 

3. If6 = x andadditionally à « h/mc (thatis, mc? « he/A = E,,, so the photon’s 
energy is much larger than the electron’s rest energy), then 2’ ~ 2h/mc, so 


he he 1 4 
mc’. (12.117) 


E =>% = 
Y X  2h/me 2 
Therefore, the photon bounces back with an essentially fixed Z’,, independent 
of the initial E, (as long as Ey is large enough). This isn’t obvious. In fact, it’s 
not even obvious that the photon can bounce straight back. You might think that 
if it has enough energy, it will end up moving forward along with the electron. 
However, in the CM frame, the photon bounces backward in a head-on collision. 
So it must bounce backward in every frame, because a photon’s direction can’t 
switch when going from one frame to another. & 
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System of masses 

Let the speed of the stick go from zero to €, where € < v. Then the final speeds 
of the two masses are obtained by relativistically adding or subtracting € from v. 
(Assume that the time involved is small, so that the masses are still essentially 
moving horizontally.) Repeating the derivation leading to Eq. (12.26), we see that 
the final momenta of the two masses have magnitudes yy ye (v + €)m. But since 
€ is small, we can set ye œ 1, to first order. Therefore, the forward-moving mass 
has momentum yy(v + €)m, and the backward-moving mass has momentum 
—y (v — €)m. So the net increase in momentum is Ap = 2yme, where y = yy. 
Hence, 


pe ey, d Me (12.118) 
= — = aE ma = ” 4 
Ag Ag Y 
Relativistic harmonic oscillator 
F = dp/dt gives —mw*x = d(myv)/dt. Using Eq. (12.54), we have 
dv 
2 3 
— =y’. 12.119 

F ( ) 
We must somehow solve this differential equation. A helpful thing to do is to 
multiply both sides by v (which is equivalent to rewriting dv/dt as v dv/dx) 
to obtain —w?xx = yvi. But from Eq. (12.53), the right-hand side of this is 
dy /dt. Integration then gives —w*x?/2 + C = y, where C is a constant of 
integration. We know that y = 1 when x = b, so we find 


w2 
y=1+ sae — x), (12.120) 
C 


where we have put the c’s back in to make the units right. The period is given by 


b dx 
r=af a (12.121) 
0 v 


But y = 1/y 1 — v2 /c2, which gives v = cy y2 — 1/y. Therefore, 


(12.122) 


J4 y 
T=-] ——a&. 
c Jo /y2—1 


REMARK: Inthe limit wb « c (so that y ~ 1, from Eq. (12.120), which means 
that the speed is always small), we must recover the Newtonian limit. And indeed, to 
lowest order, y? xi+ (w/c?) (b? — x), so Eq. (12.122) gives 


Tx T dx (12.123) 
c Jo (w/e)Vb= -x2 f 


This is the correct result, because conservation of energy for a nonrelativistic spring 
gives 


1 1 
a x) = sme a (b? — x?) =v". (12.124) 


Using this v in the general expression for T given in Eq. (12.121) yields 
Eq. (12.123). & 


12.10. 


12.11. 


12.10 Solutions 


Relativistic rocket 

The relation between m and v obtained in Eq. (12.67) is independent of the rate 
at which mass is converted into photons. The point of this problem is to assume 
a certain rate, in order to obtain a relation between v and t. 


In the frame of the rocket, we are given dm = —o dt. The usual time-dilation 
effect gives dt = y dt, so we have dm = —(o/y) dt in the ground frame. 
Differentiating Eq. (12.67) to obtain another expression for dm, we have 

—Md 
dm = ? (12.125) 


CERNE 


Equating the two expressions for dm gives 


t odt v dv 
f = x (12.126) 
o M 0 A +v) -— v2) 


We could use a computer to do this dv integral, but let’s do it from scratch. Using 
a few partial-fraction tricks, we have 


dv = dv 
lamn a l n esis 
1 1 1 dv 
Sh 
1 dv 1 1 1 
aati)” 
1 


= pip t 12.127) 
=- 4 tao es 


Equation (12.126) therefore gives 


ot 1 1 i l+v v 1 l+v 
= + In = + In ‘ 
M 2 21+v) 4 l-v 21+v) 4 l-v 


(12.128) 


Remarks: Ifv < 1 (or rather, ifv < c), we can Taylor-expand the two terms in 
Eq. (12.128) to obtain ot/M ~ v, which can be written as o ~ M(v/t) = Ma. But o 
equals the force acting on the rocket (or rather oc, to make the units correct), because 
—o is the rate of change in momentum of the photons (since their momentum is 
p = —E/c = —(dmc’)/c). We therefore obtain the expected nonrelativistic F = ma 
equation. 

Ifv = 1 — e, where € is very small (that is, if v is very close to c), then we can 
make approximations in Eq. (12.128) to obtain € ~ (2e)e~4°'/™. We see that the 
difference between v and 1 decreases exponentially witht. & 


Relativistic dustpan I 


This problem is essentially the same as Problem 12.3. Let M’ be the mass of 
the dustpan-plus-dust-inside system (which we’ll label “S”) when its speed is 
v. After a small time dt in the lab frame, S has moved a distance v dt, so it has 
basically collided with an infinitesimal mass iv dt. Its energy therefore increases 
to yM + Avdt. Its momentum is still yMv, so its mass is now 


M' = Jom + Av dt)? — (yMv)2 & m? +2yMvdt, (12.129) 
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where we have dropped the second-order dt? terms. Using the Taylor series 
J/1 +e 1+ €/2, we can approximate M’ as 


j 2ydv dt yru dt 
M~M fp ~M 1+ M =M + ywwdt. (12.130) 


The rate of increase in S’s mass is therefore y Av. As in Problem 12.3, this increase 
is greater than the nonrelativistic answer of “Av,” because heat is generated dur- 
ing the collision, and this heat shows up as mass in the final object. 


Remarks: As explained in the remark in the solution to Problem 12.3, this result is 
clear if we work in the dustpan frame. In this frame, the above-mentioned infinitesimal 
mass Av dt comes flying in with energy y (àv df), and essentially all of this energy 
shows up as mass in the final object. 

Note that the rate at which the mass increases, as measured in the dustpan frame, 
is y7Av, due to time dilation. The dust-entering-dustpan events happen at the same 
location in the dustpan frame, so we have indeed put the extra y factor in the correct 
place. Alternatively, you can think in terms of length contraction. S' sees the dust 
contracted, so its density is increased to yA. And the other y factor comes from 
the fact that the dust is moving in the dustpan frame, so there is a y factor in the 
energy. & 


Relativistic dustpan I 


The initial momentum is yy MV = P. There are no external forces, so the momen- 
tum of the dustpan-plus-dust-inside system (denoted by “S”) always equals P. 
That is, ymv = P, where m and v are the mass and speed of S at any later time. 
Let’s find v(x) first. The energy of S, namely ym, increases due to the 
acquisition of new dust. Therefore, d(ym) = A dx, which we can write as 


P 
a(*) = dx. (12.131) 
v 
Integrating this, and using the fact that the initial speed is V, gives P/v — P/V = 
Ax. Therefore, 


V 


v = TEP 


(12.132) 
For large x, this approaches P/(Ax). This makes sense, because the mass of S' is 
essentially equal to Ax, and it is moving at a slow, nonrelativistic speed. 

To find v(t), write the dx in Eq. (12.131) as v dt to obtain (P/V?) dv = w dt. 
This gives 


v Pdu f P P V 
— 7 = Adt => ag TA == 0) = eM 
y v 0 v V JV14+ (2V2at/P) 
(12.133) 


Integrating this to obtain x(t) gives 


Gane in 1 (12.134) 
T T P i i 


If you want, you can rewrite all of these answers in terms of M via P = yy MV. 
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REMARKS: 

1. You can also obtain the result in Eq. (12.134) by equating the expressions for v 
in Eqs. (12.132) and (12.133). Or you can write the v in Eq. (12.132) as dx/dt, 
and then separate variables and integrate. 

2. For small t, you can show that Eq. (12.134) reduces to x = Vt, as it should. For 
large ¢, x has the interesting property of being proportional to ./. 

3. Given P, all the results in this problem (when expressed in terms of P) are 
actually the same as the ones obtained in the nonrelativistic case, because 
Eq. (12.131) is still true there; it’s simply the expression for how the mass 
changes in the nonrelativistic case. From that point on, relativity never entered 
into the reasoning. & 


12.13. Relativistic dustpan I 


DUSTPAN FRAME: Let S denote the dustpan-plus-dust-inside system at a given 
time, and consider a small bit of dust (call this system s) that enters the dustpan. 
In S’s frame, the density of the dust is yA, due to length contraction. Therefore, in 
a time dt (where T is the time in the dustpan frame), a little s system of dust with 
mass y Av dt crashes into S and loses its negative momentum of —y (yAvdt)v = 
—y?v?A dt. The force ons is therefore F = dp/dt = y?v?A. The desired force 
on S is equal and opposite to this, so 


F = -yv A. (12.135) 


LAB FRAME: Ina time dt, where f is the time in the lab frame, a little s system 
of dust with mass àv dt gets picked up by the dustpan. What is the change in 
momentum of s? It is tempting to say that it is y (Av dt)v, but this would lead to 
a force of —yv*A on the dustpan, which doesn’t agree with the result we found 
above in the dustpan frame. This would be a problem, because longitudinal forces 
should be the same in different frames. 

The key point to realize is that the mass of whatever is moving increases at 
arate yAv, and not Av (see Problem 12.11). We therefore see that the change 
in momentum of the additional moving mass is y(yAudt)v = y?v2Adt. The 
original moving system S therefore loses this much momentum, and so the force 
onitis F = dp/dt = —y?v7A, in agreement with the result in the dustpan frame. 


12.14. Relativistic cart I 


GROUND FRAME (YOUR FRAME): Using reasoning similar to that in Problem 12.3 
and Problem 12.11, we see that the mass of the cart-plus-sand-inside system 
increases at a rate yo. Therefore, its momentum increases at a rate (using the 
fact that v is constant) 


a a (yow = y? (12.136) 
—= —)v= o)vu=y*ou. ; 
di y Ht vy y 
Since F = dp/dt, this is the force that you exert on the cart. Therefore, it is also 
the force that the ground exerts on your feet, because the net force on you is zero 
(since your momentum is constant, and in fact zero). 


CART FRAME: The sand-entering-cart events happen at the same location in the 
ground frame, so time dilation says that the sand enters the cart at a slower 
rate in the cart frame, that is, at a rate o/y. The sand flies in at speed v, and 
then eventually comes to rest on the cart, so its momentum decreases at a rate 
y(o/y)v = ov. This must therefore be the force that your hand applies to 
the cart. 
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If this were the only change in momentum in the problem, then we would have 
a problem, because the force on your feet would be ov in the cart frame, whereas 
we found above that it is y2o-v in the ground frame. This would contradict the fact 
that longitudinal forces are the same in different frames. What is the resolution to 
this apparent paradox? The resolution is that while you are pushing on the cart, 
your mass is decreasing. You are moving at speed v in the cart frame, and mass 
is continually being transferred from you (who are moving) to the cart (which is 
at rest), as we will show. This is the missing change in momentum that we need. 
The quantitative reasoning is as follows. 

Go back to the ground frame for a moment. We saw above that the mass of the 
cart-plus-sand-inside system (call this system “C”’) increases at a rate yo in the 
ground frame. Therefore, the energy of C increases at a rate y (yo) in the ground 
frame. The sand provides o of this energy, so you must provide the remaining 
(y? — Io part. Therefore, since you are losing energy at this rate, you must also 
be losing mass at this rate in the ground frame (because you are at rest there). 

Now go back to the cart frame. Due to time dilation, you lose mass at a rate of 
only (y? — 1)a/y. This mass goes from moving at speed v (that is, along with 
you), to speed zero (that is, at rest on the cart). Therefore, the rate of decrease in 
momentum of this mass is y? — lo/y)u = (y? — lov. Adding this result 
to the ov result we found for the sand, we see that the total rate of decrease in 
momentum is you. This is therefore the force that the ground applies to your 
feet, in agreement with the above calculation in the ground frame. 

Note that the reason why we didn’t have to worry about your changing mass 
when doing the calculation in the ground frame was that your speed there was 
zero. Your momentum was therefore always zero, independent of what was 
happening to your mass. 


Relativistic cart II 


GROUND FRAME: Using reasoning similar to that in Problem 12.3 and Problem 
12.11, we see that the mass of the cart-plus-sand-inside system increases at a 
rate yo. Therefore, its momentum increases at a rate y(yo)v = y2ov. How- 
ever, this is not the force that your hand exerts on the cart. The reason is that 
your hand is receding from the location where the sand enters the cart, so your 
hand cannot immediately be aware of the need for additional momentum. No 
matter how rigid the cart is, it can’t transmit information faster than c. In a sense, 
there is a sort of Doppler effect going on, and your hand needs to be responsi- 
ble for only a certain fraction of the momentum increase. Let’s be quantitative 
about this. 

Consider two grains of sand that enter the cart a time ¢ apart. What is the 
difference between the two times that your hand becomes aware that the grains 
have entered the cart? Assuming maximal rigidity (that is, assuming that signals 
propagate along the cart at speed c), the relative speed (as measured by someone 
on the ground) of the signals and your hand is c — v. The distance between the 
two signals is ct. Therefore, they arrive at your hand separated by a time of 
ct/(c — v). In other words, the rate at which you feel sand entering the cart is 
(c—v)/c times the given o rate. This is the factor by which we must multiply the 
naive y2ou result for the force we found above. The force you apply is therefore 
(dropping the c’s) 


F= (1- =)yov = ley (12.137) 
E 
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CART FRAME (YOUR FRAME): The sand-entering-cart events happen at the same 
location in the ground frame, so time dilation says that the sand enters the cart 
at a slower rate in the cart frame, that is, at a rate o/y. The sand flies in at speed 
v, and then eventually comes to rest on the cart, so its momentum decreases at 
arate y(o/y)v = ov. But again, this is not the force that your hand exerts on 
the cart. As above, the sand enters the cart at a location that is receding from 
your hand, so your hand cannot immediately be aware of the need for additional 
momentum. Let’s be quantitative about this. 

Consider two grains of sand that enter the cart a time ¢ apart. What is the 
difference between the two times that your hand becomes aware that the grains 
have entered the cart? Assuming maximal rigidity (that is, assuming that signals 
propagate along the cart at speed c), the relative speed (as measured by someone 
on the cart) of the signals and your hand is c, because you are at rest. The distance 
between the two signals is ct + vt, because the sand source is moving away from 
you at speed v. Therefore, the signals arrive at your hand separated by a time of 
(c + v)t/c. In other words, the rate at which you feel sand entering the cart is 
c/(c + v) times the time-dilated o/y rate. This is the factor by which we must 
multiply the naive ov result for the force we found above. The force you apply 
is therefore (dropping the c’s) 


1 ov 

F=( Jov = , (12.138) 
1+v/c l+uv 

in agreement with Eq. (12.137). 

In a nutshell, the two naive results in the two frames, y2ou and ov, differ by 
two factors of y. But the ratio of the two “Doppler-effect” factors (which arose 
from the impossibility of absolute rigidity) precisely remedies this discrepancy. 
The reason why we didn’t need to consider this Doppler effect in Problem 12.14 
is that there your hand is always right next to the point where the sand enters 
the cart. 


12.16. Different frames 
(a) The energy of the resulting blob is 2m + T£. Since the blob is at rest, we have 
M=2m+Te. (12.139) 


(b) Let the new frame be S. Let the original frame be S’. The critical point to realize 
is that in frame S, the left mass starts to accelerate before the right mass does. 
This is due to the loss of simultaneity between the frames. 

Consider the two events at which the two masses start to move. Let the 
left mass and right mass start moving at positions x; and xr in S. The Lorentz 
transformation Ax = y(Ax’ + vA’) tells us that x, — x, = y£, because 
Ax’ = £ and A?’ = 0 for these events. Alternatively, this follows from length 
contraction, because if we picture things in the original S’ frame, then the 
length y£ in S is what is length contracted down to £ in S’. 

Let the masses collide at position xc in S. Then the gain in energy of the left 
mass is T (xe — xı), and the gain in energy of the right mass is (—T) (xe — xr) 
which is negative if xe > xr. We have used the fact that the longitudinal force 
is the same in the two frames, so the masses still feel a tension T in frame S. 
The gain in the sum of the energies of the two masses is therefore 


AE = T (xe — x1) + (-T) (Qe — xr) = TOY — x1) = Ty. (12.140) 
The initial sum of the energies was 2ym, so the final energy is 
E =2ym+Ty£ = yM, (12.141) 


as desired. 


630 


Relativity (Dynamics) 


12.17. Splitting mass 


We’ ll calculate the times for the two parts of the process to occur. The energy 
of the mass right before it splits is (with the subscript b for “before”) Ep = 


M +T(€/2), so the momentum is pp = „E? — M? = y MT£ + T?02/4. Using 
F = dp/dt => t = Ap/T, the time for the first part of the process is 


VJ MTL + T202/4 Me & 
y= T — + 


; 12.142 
T 4 ( ) 


The momentum of the front half of the mass immediately after it splits is 


Pa = Pp/2 = (1/2) MT& + T2¢2/4. The energy at the wall is Ew = 
Epy/2 + T(£/2) = M/2 + 3T£/4, so the momentum at the wall is py = 


E2, — (M/2)? = (1/2)/3MT¢ + 9T2€2/4. The change in momentum during 
the second part of the process is therefore 


Ap = pw — Pa = (1/2), 3MT£ + 9722/4 — (1/2) MT& + T282 /4. 


(12.143) 
Since we have t = Ap/T, the time for the second part is 
p= 1 {SMe 9%? 1 | Me M 4? (12.144) 
a aN T a NT i 


The total time is ¢] + t2, which simply changes the minus sign in this expression 
to a plus sign. 


12.18. Relativistic leaky bucket 


(a) Let the wall be at x = 0, and let the initial position be x = £. Consider a small 
interval during which the bucket moves from x to x + dx (where dx is negative). 
The bucket’s energy changes by (—T) dx due to the string (this is positive), and 
also changes by a fraction dx/x, due to the leaking (this is negative). Therefore, 
dE = (—T) dx + E dx/x, or 

dE E 
—=-T+-. (12.145) 
dx x 

In solving this differential equation, it is convenient to introduce the variable 

y = E/x. With this definition, we have E’ = (xy) = xy’ + y, where a prime 

denotes differentiation with respect to x. Equation (12.145) then becomes xy’ = 

—T, or dy = —T dx/x. Integration gives y = —T ln x + C, which we can write 

as y = —T In(x/€) + B, in order to have a dimensionless argument in the log. 

Since E = xy, we therefore have 


E(x) = Bx — Tx In(x/€), (12.146) 


where B is a constant of integration. The reasoning up to this point is valid for 
both the total energy and the kinetic energy, because they both change in the 
two ways described above. Let’s look at each case. 


TOTAL ENERGY: Equation (12.146) gives 
E = M(x/é) — Tx In(x/€), (12.147) 


where the constant of integration, B, has been chosen to be M /£ so that E = M 
when x = £. In terms of the fraction z = x/£, we have E = Mz — TézInz. 
Setting dE/dz = 0 to find the maximum gives 


M Te ; 
In Zmax = 1 Emax = — eM/T®, (12.148) 
Te e 
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The fraction z must satisfy z < 1, so we must have Inz < 0. Therefore, a 
solution for z exists only if M < T£. IfM > TZ, then the total energy decreases 
all the way to the wall. 


Remarks: IfM is slightly less then T£, then Zmax is slightly less than 1, so Æ 
quickly achieves a maximum of slightly more than M, then decreases for the 
rest of the way to the wall. 

IfM < Té, then E achieves its maximum at zmax œ% 1/e, where it 
has the value T£/e. Essentially all of the energy is kinetic in this case, so 
this result must agree with the result for the kinetic energy below, which it 
indeed will. & 


KINETIC ENERGY: Equation (12.146) gives 
K = —TxIn(x/0), (12.149) 


where the constant of integration, B, has been chosen to be zero so that K = 0 
when x = £. Equivalently, E — K must equal the mass M(x/€). In terms of 
the fraction z = x/€, we have K = —TézInz. Setting dK /dz = 0 to find the 
maximum gives 


1 T 
Zmax Kmax > (12.150) 
e e 


which is independent of M . This result must reduce properly in the nonrelativis- 
tic limit. But since there’s nothing that needs reducing (there aren’t any terms 
that are small compared with others when v < c), this result must exactly equal 
the nonrelativistic result. And indeed, the analogous “Leaky bucket” problem 
in Chapter 5 (Problem 5.17) gives the same answer. 


(b) With z = x/£, the momentum of the bucket is p = VE? =m? = VE? = (Mz), 
so Eq. (12.147) gives 


p = V (Mz — Tez Inz)? — (Mz)? = V—2MT ez? Inz + T2022? In? z. 
(12.151) 


Setting the derivative equal to zero gives T£ In?z + (T£ — 2M) Inz — M = 0. 

The maximum momentum therefore occurs at 

2M — Te — VT? + 4M? 
2T¢ : 


We have ignored the other root, because it gives Inz > 0 => z > 1. 


(12.152) 


In Zmax = 


REMARKS: 

1. IfM <« T4, then In zmax % —1 => Zmax © 1/e. In this case, the bucket 
immediately moves with v © c, so we have E ~ pc. Therefore, E and 
p Should achieve their maxima at the same location. And indeed, we saw 
above that Emax occurs at Zmax © 1/e. 

2. IfM > Te, then Inzmax ~ —1/2 => Zmax % 1/./e. In this case, the 
bucket is nonrelativistic, so this result should agree with Problem 5.17, 
which it does. 

3. IfM = Te, then Inzmax = (1 — V5)/2, which is the negative of the 
inverse of the golden ratio. & 


12.19. Relativistic bucket 


(a) The energy of the mass right before it hits the wall is E = m+T€. Therefore, the 
momentum right before it hits the wall is p = VE? — m? = /2mTé + T202. 
So F = dp/dt gives (using the fact that the tension is constant) 
Ap V2mT¢ + T? 


At=— = 12.153 
F T ( ) 
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(b) 


(c) 


Ifm « T£, then At ~ £ (or £/c in normal units), which makes sense, because 
the mass travels at essentially speed c. And if m >> Té, then At © ./2m@/T. 
This is the nonrelativistic limit, and it agrees with the result obtained from the 
familiar expression, £ = at” /2, where a = T/m is the acceleration. 
STRAIGHTFORWARD METHOD: The energy of the blob right before it hits the 
wall is Ew = 2m + 2T£ (with the subscript w for “wall’”). If we can find the 
mass, M, of the blob, then we can use p = VE? — M? to get the momentum, 
and then use At = Ap/F to get the time.!7 

From part (a), the momentum right before the collision is pp = 
V2mT¢ + T2€2, and this is also the momentum of the blob right after the 
collision, pa. The energy of the blob right after the collision is Ea = 2m + T£. 
So the mass of the blob after the collision is M = /E2 — pZ = /4m? + mT. 


Therefore, the momentum at the wall is pw = a E2 M? = V6mTe + 4T? E, 
and so 


Ap [6mTt+4Te 
ara B NONE T l (12.154) 
F T 


Ifm = 0 then At = 2£, as expected. 


BETTER METHOD: [In the notation in the hint in the statement of the problem, 
the change in p? from the start to just before the collision is A (p°) = EX — i. 
This is true because 


E? =m? =p, and EX =m? =p, (12.155) 


and since m is the same throughout the first half of the process, we have 
A(E*) = A(p°). Likewise, the change in p? during the second half of the 
process is A(p?) = EI — ES, because 


E -M° =p, and E} - M’ =p, (12.156) 


and since M is the same throughout the second half of the process,!8 we have 


A(E?) = A (p°). The total change in p? is the sum of the above two changes, 
so the final p* is 


p? = (E2 — E?) + (E2 — E2) 


= ((m+ TO? — m°) + (Qm +270? — (2m + T0?) 
= 6mTl + 47707, (12.157) 


as in Eq. (12.154). The first solution in effect performed the same calculation, 
but in a more obscure manner. 

The reasoning in part (b) tells us that the final p? equals the sum of the A (E£?) 
terms over the N parts of the process. So we have, using an indexing notation 


17 Although the tension T acts on two different things (the mass m initially, and then the blob), it is 


valid to use the total Ap to obtain the total time via At = Ap/F, because if we wanted to, we 


could break up the Ap into its two parts, and then find the two partial times, and then add them 
back together to get the total At. 

'8 From the first solution, M happens to be /'4m? + 2mT£, but the nice thing about this solution is 
that we don’t need to know this. All we need to know is that it is constant. 


12.10 Solutions 


analogous to that in part (b), 


N N 
P = D (E - Ele) = Do (hn + kT 0? — (km + k - DTO’) 
k=1 k=1 
N 
=5 (2kmre +E- 19) 
k=1 
=N(N+)mTO+4 N77? CL. (12.158) 
Therefore, 


Ap _ JN(N + DmTEl + NIT? 
F T l 


At = 


(12.159) 


This agrees with the results from parts (a) and (b), for N = 1 and 2. 
(d) We want to take the limit N —> oo, £ —> 0, m — 0, with the restrictions that 
N£ = L and Nm = M. Written in terms of M and L, Eq. (12.159) becomes 


1+ 1/N)MTL + T? VMIL + TI 
ata V4 =i " t > + , (12.160) 


as N —> oo. This At is the same as the time it takes for one particle of mass 
m = M /2 to reach the wall, from part (a). The mass of the bucket at the wall is 


My = yE} — p?, = vV (M + TL? — (MTL + T?1?) 
= J/M2 + MTL. (12.161) 


If TL « M, then My ~ M, which makes sense. IfM < TL, then My ~ 
~ MTL, which means that My is the geometric mean between the given mass 
and the energy stored in the string. This isn’t entirely obvious. The speed of 
the bucket right before it hits the wall is 


pw VMTL+ PTE 
Ey  M+TL 


| TL T T 
= >c ae (12.162) 
M-+TL T+p T + per 


where p = M/L is the mass density. 


Vw = 


REMARK: Note that vw depends only on T and p. This means that if we change L 
by moving the wall to any other position, the speed at the wall will still be vy. In 
other words, the bucket moves toward the wall at the constant speed vw. (The task 
of Exercise 12.40 is to derive this result without taking the N — oo limit of many 
masses.) This constant-speed result must also hold in the nonrelativistic limit (that is, 
T < pc), for which we have vw ~ ./T/p. And indeed, this agrees with the result 
for Problem 5.27 (“Pulling a chain again”), which is essentially the same problem, in 
different language. & 
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Chapter 13 
4-vectors 


We now come to a very powerful concept in relativity, that of 4-vectors. Although 
it’s possible to derive everything in Special Relativity without the use of 4-vectors 
(and indeed, this is the route, give or take, that we’ve taken in the previous two 
chapters), they are extremely helpful in making calculations simpler and concepts 
more transparent. 

I have chosen to postpone the full introduction to 4-vectors until now, in order 
to make it clear that everything in Special Relativity can be derived without them. 
In encountering relativity for the first time, it’s nice to know that no “advanced” 
techniques are required. But now that you’ve seen everything once, let’s go back 
and derive various things in an easier way. 

Although Special Relativity doesn’t require knowledge of 4-vectors, the sub- 
ject of General Relativity definitely requires a firm understanding of tensors, 
which are the generalization of 4-vectors. We won’t have time to go very deeply 
into GR in Chapter 14, so you’ll just have to accept this fact. But suffice it to say 
that an eventual understanding of GR requires a solid foundation in the 4-vectors 
of Special Relativity. So let’s see what they’re all about. 


13.1 Definition of 4-vectors 


Definition 13.1 The 4-tuplet, A = (Ao, A1, 42, A3), is a “4-vector” if the A; 
transform under a Lorentz transformation in the same way as (c dt, dx, dy, dz) 
do. In other words, A is a 4-vector if it transforms like (assuming the Lorentz 
transformation is along the x direction; see Fig. 13.1): 


Ao = y (Ay + (v/c)A1), 
A, = y (41 + (v/c)Ap), 
Ay = A), 
43 = Aj. 


(13.1) 


REMARKS: 


1. Similar equations must hold, of course, for Lorentz transformations in the y and z 
directions. 

2. Additionally, the last three components must be a vector in 3-space. That is, they must 
transform like a usual vector under rotations in 3-space. So the full definition of a 4- 


13.2 Examples of 4-vectors 


vector is that it must transform like (c dt, dx, dy, dz) under Lorentz transformations and 
rotations. 

3. We’ll use a capital italic letter to denote a 4-vector. A bold-face letter will denote, as usual, 
a vector in 3-space. 

4. Lest we get tired of writing the c’s over and over, we'll work in units where c = 1 from 
now on. 

5. The first component of a 4-vector is called the “time” component. The other three are the 
“space” components. 

6. The components in (dt, dx, dy, dz) are sometimes referred to as (dxo, dx1, dx2, dx3). Also, 
some treatments use the indices “1” through “4,” with “4” being the “time” component. 
But we’ll use “0” through “3.” 

7. The A; may be functions of v, the dx;, the x; and their derivatives, and any invariants (that 
is, frame-independent quantities) such as the mass m. 

8. 4-vectors are the obvious generalization of vectors in regular space. A vector in three 
dimensions, after all, is something that transforms under a rotation just like (dx, dy, dz) 
does. We have simply generalized a 3-D rotation to a 4-D Lorentz transformation. & 


13.2 Examples of 4-vectors 


So far, we have only one 4-vector at our disposal, namely (dt, dx, dy, dz). What are 
some others? Well, (7dt, 7dx, 7dy, 7dz) certainly works, as does any other con- 
stant multiple of (dt, dx, dy, dz). Indeed, m(dt, dx, dy, dz) is a 4-vector, because 
m is an invariant. But how about 4 = (dt, 2dx, dy, dz)? No, this isn’t a 4-vector, 
because on one hand it must transform (assuming it’s a 4-vector) like 


dt = Ao = y (4o +4) = y (dt +v2dx)), 
2dx = A, =y(4, +049) = y (2d) +vdt'), 

dy = A.=A), = dy, 

dz = 43 = 44 = dz’, 


(13.2) 


from the definition of a 4-vector. But on the other hand, it must transform like 


dt = y (dt! + v dx’), 


2dx = 2y (dx + vdt’), 
(13.3) 
dy = dy’, 


dz = dz’, 


because this is how the dx; transform. The two preceding sets of equations are 
inconsistent, so A = (dt, 2dx, dy, dz) is not a 4-vector. Note that if we had instead 
considered the 4-tuplet, A = (dt, dx, 2dy, dz), then the two preceding equations 
would have been consistent. But if we had then looked at how A transforms under 
a Lorentz transformation in the y direction, we would have found that it is not a 
4-vector. 
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The moral of this story is that the above definition of a 4-vector is a nontrivial 
one because there are two possible ways that a 4-tuplet can transform. It can 
transform according to the 4-vector definition, as in Eq. (13.2). Orit can transform 
simply by having each of the A; separately transform (using how the dx;, or 
whatever else they may be made of, transform), as in Eq. (13.3). Only for certain 
special 4-tuplets do these two methods give the same result. By definition, we 
label these special 4-tuplets as 4-vectors. 

Let’s now construct some less trivial examples of 4-vectors. In constructing 
these, we’ll make abundant use of the fact that the proper-time interval, dt = 


J dt? — dr?, is an invariant. 


e Velocity 4-vector: We can divide (dt, dx, dy, dz) by dt, where dt is the proper time 
between two events (the same two events that yielded the dt, etc.). The result is indeed 
a 4-vector, because dt is independent of the frame in which it is measured. Using 
dt = dt/y, we obtain 


dx dy dz 


K= ae E ee T 
dt dt dt 


(dt, dx, dy, dz) = y (: ) = (y, yY). (13.4) 


1 
dt 
This is known as the velocity 4-vector. In the rest frame of the object we have v = 0, 
so V reduces to V = (1,0, 0,0). With the c’s, we have V = (yc, yv). 
e Energy—momentum 4-vector: If we multiply the velocity 4-vector by the invariant 
m, we obtain another 4-vector, 


P = mV = (ym, ymy) = (E,p), (13.5) 


which is known as the energy-momentum 4-vector (or the 4-momentum for short), for 
obvious reasons. In the rest frame of the object, P reduces to P = (m, 0,0, 0). With the 
c’s, we have P = (ymc, ymy) = (E/c, p). Some treatments multiply through by c, so 
that the 4-momentum is (E, pc). 

e Acceleration 4-vector: We can also take the derivative of the velocity 4-vector with 
respect to t. The result is indeed a 4-vector, because taking the derivative entails tak- 
ing the (infinitesimal) difference between two 4-vectors (which results in a 4-vector 
because Eq. (13.1) is linear), and then dividing by the invariant dt (which again results 
in a 4-vector). Using dt = dt/y, we obtain 


dV d dy d(yv) 
A= — = — (y, = —, . 13.6 
ee (2 Ai (13.6) 
Using dy /dt = vi/(1 — v?)3/2 = y3 vi, we have 
A= (y*vb, y*viv + y*a), (13.7) 


where a = dv/dt. A is known as the acceleration 4-vector. In the rest frame of the 


object (or, rather, in the instantaneous inertial frame), A reduces to A = (0,a). As we 


13.3 Properties of 4-vectors 


usually do, we'll pick the velocity v to point in the x direction. That is, v = (vx, 0,0). 
This means that v = vy, and also that ù = ùy = ay.! Equation (13.7) then becomes 
A= (yt oxax, y'vžax + yay, i ty, yz) 
= (yÍvxax, vax, y7ay, Yaz). (13.8) 
We can keep taking derivatives with respect to t to create other 4-vectors, but these 


have little relevance in the real world. 
e Force 4-vector: We define the force 4-vector as 


rae _ (dE dp) _ (dE, ce 
ae) ae a) ae 


where f = d(ymv)/dt is the usual 3-force. We’ll use f instead of F in this chapter, 
to avoid confusion with the 4-force, F. In the case where m is constant,2 F can be 
written as F = d(mV)/dt = mdV/dt = mA. We therefore still have a nice “F equals 
mA” law of physics, but it’s now a 4-vector equation instead of the old 3-vector one. In 
terms of the acceleration 4-vector, we can use Eqs. (13.7) and (13.8) to write (if m is 


constant) 


F=mA= m(y* vo, ys viv + ya) 


= My*vxdx, y*ax, Yay, Yaz). (13.10) 
Combining this with Eq. (13.9), we see that the 3-force is 
f = m(y3ax, yay, yaz), (13.11) 


in agreement with Eq. (12.60). In the rest frame of the object (or, rather, the instantaneous 
inertial frame), the F in Eq. (13.9) reduces to F = (0, f), because dE/dt = 0 when 
v = 0, as you can verify. Also, in the rest frame of the object, the mA in Eq. (13.10) 
reduces to mA = (0,ma). So F = mA reduces to the familiar f = ma. 


13.3 Properties of 4-vectors 


The appealing thing about 4-vectors is that they have many useful properties. 
Let’s look at some of these. 


e Linear combinations: If A and B are 4-vectors, then C = aA + bB is also a 4-vector. 
This is true because the transformations in Eq. (13.1) are linear (as we noted above 


' The acceleration vector a is free to point in any direction, but you can check that the 0’s in v lead 
to ù = ay. See Exercise 13.5. 

2 The mass m wouldn’t be constant if the object were being heated, or if extra mass were being added 
to it. We won’t concern ourselves with such cases here. 
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when deriving the acceleration 4-vector). This linearity implies that the transformation 
of, say, the time component is 


Co = (aA + bB)o = ao + bBo = a(Ay + vA)) + (Bo + vB) 
= (Ap + bBo) + v(ad' + bB)) 
=Cyt+vC}, (13.12) 
which is the proper transformation for the time component ofa 4-vector. Likewise for the 
other components. This property holds, of course, just as it does for linear combinations 
of vectors in 3-space. 


Inner-product invariance: Consider two arbitrary 4-vectors, A and B. Define their 
inner product to be 


A - B = AọBọo — 41 B, — A2B2 — A3B3 = ApBo — A - B. (13.13) 


Then A - B is invariant. That is, it is independent of the frame in which it is calculated. 
This can be shown by direct calculation, using the transformations in Eq. (13.1): 


A+B = AoBo — A) B) — A2B2 — A3B3 
= (y (4 + v4})) (YB) + vB1)) — (YAY + v49)) (7B; + ¥BO)) 
— AB), — ASB’ 
= y7(AgBp + v(4pB) +44 Bp) + 0741.B1) 
— y? (A,B + vA By + AGB) + 07AB) — 44B, — 45B, 
= ADB (y? — Y?a?) — A1 Bi (vy? — y707) — AB) — 43B, 
= ApBy — 4) Bi — 45B5 — 43B5 
= A.B’. (13.14) 


The importance of this result cannot be overstated. This invariance is analogous to the 
invariance of the inner product, A - B, for rotations in 3-space. The above inner product 
is also invariant under rotations in 3-space, because it involves the combination A - B. 
The minus signs in the inner product may seem a little strange. But what we want is a 
combination of two arbitrary vectors that is invariant under a Lorentz transformation, 
because such combinations are very useful in seeing what’s going on in a system. The 
nature of the Lorentz transformations demands that there be opposite signs in the inner 
product, so that’s the way it is. 

Norm: As a corollary to the invariance of the inner product, we can look at the inner 
product of a 4-vector with itself, which is by definition the square of the norm. We 
see that 


A? = A - A = AoAo — 4141 — A242 — 4343 = AS — JAI? (13.15) 


13.4 Energy, momentum 


is invariant. This is analogous to the invariance of the norm VA - A for rotations in 
3-space. Special cases of the invariance of the 4-vector norm are the invariance of 
c? dt? — dx? and of E? — prc’. 

e A theorem: Here’s a nice little theorem: 
If a certain one of the components of a 4-vector is 0 in every frame, then all four 


components are 0 in every frame. 


Proof: If one of the space components (say, 41) is 0 in every frame, then the other 
space components must also be 0 in every frame, because otherwise a rotation would 
make A; Æ 0. Also, the time component Ap must be 0 in every frame, because otherwise 
a Lorentz transformation in the x direction would make A; Æ 0. 

If the time component, Ao, is 0 in every frame, then the space components must 
also be 0 in every frame, because otherwise a Lorentz transformation in the appropriate 
direction would make Ag # 0. E 


If someone comes along and says that she has a vector in 3-space that has no x 
component, no matter how you rotate the axes, then you would certainly say that the 
vector must obviously be the zero vector. The situation in Lorentzian 4-space is the 
same, because all the coordinates get intertwined with each other in the Lorentz (and 


rotation) transformations. 


13.4 Energy, momentum 
13.4.1 Norm 


Many useful things arise from the fact that the P in Eq. (13.5) is a 4-vector. The 
invariance of the norm implies that P - P = E? — |p|? is invariant. If we are 
dealing with only one particle, we can determine the value of P? by conveniently 
working in the rest frame of the particle (so that v = 0), which gives 


E -p =m’, (13.16) 


2c4, with the c’s. We already knew this, of course, from just 


writing out E? — p? = y2m? — y?m?v? = m?. 
For a collection of particles, knowledge of the norm is very useful. If a process 


involves many particles, then we can say that for any subset of the particles, 


© E) = © p) is invariant, (13.17) 


because this is the norm of the sum of the energy-momentum 4-vectors of the 
chosen particles. The sum is again a 4-vector, due to the linearity of Eq. (13.1). 
What is the value of the invariant in Eq. (13.17)? The most concise description 
(which is basically a tautology) is that it is the square of the energy in the CM 
frame, that is, in the frame in which }` p = 0. For one particle, this reduces to 
m?. Note that the sums are taken before squaring in Eq. (13.17). Squaring before 
adding would simply give the sum of the squares of the masses. 


or E? — p? =m 
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13.4.2 Transformations of E£ and p 


We already know how the energy and momentum transform (see Section 12.2), 
but let’s derive the transformation again here in a very quick and easy manner. 
We know that (E, px, PysPz) is a 4-vector. So it must transform according to 
Eq. (13.1). Therefore, for a Lorentz transformation in the x direction, we have 


E = y(E' + vpl), 
Px = Y (Px + vE’, 


, (13.18) 
Py = Py» 


Pz =p, 


in agreement with Eq. (12.27). That’s all there is to it. The fact that E and p 
are part of the same 4-vector provides an easy way to see that if one of them 
is conserved (in every frame) in a collision, then the other is also. Consider an 
interaction among a set of particles, and look at the 4-vector, AP = Pafter—Phefore- 
If E is conserved in every frame, then the time component of AP is 0 in every 
frame. But then the theorem in Section 13.3 says that all four components of AP 
are 0 in every frame. Therefore, p is conserved. Likewise for the case where one 
of the p; is known to be conserved. 


13.5 Force and acceleration 


Throughout this section, we'll deal with objects with constant mass, which we’ ll 
call “particles.” The treatment here can be generalized to cases where the mass 
changes (for example, the object is being heated, or extra mass is being dumped 
on it), but we won’t concern ourselves with these. 


13.5.1 Transformation of forces 


Let’s first look at the force 4-vector in the instantaneous inertial frame of a given 
particle (frame S’). Equation (13.9) gives 


IE! 
F' = (1) = (0,f’). (13.19) 


The first component is zero because dE'/dt = d(m/V1 — v’? )/dt, and this 
contains a factor of v’, which is zero in this frame. Equivalently, you can just use 
Eq. (13.10), with a speed of zero. 

We can now write down two expressions for the 4-force, F, in another frame, 
S, in which the particle moves with speed v in the x direction. First, since F 


13.5 Force and acceleration 


is a 4-vector, it transforms according to Eq. (13.1). We therefore have, using 
Eq. (13.19), 


Fo = y (F9 + vF!) = yf, 
Fi = y (Fi + vF) = yh, 
R=F =f, (13.20) 
F; = F; = fy. 
But second, from the definition in Eq. (13.9), we also have 
Fo = y dE/dt, 
Fi = Yfr, 
Fy = Yf, 
F; = yf. 


Combining Eqs. (13.20) and (13.21), we obtain 


(13.21) 


dE /dt = vf, 
F=f 


h =f;lY, 
f =flY. 


We therefore recover the results of Section 12.5.3. The longitudinal force is 
the same in both frames, but the transverse forces are larger by a factor of y in 
the particle’s frame. Hence, f/f, decreases by a factor of y when going from the 
particle’s frame to the lab frame (see Fig. 13.2 and Fig. 13.3). And as a bonus, 
the Fo component in Eq. (13.22) tells us (after multiplying through by dt) that 
dE = fy dx, which is the work—energy result. In other words, using fs = dp,/dt, 
we have just derived again the result, dE/dx = fy = dp/dt, that we derived in 
Section 12.5.1. 

As noted in the first remark in Section 12.5.3, we can’t switch the S and S’ 
frames and write f! = f,/y. When talking about the forces on a particle, there 
is indeed one preferred reference frame, namely that of the particle. All frames 
are not equivalent here. When forming all of our 4-vectors in Section 13.2, we 
explicitly used the dt, dt, dx, etc., from two events, and it was understood that 
these two events were located at the particle. 


(13.22) 


13.5.2 Transformation of accelerations 


The procedure here is similar to the above treatment of the force. Let’s first look 
at the acceleration 4-vector in the instantaneous inertial frame of a given particle 


(frame S’) 


f' 


f 


mer 


Fig. 13.2 


(frame S) 


Fig. 13.3 
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(frame S$’) 


(frame S) 


Fig. 13.5 
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(frame S’). Equation (13.7) or Eq. (13.8) gives 


A = (0,a’), (13.23) 


because v’ = 0 in S’. 

We can now write down two expressions for the 4-acceleration, A, in another 
frame, S. First, since A is a 4-vector, it transforms according to Eq. (13.1). So we 
have, using Eq. (13.23), 


Ao = y (4h + v4)) = yod,, 
A1 = y (41 + v0) = yay, 


(13.24) 
A2 = 5 = ays 
A; = AS =a 
But second, from the expression in Eq. (13.8), we also have 
Ag = y‘vay, 
Aj = yay, 
j (13.25) 
Ay =Y 4, 
A3 = yas. 
Combining Eqs. (13.24) and (13.25), we obtain 
ay = a./y?, 
ax = eae 
ace (13.26) 
ay = a,/y", 
az = d, Jy’. 


(The first two equations here are redundant.) We therefore again recover the 
results of Section 12.5.3. We see that a,/a, increases by a factor of y?y? =y 
when going from the particle’s frame to the lab frame (see Fig. 13.4 and Fig. 13.5). 
This is the opposite of the effect on fy/ f. This difference makes it clear that 
an f = ma law wouldn’t make any sense. If it were true in one frame, then it 
wouldn’t be true in another. Note that the increase in a,/a, in going to the lab 
frame is consistent with length contraction, as the “Bead on a rod” example in 
Section 12.5.3 showed. 


3 In a nutshell, this difference is due to the fact that y changes with time. When talking about the 
acceleration 4-vector, there are y’s that we have to differentiate; see Eq. (13.6). This isn’t the case 
with the force 4-vector, because the y is absorbed into the definition of p = ymv; see Eq. (13.9). 
This is what leads to the different powers of y in Eq. (13.25), in contrast with the identical powers 
in Eq. (13.21). 


13.6 The form of physical laws 


Example (Acceleration for circular motion): A particle moves with constant 
speed v around the circle x2 + y? = r?, z = 0, in the lab frame. At the instant 
the particle crosses the negative y axis (see Fig. 13.6), find the 3-acceleration and 
4-acceleration in both the lab frame and the instantaneous inertial frame of the particle 
(with axes chosen parallel to the lab’s axes). 


Solution: Let the lab frame be S, and let the particle’s instantaneous inertial frame 
be S” when it crosses the negative y axis. Then S and S’ are related by a Lorentz 
transformation in the x direction. The 3-acceleration in S is simply 


a = (0, v? /r,0). (13.27) 


There’s nothing fancy going on here; the nonrelativistic proof of a = v2/r works 
just fine again in the relativistic case. Equation (13.7) or (13.8) then gives the 
4-acceleration in S as 


A= (0,0, y7u7/r, 0). (13.28) 


To find the acceleration vectors in S’, we can use the fact that S’ and S are related 
by a Lorentz transformation in the x direction. This means that the 42 component 
of the 4-acceleration is unchanged. So the 4-acceleration in S’ is also 


A =A = (0,0, y?v7/r, 0). (13.29) 


In the particle’s frame, a’ is the space part of A (using Eq. (13.7) or (13.8), with v = 0 
and y = 1). Therefore, the 3-acceleration in S’ is 


a’ = (0, y7v~/r,0). (13.30) 
Note that our results for a and a’ are consistent with Eq. (13.26). 


REMARK: We can also arrive at the two factors of y in a’ by using a simple time-dilation 
argument. We have 
; d’y'! dy 2 dy a v2 
a = a z5” 2 = p? 
2 dt d(t/y) dt 7 


(13.31) 


where we have used the fact that transverse lengths are the same in the two frames. & 


13.6 The form of physical laws 


One of the postulates of Special Relativity is that all inertial frames are equivalent. 
Therefore, if a physical law holds in one frame, then it must hold in all frames. 
Otherwise, it would be possible to differentiate between frames. As noted in 


Fig. 13.6 
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the previous section, the statement “f = ma” cannot be a physical law. The 
two sides of the equation transform differently when going from one frame to 
another, so the statement cannot be true in all frames. If a statement has any 
chance of being true in all frames, it must involve only 4-vectors. Consider a 
4-vector equation (say, “A = B”) that is true in frame S. Then if we apply to 
this equation a Lorentz transformation (call it M) from S to another frame S’, 
we have 


A=B 
= A=P'. (13.32) 


The law is therefore also true in frame S’. Of course, there are many 4-vector 
equations that are simply not true in any frame (for example, F = P, or2P = 3P). 
Only a small set of such equations (for example, F = mA) are true in at least one 
frame, and hence in all frames. 

Physical laws may also take the form of scalar equations, such as P - P = m’. 
A scalar is by definition a quantity that is frame-independent (as we have shown 
the inner product to be). So ifa scalar statement is true in one inertial frame, then 
it is true in all inertial frames. Physical laws may also be higher-rank “tensor” 
equations, such as the ones that arise in electromagnetism and General Relativity. 
We won’t discuss tensors here, but suffice it to say that they may be thought of 
as things built up from 4-vectors. Scalars and 4-vectors are special cases of 
tensors. 

All of this is exactly analogous to the situation in 3-D space. In Newtonian 
mechanics, f = ma is a possible law, because both sides are 3-vectors. But 
f = m(2ax,ay,az) is not a possible law, because the right-hand side is not a 
3-vector; it depends on which axis you label as the x axis. If a particle has 
acceleration a in the eastward direction, and if you happen to pick this as your 
x direction, then the force is 2ma eastward. But if you happen to pick eastward 
as your y direction, then the force is ma eastward. It makes no sense for a law 
to give two different results based on your arbitrary choice of axis labels. An 
example of a frame-independent statement (under rotations) is the claim that a 
given stick has a length of 2 meters. This is fine, because it involves the norm, 
which is a scalar. But if you say that the stick has an x component of 1.7 meters, 
then this cannot be true in all frames. 


God said to his cosmos directors, 
“Tve added some stringent selectors. 
One is the clause 

That your physical laws 


Shall be written in terms of 4-vectors.” 


13.8 Exercises 


13.7 Problems 


13.1. 


13.2. 


13.4. 


Velocity addition « 


In A’s frame, B moves to the right with speed u, and C moves to the 
left with speed v. What is the speed of B with respect to C? In other 
words, use 4-vectors to derive the velocity-addition formula. 


Relative speed « 


In the lab frame, two particles move with speed v along the paths shown 
in Fig. 13.7. The angle between the trajectories is 20. What is the speed 
of one particle, as viewed by the other? 


. Another relative speed x 


In the lab frame, particles A and B move with speeds u and v along 
the paths shown in Fig. 13.8. The angle between the trajectories is 0. 
What is the speed of one particle, as viewed by the other? 


Acceleration for linear motion « 


A spaceship starts at rest with respect to frame S and accelerates with 
constant proper acceleration a. In Section 11.9, we showed that the 
speed of the spaceship with respect to S is given by v(t) = tanh(at), 
where Tt is the spaceship’s proper time (we have dropped the c’s). Let 
V be the spaceship’s 4-velocity, and let A be its 4-acceleration. In terms 
of the proper time T: 

(a) Find V and A in frame S, by explicitly using v(t) = tanh (art). 

(b) Write down V’ and Æ’ in the spaceship’s frame, S”. 

(c) Verify that V and V’ transform like 4-vectors between the two 

frames. Likewise for A and J’. 


13.8 Exercises 


13.5. 


13.6. 


Acceleration at rest 


Show that the derivative ofv = ,/vz + vy + vz equals ay, independent 
of how the various v;’s are changing, provided that v, = vz = 0 at the 


moment in question. 


Linear acceleration x 


A particle’s velocity and acceleration both point in the x direction, with 
magnitudes v and ù, respectively (as measured in the lab frame). In 
the spirit of the example in Section 13.5.2, find the 3-acceleration and 
4-acceleration in both the lab frame and the instantaneous inertial frame 
of the particle. Verify that the 3-accelerations are related according to 
Eq. (13.26). 
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13.7. Linear force « 


For the setup in the previous exercise, find the 3-force and 4-force in 
both the lab frame and the instantaneous inertial frame of the particle. 
Verify that the 3-forces are related according to Eq. (13.22). 


13.8. Circular motion force « 


For the setup in the example in Section 13.5.2, find the 3-force and 
4-force in both the lab frame and the instantaneous inertial frame of 
the particle. Verify that the 3-forces are related according to Eq. (13.22). 
(Solve this from scratch; don’t just use the results from the example.) 


13.9. Same speed x 


Consider the setup in Problem 13.2. Given v, what should 0 be so 
that the speed of one particle, as viewed by the other, is also v? (The 
first form of the speed in Eq. (13.38) will make your calculations the 
simplest.) Do your answers make sense for v © 0 and v © c? 


13.10. Doppler effect x 


Consider a photon traveling in the x direction. Ignoring the y and z 
components, and setting c = 1, the 4-momentum is (p, p). In matrix 
notation, what are the Lorentz transformations for the frames traveling 
Fig. 13.9 to the left and to the right at speed v? What is the new 4-momentum 
of the photon in these new frames? Accepting the fact that a photon’s 
energy is proportional to its frequency, verify that your results are 
consistent with the Doppler results in Section 11.8.1. 


13.11. Three particles «x 


Three particles head off with equal speeds v, at 120° with respect to 
each other, as shown in Fig. 13.9. What is the inner product of any two 
of the 4-velocities in any frame? Use your result to find the angle 0 
(see Fig. 13.10) at which two particles travel in the frame of the third. 


Fig. 13.10 
Niece 13.9 Solutions 
v C B u 13.1. Velocity addition 
j a > Let the desired speed of B with respect to C be w (see Fig. 13.11). In A’s frame, the 
4-velocity of B is (Yu, Yuu), and the 4-velocity of C is (yv, —yvv), where we have 
suppressed the y and z components. In C’s frame, the 4-velocity of B is (Yw, Yww), 
and the 4-velocity of C is (1, 0). The invariance of the inner product implies that 
C BE w 
e o —— (Yu, Yul) + (Vv, —Yvv) = (Yw, Yww) : (1,0) 
C's frame => uv + w) = Yw 


1 + uv 1 
Fig. 13.11 VI- NVI- NVi-w? 


(13.33) 


13.2. 


13.3. 


13.9 Solutions 


Squaring and then solving for w gives 


u+v 
w = : 
l+uv 


(13.34) 


Relative speed 
In the lab frame, the 4-velocities of the particles are (suppressing the z component) 


(Yo, Yu cos 0, Yyusind) and (yy, Yyvcosd,—yyv sin 0). (13.35) 


Let w be the desired speed of one particle as viewed by the other. Then in the frame 
of one particle, the 4-velocities are (suppressing two spatial components) 


(YwsYww) and (1,0), (13.36) 


where we have rotated the axes so that the relative motion is along the x axis in this 
frame. Since the 4-vector inner product is invariant under Lorentz transformations 
and rotations, we have (using cos 20 = cos? — sin?@) 


(Yv, Yo COS 0, Yov SIND) - (Yv, Yvv cos 0, — yav sin 0) = (Yw, Yww) + (1, 0) 
= y2(1—v’ cos20) = yw. (13.37) 


Using the definitions of the y’s, squaring, and solving for w gives 


(13.38) 


i (d —v?)2 V2v2(1 — cos 20) — v4 sin?20 
Hg (1 —v2cos20)2 I — v2 cos 20 


If desired, this can be rewritten (using some double-angle formulas) in the form, 


2v sin 0y 1 — v2 cos26 
w= . 


1 — v? cos 20 


(13.39) 


See the solution to Problem 11.14 for some limiting cases. 


Another relative speed 
In the lab frame, the 4-velocities of the particles are (suppressing the z component) 


V4 = (Vu, Yuu,0) and Vg = (Vv, Yyv COSA, Yyv sin 0). (13.40) 


Let w be the desired speed of one particle as viewed by the other. Then in the frame 
of one particle, the 4-velocities are (suppressing two spatial components) 


(Yw: Yww) and (1,0), (13.41) 


where we have rotated the axes so that the relative motion is along the x axis in this 
frame. Since the 4-vector inner product is invariant under Lorentz transformations 
and rotations, we have 


(Yus Yul, 0) - (Yv, Yvv COS 0, Yyv sin 0) = (Yw, Yww) » (1, 0) 
=> YuYə(l — wv cos0) = yw. (13.42) 


Using the definitions of the y’s, squaring, and solving for w gives 


(13.43) 


pais w)(1—v2) vu? + v2 — Quv cos 6 — u2v2 sin26 
w= (1—wvcos@)? 1 — uv cos 0 : 


See the solution to Problem 11.15 for some limiting cases. 
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13.4. Acceleration for linear motion 
(a) Using v(t) = tanh(at), we have y = 1/V1— v2 = cosh(at). Therefore, 
V=a(y,yv) = (cosh(at), sinh(at)), (13.44) 
where we have suppressed the two transverse components. We then have 


dV 


A= ae = a( sinh(at), cosh(at)). (13.45) 
t 
(b) The spaceship is at rest in its instantaneous inertial frame, so 
v'=(1,0) and A’ =(0,a). (13.46) 


Equivalently, these are obtained by setting t = 0 in the results from part (a), 
because the spaceship hasn’t started moving at t = 0, as is always the case in 
the instantaneous rest frame. 

(c) The Lorentz transformation matrix from S’ to S is 


_f{ y vyv \_ [{cosh(at) sinh(at) 
a ( rae ) ~ eo cosh(at) ) ; (13.47) 


We must check that 


Vo y! Ag A 
ORORO ONS 


These are easily seen to be true. 


Chapter 14 
General Relativity 


This will be somewhat of a strange chapter, because we won’t have enough 
time to get to the heart of General Relativity (GR).! But we’ll still be able to 
get a flavor of the subject and derive a few interesting GR results. One crucial 
idea in GR is the Equivalence Principle. This says that gravity is equivalent to 
acceleration. Or in more practical terms, it says that you can’t tell the difference. 
We’ ll have much to say about this in the sections below. Another crucial concept 
in GR is that of coordinate independence: the laws of physics can’t depend on 
what coordinate system you choose. This seemingly innocuous statement has 
surprisingly far-reaching consequences. However, a discussion of this topic is 
one of the many things we won’t have time for. We would need a whole book on 
GR to do it justice. But fortunately it’s possible to get a sense of the nature of GR 
without having to master such things. This is the route we’ ll take in this chapter. 


14.1 The Equivalence Principle 


Einstein’s Equivalence Principle says that it is impossible to locally distinguish 
between gravity and acceleration. This may be stated more precisely in (at least) 
three ways. 


e Let person A be enclosed in a small box, far from any massive objects, that undergoes 
uniform acceleration (say, g). Let person B stand at rest on the earth (see Fig. 14.1). 
The Equivalence Principle says that there are no local experiments these two people can 
perform that will tell them which of the two settings they are in. The physics of each 
setting is the same. 

e Let person A be enclosed in a small box that is in free-fall near a planet. Let person B float 
freely in space, far away from any massive objects (see Fig. 14.2). The Equivalence 
Principle says that there are no local experiments these two people can perform that will 
tell them which of the two settings they are in. The physics of each setting is the same. 


Ten years after his 1905 paper on the Special Theory of Relativity, Einstein completed his General 
Theory of Relativity in 1915, collaborating with Marcel Grossmann during the latter part of this 
period. David Hilbert also developed many of the final pieces of the theory in parallel with Einstein; 
see Medicus (1984). For a wonderful account of other historical developments of the theory, see 
Chandrasekhar (1979). 
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Fig. 14.1 
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e “Gravitational” mass is equal to (or proportional to) “inertial” mass. Gravitational mass 


is the mg that appears in the formula, F = GMmg/ r= mgg. Inertial mass is the m; that 
appears in the formula, F = m;a. There is no a-priori reason why these two m’s should 
be the same (or proportional). An object that is dropped on the earth has acceleration 
a = (mg/m;)g. For all we know, the ratio mg/m; for plutonium might be different from 
that for copper. But experiments with various materials have detected no difference in 
the ratios. The Equivalence Principle states that the ratios are equal for any type of mass. 

This definition of the Equivalence Principle is equivalent to, say, the second one 
above for the following reason. Two different masses that start at rest near B will stay 
right where they are as they float freely in space. But two different masses that start at 
rest near A will stay next to each other if and only if their accelerations are equal, that 
is, if and only if the ratio mg/m; is the same for both. If this ratio is different for the 
two masses, then they will diverge from each other, which means that it is possible to 
distinguish between the two settings. 


These statements are all quite believable. Consider the first one, for example. 
When standing on the earth, you have to keep your legs firm to avoid falling 
down. When standing in the accelerating box, you have to keep your legs firm 
to maintain the same position relative to the floor (that is, to avoid “falling 
down”). You certainly can’t naively tell the difference between the two scenarios. 
The Equivalence Principle says that it’s not just that you’re too inept to figure 
out a way to differentiate between them, but instead that there is no possible 
local experiment you can perform to tell the difference, no matter how clever 
you are. 


REMARK: Note the inclusion of the words “small box” and “local” above. On the surface of 
the earth, the lines of the gravitational force are not parallel; they converge to the center. The 
gravitational force also varies with height. Therefore, an experiment performed over a non- 
negligible distance (for example, dropping two balls next to each other, and watching them 
converge; or dropping two balls on top of each other and watching them diverge) will have 
different results from the same experiment in the accelerating box. The Equivalence Principle 
says that if your laboratory is small enough, or if the gravitational field is sufficiently uniform, 
then the two scenarios look essentially the same. &æ 


14.2 Time dilation 


The Equivalence Principle has a striking consequence concerning the behavior of 
clocks in a gravitational field. It implies that higher clocks run faster than lower 
clocks. If someone puts a watch on top of a tower, and if you stand on the ground, 
then you will see the watch on the tower tick faster than an identical watch on 
your wrist. If the watch on the tower is taken down and you compare it with the 
one on your wrist, it will show more time elapsed.” Likewise, someone standing 


2 This is true only if the watch on the tower is kept there for a long enough time (assuming that the 
watches show the same time when you start observing), because the movement of the watch when 


14.2 Time dilation 


on top of the tower will see a clock on the ground run slow. To be quantitative 
about this, let’s consider the following two scenarios. 


e A light source on top of a tower of height h emits flashes at time intervals ts. A receiver 
on the ground receives the flashes at time intervals tr (see Fig. 14.3). What is fr in terms 
of ts? 

e A rocket of length / accelerates with acceleration g. A light source at the front end 
emits flashes at time intervals ¢;. A receiver at the back end receives the flashes at time 
intervals f; (see Fig. 14.4). What is fr in terms of ts? 


The Equivalence Principle tells us that these two scenarios are exactly the same, 
as far as the sources and receivers are concerned. Hence, the relation between ty 
and ts; must be the same in each. Therefore, to find out what is going on in the 
first scenario, we will study the second scenario (because we can figure out how 
this one behaves). 

Consider an instantaneous inertial frame, S, of the rocket. In this frame, the 
rocket is momentarily at rest (at, say, ¢ = 0), and then it accelerates out of the 
frame with acceleration g. The following discussion will be made with respect 
to the frame S. Consider a series of quick light pulses emitted from the source, 
starting at £ = 0. The distance the rocket has traveled out of S at time ¢ is gt*/2, 
so if we assume that ¢, is very small, then we may say that many light pulses are 
emitted before the rocket moves appreciably. Likewise, the speed of the source, 
namely gt, is also very small. We may therefore ignore the motion of the rocket, 
as far as the light source is concerned. 

However, the light takes a finite time to reach the receiver, and by then the 
receiver will be moving. We therefore cannot ignore the motion of the rocket 
when dealing with the receiver. The time it takes the light to reach the receiver 
is A/c, at which point the receiver has a speed of v = g(h/c).°> Therefore, by the 
usual classical Doppler effect, the time between the received pulses is* 


Ís 


EFT (14.1) 


ty 


it is taken down causes it to run slow, due to the usual special-relativistic time dilation. But the 
speeding-up effect due to the height can be made arbitrarily large compared with the slowing-down 
effect due to the motion, by simply keeping the watch on the tower for an arbitrarily long time. 


w 


The receiver moves a tiny bit during this time, so the “h” here should really be replaced by a slightly 
smaller distance. But this yields a negligible second-order effect in the small quantity gh/ c?, as you 
can show. To sum up, the displacement of the source, the speed of the source, and the displacement 
of the receiver are all negligible. But the speed of the receiver is quite relevant. 

Quick proof of the classical Doppler effect (for a moving receiver): As seen in frame S, when the 
receiver and a particular pulse meet, the next pulse is a distance cts behind. The receiver and this 
next pulse then travel toward each other at relative speed c + v (as measured by someone in S). 
The time difference between receptions is therefore tr = cts/(c + v). 


Fig. 14.3 


h 


Fig. 14.4 
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So the frequencies, f = 1/t; and fs = 1/ts, are related by 


f= (1+2)a= (14+ 8) 4 (14.2) 


Returning to the clock-on-tower scenario, the Equivalence Principle tells us that 
an observer on the ground must see the clock on the tower running fast, by a factor 
1+ gh/c*. This means that the upper clock really is running fast, compared with 
the lower clock.° That is, 


gh 
At, ={ 1+ > Ato. (14.3) 
C 


A twin from Denver will be older than his twin from Boston when they meet up 
at a family reunion (all other things being equal). 


Greetings! Dear brother from Boulder, 

I hear that you’ve gotten much older. 

And please tell me why 

My lower left thigh 

Hasn’t aged quite as much as my shoulder! 


Note that the gh in Eq. (14.3) is the gravitational potential energy divided 
by m. 

REMARK: You might object to the above derivation, because tr is the time measured by someone 
in the inertial frame S. And since the receiver is eventually moving with respect to S, we should 
multiply the f; in Eq. (14.2) by the usual special-relativistic time-dilation factor, 1/y 1 — (v/c)? 
(because the receiver’s clocks are running slow relative to S, so the frequency measured by the 
receiver is greater than that measured in S). However, this is a second-order effect in the small 
quantity v/c = gh/c*. We already dropped other effects of the same order, so we have no right 
to keep this one. Of course, if the leading effect in our final answer turned out to be second 
order in v/c, then we would know that our answer was garbage. But the leading effect happens 
to be first order, so we can afford to be careless with the second-order effects. & 


After a finite time has passed, the frame S will no longer be of any use to us. 
But we can always pick a new instantaneous rest frame of the rocket, so we can 
repeat the above analysis at any later time. Therefore, the result in Eq. (14.2) 
holds at all times. 

This gravitational time-dilation effect was first measured by R. Pound and 
G. Rebka in 1960. By sending gamma rays both up and down a 22 m tower, 
they were able to measure the redshift (that is, the decrease in frequency) at the 
top. This was a notable feat indeed, considering that they were able to measure 
a frequency shift of gh/c* (which is only a few parts in 10!5) to within 10% 
accuracy. In 1964, R. Pound and J. Snider improved the accuracy to 1%. 


> Unlike the situation where two people fly past each other (as with the usual twin paradox), we can 
say here that what an observer sees is also what actually is. We can say this because everyone here 
is in the same frame. The “turnaround” effect that was present in the twin paradox isn’t present 
now. The two clocks can be slowly moved together without anything exciting or drastic happening 
to their readings. 


14.3 Uniformly accelerating frame 


14.3 Uniformly accelerating frame 


Before reading this section, you should think carefully about the “Break or not 
break” problem in Chapter 11 (Problem 11.26). Don’t look at the solution too 
soon, because chances are you will change your answer after a few more minutes 
of thought. This is a classic problem, so don’t waste it by peeking! 

Technically, the uniformly accelerating frame we’ll construct here has nothing 
to do with GR. We won’t need to leave the realm of Special Relativity for the 
analysis in this section. But the reason we choose to study this special-relativistic 
setup in detail is that it shows many similarities to genuine GR situations, such 
as black holes. 


14.3.1 Uniformly accelerating point particle 


In order to understand a uniformly accelerating frame, we first need to understand 
a uniformly accelerating point particle. In Section 11.9, we briefly discussed the 
motion of a uniformly accelerating particle, that is, one that feels a constant force 
in its instantaneous rest frame. We’ll now take a closer look at such a particle. Let 
the particle’s instantaneous rest frame be S’, and let it start at rest in the inertial 
frame S. Let its mass be m. We know from Section 12.5.3 that the longitudinal 
force is the same in the two frames. Therefore, since it is constant in S’, it is also 
constant in S. Call it f. If we let g = f/m (so g is the proper acceleration felt by 
the particle), then in S we have, using the fact that f is constant, 


dp _ d(myv) gt 
f dt dt d V1 + (gt)? 


where we have set c = 1. As a double-check, this has the correct behavior for 
t — 0 and t — œ. If you want to keep the c’s in, then (gt)? becomes (gt/ c)? to 
make the units right. Having found the speed in S at time ¢, the position in S at 
time f is given by 


i t gtdt 1 ( ) 
= dt = = eS 1+ 2—1). 14.5 
P [ vdt= | eS, 1+ (gt) (14.5) 


For convenience, let P be the point (see Fig. 14.5) 
(Xp, tp) = (—1/g, 9). (14.6) 
Then Eq. (14.5) yields 


1 
œ- x) -P S. (14.7) 
g 


This is the equation for a hyperbola with its center (defined as the intersection of 
the asymptotes) at point P. For a large acceleration g, the point P is very close 
to the particle’s starting point. For a small acceleration, it is far away. 


asymptote 


Fig. 14.5 
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Everything has been fairly normal up to this point, but now the fun begins. 
Consider a point A on the particle’s hyperbolic worldline at time ¢. From 
Eq. (14.5), A has coordinates 


(x4,t,) = GO + (gt)? — 1), r): (14.8) 


The slope of the line PA is therefore 


ty — tp gt 


= (14.9) 


xa= xe SIFE 


Looking at Eq. (14.4), we see that this slope equals the speed v of the particle 
at point A. But we know very well that the speed v is the slope of the particle’s 
instantaneous x’ axis; see Eq. (11.47). Therefore, the line PA and the particle’s x’ 
axis are the same line. This holds for any arbitrary time ¢. So we may say that at 
any point along the particle’s worldline, the line PA is the instantaneous x’ axis 
of the particle. Or, said another way, no matter where the particle is, the event 
at P is simultaneous with an event located at the particle, as measured in the 
instantaneous frame of the particle. In other words, the particle always says that 
P happens “now.”® 

Here is another strange fact. What is the distance from P to 4, as measured 
in an instantaneous rest frame, S’, of the particle? The y factor between frames 
S and S' is, using Eq. (14.4), y = y 1 + (gt)?. The distance between P and A in 
frame S is x4 — xp = y 1 + (gt)? /g. So the distance between P and A in frame 
S’ is (using the Lorentz transformation Ax = y(Ax’ + vAt’), with Ar = 0) 


j ; 1 1 
y= = 7 (x4 — Xp) = Fe . (14.10) 


This has the unexpected property of being independent of t. Therefore, not only 
do we find that P is always simultaneous with the particle, as measured in the 
particle’s instantaneous rest frame; we also find that P is always the same distance 
(namely 1/g) away from the particle, in the particle’s frame. This is rather strange. 
The particle accelerates away from point P, but it doesn’t get farther away from 
it, as measured in its own frame. 


REMARK: We can give a continuity argument that shows that such a point P must exist. If a 
point is close to you, and if you accelerate away from it, then of course you get farther away 
from it. Everyday experience is quite valid here. But if a point is sufficiently far away from 
you, and if you accelerate away from it, then the at?/2 distance you travel away from it can 
easily be compensated by the decrease in distance due to length contraction (brought about by 
your newly acquired velocity). This effect grows with distance, so we simply need to pick the 


é The point P is very much like the event horizon of a black hole. Time seems to stand still at P in 
this treatment. And if we went more deeply into GR, we would find that time seems to stand still 
at the edge of a black hole, too (as viewed by someone far away). 


14.3 Uniformly accelerating frame 


point to be sufficiently far away. What this means is that every time you get out of your chair 
and walk to the door, there are stars very far away behind you that get closer to you as you 
walk away from them (as measured in your instantaneous rest frame). By continuity, then, there 
must exist a point P that remains the same distance from you (in your frame) as you accelerate 
away fromit. & 


14.3.2 Uniformly accelerating frame 


Let’s now put a collection of uniformly accelerating particles together to make a 
uniformly accelerating frame. Our goal will be to create a frame in which the dis- 
tances between particles (as measured in any particle’s instantaneous rest frame) 
remain constant. Why is this our goal? We know from the “Break or not break” 
problem in Chapter 11 that if all the particles accelerate with the same proper 
acceleration, g, then the distances (as measured in a particle’s instantaneous rest 
frame) grow larger. While this is a perfectly possible frame to construct, it is not 
desirable, for the following reason. Einstein’s Equivalence Principle states that 
an accelerating frame is equivalent to a frame sitting on, say, the earth. We can 
therefore study the effects of gravity by studying an accelerating frame. But if we 
want this frame to look anything like the surface of the earth, we certainly can’t 
have distances that change over time. We therefore want to construct a static 
frame, that is, one in which distances do not change (as measured in the frame). 
This will allow us to say that if we enclose the frame in a windowless box, then 
for all a person inside knows, he is standing motionless in a static gravitational 
field (which has a certain definite form, as we shall see). 

Let’s figure out how to construct the frame. We’ll discuss the acceleration 
of just two particles here. Others can be added in a similar manner. In the end, 
the desired frame as a whole is constructed by accelerating each atom in the 
floor of the frame with a specific proper acceleration. From Section 14.3.1, we 
already have a particle A that is “centered” around the point P. (This will be our 
shorthand notation for “traveling along a branch of a hyperbola whose center 
is the point P.”) We claim, for reasons that will become clear, that every other 
particle in the frame should also be “centered” around the same point P. 

Consider another particle, B. Let a and b be the initial distances from 
P to A and B. If both particles are to be centered around P, then their proper 
accelerations must be, from Eq. (14.6), 


ese. and eo, (14.11) 
a b 
Therefore, in order to have all points in the frame be centered around P, we 
simply have to make their proper accelerations inversely proportional to their 
initial distances from P. 
Why do we want every particle to be centered around P? Consider two 
events, E4 and Eg, such that P, E4, and Eg are collinear in Fig. 14.6. From 


Section 14.3.1, we know that the line PE4E z is the x’ axis for both particle 


655 


656 


General Relativity 


A and particle B, at the positions shown. We also know that A is always a distance 
a from P, and B is always a distance b from P (in their frame). Combining these 
facts with the fact that A and B measure their distances along the x’ axis of the 
same frame (at the events shown in the figure), we see that both A and B measure 
the distance between them to be b — a. This is independent of t, so A and B 
measure a constant distance between them. We have therefore constructed our 
desired static frame. This frame is often called a “Rindler space.” If a person 
walks around in the frame, he will think that he lives in a static world where the 
acceleration due to gravity takes the form g(z) x 1/z, where z is the distance to 
a certain magical point which is located at the end of the known “universe.” 

What if a person releases himself from the accelerating frame, so that he 
forever sails through space at constant speed? His view is that he is falling. He 
falls past the “magical point” P in a finite proper time, because the hyperbolic 
worldline of a point infinitesimally close to P is essentially the asymptote of all 
the hyperbolas, and the person’s straight-line worldline intersects this line. But 
his friends who are still in the frame will see him take an infinitely long time to 
get to P, because the x’ axes of points in the frame never quite swing up to the 
asymptote, even after an infinite amount of time. So the “now” line of any point 
in the frame never quite passes through the event where the person crosses the 
asymptote. This is similar to the situation with a black hole. An outside observer 
will see it take an infinitely long time for a falling person to reach the “boundary” 
of a black hole, even though it takes a finite proper time for the person. 

Our analysis shows that A and B feel different proper accelerations, because 
a # b. There is no way to construct a static frame where all points feel the 
same proper acceleration, so it is impossible to mimic a constant gravitational 
field (over a finite distance) by using an accelerating frame. The problems and 
exercises for this chapter offer plenty of opportunity for you to play around with 
the properties of our uniformly accelerating frame. 


14.4 Maximal-proper-time principle 


The maximal-proper-time principle in General Relativity says: Given two events 
in spacetime, a particle under the influence of only gravity takes the path in 
spacetime that maximizes the proper time. For example, if you throw a ball from 
given coordinates (r1, tı), and it lands at given coordinates (r2, t2), then the claim 
is that the ball takes the path that maximizes its proper time.’ 

This is clear for a freely moving ball in outer space, far from any massive 
objects. The ball travels at constant speed from one point to another, and we 


7 The principle is actually the “stationary-proper-time principle,” because as with the Lagrangian 
formalism in Chapter 6, any type of stationary point (a maximum, minimum, or saddle point) is 
allowed. But although we were very careful about stating things properly in Chapter 6, we’ll be a 
little sloppy here and just use the word “maximum,” because that’s what it will generally turn out 
to be in the situations we’ll look at. However, see Problem 14.8. 


14.4 Maximal-proper-time principle 


know that this constant-speed motion is the motion with the maximal proper 
time. This is true because a ball, A, moving at constant speed sees the clock 
on any other ball, B, running slow due to the special-relativistic time dilation, 
if there is a relative speed between them. (We’re assuming that B’s nonuni- 
form velocity is caused by a nongravitational force acting on it.) B therefore 
shows a shorter elapsed time. This argument doesn’t work the other way around, 
because B isn’t in an inertial frame and therefore can’t use the special-relativistic 
time-dilation result. 


Consistency with Newtonian physics 

The maximal-proper-time principle sounds like a plausible idea, but we already 
know from Chapter 6 that the path an object takes is the one that yields a stationary 
value of the classical action, f (T — V). We must therefore demonstrate that the 
maximal-proper-time principle reduces to the stationary-action principle, in the 
limit of small velocities. If this weren’t the case, then we’d have to throw out the 
maximal-proper-time principle. 

Consider a ball thrown vertically on the earth. Assume that the initial and final 
coordinates are fixed to be (1, ¢,) and (2, t2). Our plan will be to assume that 
the maximal-proper-time principle holds, and to then show that this leads to the 
stationary-action principle. Before being quantitative, let’s get a qualitative idea 
about what’s going on with the ball. There are two competing effects, as far as 
maximizing the proper time goes. On one hand, the ball wants to climb very 
high, because its clock will run faster there, due to the GR time dilation. But on 
the other hand, if it climbs very high, then it must move very fast to get there 
(because the total time, t — t1, is fixed), and this will make its clock run slow, 
due to the SR time dilation. So there’s a tradeoff. Let’s now look quantitatively 
at the implications of this tradeoff. The goal is to maximize 


l) 
r= f dt. (14.12) 
ti 


Due to the motion of the ball, we have the usual time dilation, dt = 
y 1 — v? /c? dt. But due to the height of the ball, we also have the gravitational 
time dilation, dt = (1 + gy/c?) dt. Combining these effects gives 


v? gy 
de= 1-5 (1+ 5) a. (14.13) 


8 This result is technically incorrect, because the two effects are intertwined in a somewhat more 
complicated manner (see Exercise 14.20). But it’s valid up to first order in v? / c? and gy/ c?, which 
is all we’re concerned with here. 
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Using the Taylor expansion for /1 — €, and dropping terms of order 1/c* and 
smaller, we see that we want to maximize 


h to v2 2y h vw gy 
apes i= (1 Sax f 1-2 lee (14.14 
Í T Í ( 5) = c2 i 2c? 7 c? ( ) 


The “1” term gives a constant, so maximizing this integral is equivalent to 
minimizing 


h2 2 h 2 

3 v gy mv 
—-=)d= — — dt, 14.15 
mc [ (3 =) f ( 5 mg) ( ) 


which is the classical action, as desired. For a one-dimensional gravitational 
problem such as this one, the action is always a (global) minimum, and the 
proper time is always a (global) maximum, as you can show by considering the 
second-order change in the action (see Exercise 14.23, which is the same as 
Exercise 6.32). 

In retrospect, it isn’t surprising that the kinetic energy here works out. The 
factor of 1/2 comes about in exactly the same way as in the derivation in 
Eq. (12.9), where we showed that the relativistic form of energy reduces to 
the familiar Newtonian expression. As far as the potential energy goes, the gy 
here can be traced to the acceleration times a certain time, where this time is pro- 
portional to the distance; see the paragraph before Eq. (14.1). This yields (when 
multiplied by mc?) the usual force times distance expression for the potential. 

The maximal-proper-time principle here is equivalent to the statement that 
the action for the Lagrangian we introduced near the end of Section 12.1 is 
S= — mc f dt. In both cases we want to extremize the integral of the proper 
time. In Section 12.1 we dealt only with a free particle, so the treatment here 
is a little more general because it includes the effect of gravity. But note that 
although Eq. (14.15) has the obvious interpretation of including the gravitational 
potential, the starting point in Eq. (14.12) makes no mention of a gravitational 
force. In the present treatment, gravity shouldn’t be thought of as a force, but 
instead as something that affects the proper time.” 


14.5 Twin paradox revisited 


Let’s take another look at the standard twin paradox, this time from the perspective 
of General Relativity. We should emphasize that GR is by no means necessary 
for an understanding of the original formulation of the paradox (the first scenario 
below). We were able to solve it in Section | 1.3.2, after all. The present discussion 
is given simply to show that the answer to an alternative formulation (the second 


° Foran interesting article on how the classical action, along with a few ingredients from differential 
geometry, can lead to General Relativity, see Rindler (1994). 


14.5 Twin paradox revisited 


scenario below) is consistent with what we’ve learned about GR. Consider the 
two following twin-paradox setups. 


e Twin A floats freely in outer space. Twin B flies past A in a spaceship, with speed vg (see 
Fig. 14.7). At the instant they are next to each other, they both set their clocks to zero. At 
this same instant, B turns on the reverse thrusters of his spaceship and decelerates with 
proper deceleration g. B eventually reaches a farthest point from A and then accelerates 
back toward A, finally passing him with speed vp again. When they are next to each 
other, they compare the readings on their clocks. Which twin is younger? 

e Twin B stands on the earth. Twin 4 is thrown upward with speed vo (let’s say he is 
fired from a cannon in a hole in the ground; see Fig. 14.8). At the instant they are next 
to each other, they both set their clocks to zero. A rises up and then falls back down, 
finally passing B with speed vp again. When they are next to each other, they compare 
the readings on their clocks. Which twin is younger? 


The first scenario is easily solved using Special Relativity. Since A is in an 
inertial frame, he can apply the results of Special Relativity. In particular, he sees 
B’s clock run slow, due to the usual special-relativistic time dilation. Therefore, 
B ends up younger at the end. B cannot use the reverse reasoning, because she is 
not in an inertial frame. 

What about the second scenario? The key point to realize is that the Equiva- 
lence Principle says that these two scenarios are exactly the same (ignoring the 
nonuniformity of the earth’s gravitational force), as far as the twins are con- 
cerned. Twin B has no way of knowing whether she is in a spaceship accelerating 
at g or on the surface of the earth. And 4 has no way of knowing whether he is 
floating freely in outer space or in free-fall in a gravitational field.'° We therefore 
conclude that B must be younger in the second scenario, too. 

At first glance, this seems incorrect, because in the second scenario, B is 
sitting motionless while A is the one who is moving. It seems that B should see 
A’s clock running slow, due to the usual special-relativistic time dilation, and 
hence A should be younger. This reasoning is incorrect because it fails to take 
into account the gravitational time dilation. The fact of the matter is that A is 
higher in the gravitational field, and therefore his clock runs faster. This effect 
does indeed win out over the special-relativistic time dilation, and A ends up 
older. You can explicitly show this in Problem 14.11. 

The reasoning in this section yields another way to conclude that the Equiva- 
lence Principle implies that higher clocks must run faster (in one way or another). 
The Equivalence Principle implies that A must be older in the second scenario, 
which means that there must be some height effect that makes A’s clock run 


10 As mentioned in Section 14.1, this fact is made possible by the equivalence of inertial and gravi- 
tational mass. Were it not for this, different parts of A’s body would want to accelerate at different 
rates in the gravitational field in the second scenario. This would certainly clue him in to the fact 
that he wasn’t floating freely in space. 
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fast (fast enough to win out over the special-relativistic time dilation). But it 
takes some more work to show that the factor is actually 1 + gh/c?. Note that 
the fact that A is older is consistent with the maximal-proper-time principle. In 
both scenarios, A is under the influence of only gravity (zero gravity in the first 
scenario), whereas B feels a normal force from either the spaceship’s floor or 
the ground. 


14.6 Problems 


Section 14.2: Time dilation 


14.1. 


14.2. 


14.3. 


14.4. 


Airplane’s speed « 

A plane flies at constant height A. What should its speed be so that an 
observer on the ground sees the plane’s clock tick at the same rate as a 
ground clock? (Assume v < c.) 


Clock on a tower xx 


A clock starts on the ground and then moves up a tower at constant 
speed v. It sits on top of the tower for a time 7 and then descends at 
constant speed v. If the tower has height A, how long should the clock 
sit at the top so that it comes back showing the same time as a clock 
that remained on the ground? (Assume v < c.) 


Circular motion +x 


B moves at speed v (with v < c) ina circle of radius r around A, far 
from any masses. By what fraction does B’s clock run slower than A’s? 
Calculate this in three ways. Work in: 


(a) A’s frame. 

(b) The frame whose origin is B and whose axes remain parallel to 
an inertial set of axes. 

(c) The rotating frame that is centered at A and rotates with the same 
frequency as B. 


More circular motion «« 


A and B move at speed v (v < c) ina circle of radius r, at diametrically 
opposite points, far from any masses. They both see their clocks ticking 
at the same rate. Show this in three ways. Work in: 


(a) The lab frame (the inertial frame whose origin is the center of 
the circle). 

(b) The frame whose origin is B and whose axes remain parallel to 
an inertial set of axes. 

(c) The rotating frame that is centered at the origin and rotates with 
the same frequency as A and B. 


14.6 Problems 


Section 14.3: Uniformly accelerating frame 


14.5. 


14.6. 


14.7. 


Accelerator’s point of view xxx 


Arocket starts at rest relative to a planet, a distance £ away. It accelerates 
toward the planet with proper acceleration g. Let t and t be the readings 
on the rocket’s and planet’s clocks, respectively. 


(a) Show that when the astronaut’s clock reads t, he observes the 
rocket-planet distance x (as measured in his instantaneous 
inertial frame) to be given by 


l+gé 


= (14.16) 


l+gx= 
(b) Show that when the astronaut’s clock reads t, he observes the 
time ¢ on the planet’s clock to be given by 


gt = (1 + g£) tanh(gr). (14.17) 


The results from Exercises 14.16 and 14.20 will be useful here. 


Getting way ahead «««« 


A rocket with proper length L accelerates from rest, with proper accel- 
eration g (where gL < c°). Clocks are located at the front and back 
of the rocket. If we look at this setup in the rocket frame, then the GR 
time-dilation effect tells us that the times on the two clocks are related 
by tf = (1+ gL/c’)ty. Therefore, if we look at things in the ground 
frame, then the times on the two clocks are related by 


L\ L 
r=) -5 (14.18) 
Cc C 


where the last term comes from the SR “rear clock ahead” result. Derive 
the above relation by working entirely in the ground frame. !! 


Lv/ revisited xx 


You stand at rest relative to a rocket that has synchronized clocks at its 
ends. It is then arranged for you and the rocket to move with relative 
speed v. A reasonable question to ask is: As viewed by you, what is the 
difference in readings on the clocks located at the ends of the rocket? 


" You might find this relation surprising, because it implies that the front clock will eventually be an 


arbitrarily large time ahead of the back clock, in the ground frame. (The subtractive Lu/ c? term is 


bounded by L/c and will therefore eventually become negligible compared with the additive, and 


unbounded, (gL/ 2th term.) But both clocks seem to be doing basically the same thing relative 
to the ground frame, so how can they end up differing by so much? Your job is to find out. 
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It turns out that this question cannot be answered without further 
information on how you and the rocket got to be moving with rela- 
tive speed v. There are two basic ways this relative speed can come 
about. The rocket can accelerate while you sit there, or you can accel- 
erate while the rocket sits there. Using the results from Problems 14.5 
and 14.6, explain what the answers to the above question are in these 
two cases. 


Section 14.4: Maximal-proper-time principle 


14.8. Circling the earth «+ 


Clock A sits at rest on the earth, and clock B circles the earth in an 
orbit that skims along the ground. Both A and B are essentially at the 
same radius, so the GR time-dilation effect yields no difference in their 
times. But B is moving relative to A, so A sees B running slow, due 
to the usual SR time-dilation effect. The orbiting clock, B, therefore 
shows a smaller elapsed proper time each time it passes A. In other 
words, the clock under the influence of only gravity (B) does not show 
the maximal proper time, in conflict with what we have been calling 
the maximal-proper-time principle. Explain. 


Section 14.5: Twin paradox revisited 


14.9. Twin paradox + 


A spaceship travels at speed v (v < c) to a distant star. Upon reach- 
ing the star, it decelerates and then accelerates back up to speed v 
in the opposite direction (uniformly, and in a short time compared 
with the total journey time). By what fraction does the traveler age 
less than her twin on the earth? (Ignore the gravity from the earth.) 
Work in: 


(a) The earth frame. 
(b) The spaceship frame. 


14.10. Twin paradox again +» 


(a) Answer part (b) of the previous problem, except now let the 
spaceship turn around by moving in a small semicircle while 
maintaining speed v. 

(b) Answer part (b) of the previous problem, except now let the 
spaceship turn around by moving in an arbitrary manner. The 
only constraints are that the turnaround is done quickly (com- 
pared with the total journey time) and that it is contained in a 
small region of space (compared with the earth—star distance). 


14.11. 


14.7 Exercises 


Twin paradox times x+» 


(a) In the first scenario in Section 14.5, calculate the ratio of B’s 
elapsed time to A’s, in terms of v9 and g. Work in A’s frame. 
Assume vo < c, and drop high-order terms. 

(b) Do the same for the second scenario in Section 14.5. Do this 
from scratch using the time dilations, and then check that 
your answer agrees (within the accuracy of the calculations) 
with part (a), as the Equivalence Principle demands. Work in 
B’s frame. 


14.7 Exercises 


Section 14.2: Time dilation 


14.12. 


14.13. 


14.14. 


14.15. 


Driving on a hill + 


You drive up and down a hill of height / at constant speed. The hill is 
in the shape of an isosceles triangle with altitude A. What should your 
speed be so that you age the same amount as someone standing at the 
base of the hill? (Assume v « c.) 


Lv/c? and gh/c? x 


The familiar special-relativistic “rear clock ahead” result, Lu/ c, 
looks rather similar to the gh/c? term in the GR time-dilation result, 
Eq. (14.3). Imagine standing on the ground near the front of a train of 
length L. For small v, devise a thought experiment that explains how 
the Lv/c? result follows from the gh/c* result. 


Both points of view «+s 


A and B are initially a distance L apart, at rest with respect to each 
other. At a given time, B accelerates toward A with constant proper 
acceleration a. Assume aL < c’. 


(a) Working in A’s frame, calculate the difference in readings on A’s 
and B’s clocks when B reaches A. 
(b) Do the same by working in B’s frame, and show that the 
result agrees (neglecting higher-order effects) with the result 
from part (a). 
Opposite circular motion «*«« 
A and B move at speed v (v < c) in opposite directions around a circle 
of radius r (so they pass each other after each half-revolution), far from 


any masses. They both see their two clocks ticking, on average, at the 
same rate. That is, if they compare their clocks each time they pass 
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each other, both clocks show the same time elapsed. Demonstrate this 
in three ways. Work in: 


(a) The lab frame (the inertial frame whose origin is the center of 
the circle). 

(b) The frame whose origin is B and whose axes remain parallel to 
an inertial set of axes. 

(c) The rotating frame that is centered at the origin and rotates along 
with B. This part is very tricky; the solution is given in Cranor 
et al. (2000), but don’t peek too soon. 


Section 14.3: Uniformly accelerating frame 


14.16. 


14.17. 


14.18. 


14.19. 


Various quantities « 


A particle starts at rest and accelerates with proper acceleration g. Let t 
be the time on the particle’s clock. Starting with the v from Eq. (14.4), 
use time dilation to show that the time ¢ in the original inertial frame, 
the speed of the particle, and the associated y factor are given by (with 
c= 1) 


gt =sinh(gt), v = tanh(gt), y =cosh(gt). (14.19) 


Using rapidity + 
Another way to derive the v in Eq. (14.4) is to use the v = tanh (grt) 
rapidity result (where t is the particle’s proper time) from Section 11.9. 
Use time dilation to show that this implies gt = sinh(gt), which then 
implies Eq. (14.4). 


Speed in an accelerating frame « 


In the setup in Problem 14.5, use Eq. (14.16) to find the speed of the 
planet in the rocket’s accelerating frame, |dx/dt|, as a function of T. 
What is the maximum value of this speed, in terms of g and the initial 
distance £? 


Redshift, blueshift ++ 


We found in Section 14.2 that a clock at the rear of a rocket sees a 
clock at the front run fast by a factor 1 + gh/c*. However, we ignored 
higher-order effects in 1/c?, so for all we know, we found only the 
first term in the Taylor series, and the factor is actually something like 
ehe or perhaps 1 + In(1 + gh/c?). 

(a) For the uniformly accelerating frame in Section 14.3.2, show 
that the factor is in fact exactly 1 + g,h/c?, where g, is the 
acceleration of the rear of the rocket. Show this by lining up a 
series of clocks and looking at the successive factors between 
them. (Hint: Take the log of the product of the factors.) The 


14.20. 


14.21. 


14.22. 


14.7 Exercises 


overall factor takes a very nice form when written in terms of 
the a and b in Fig. 14.6; what is it? 

(b) By the same reasoning as in part (a), it follows that the front 
clock sees the rear clock run slow by a factor 1 — gfh/c¢?, 
where gr is the acceleration of the front. Show explicitly that 
(1 + gh/c*)(1 — gph/c”) = 1, as must be the case, because 
a clock can’t gain time with respect to itself (and because the 
two clocks are at rest in the frame of the rocket; this reasoning 
wouldn’t apply to two clocks flying past each other). 


Gravity and speed combined +x 


In the spirit of Problem 11.25 (“Acceleration and redshift”), use a 
Minkowski diagram to solve this problem. A rocket accelerates with 
proper acceleration g toward a planet. As measured in the instantaneous 
inertial frame of the rocket, the planet is a distance x away and moves 
at speed v. Everything is in one dimension here. As measured in the 
accelerating frame of the rocket, show that the planet’s clock runs at a 
rate (with c = 1), 


dtp = dt (1 + gx)V 1 — v? , (14.20) 
and show that the planet’s speed is 
V = (1 + gx)v. (14.21) 


Note that if we combine these two results and eliminate v, and if we 
then invoke the Equivalence Principle, we arrive at the result that a 
clock moving at speed V at height / in a gravitational field is seen by 
someone on the ground to run at a rate (putting the c’s back in), 


h\? v2 
g = a (14.22) 
GC GC 


Length contraction +s 


A pencil points directly at you. From some distance away from it, 
you start at rest and accelerate toward it with acceleration a. After 
a time ¢, the pencil is length contracted in your frame by a factor 
V1 —v2/c2 © 1— (at)? /2c?, for small t (more precisely, for at < c). 
But the only way the pencil can shrink is for the back to move faster 
than the front, as measured by you in your accelerating frame. Using 
Eq. (14.21), show that the contraction factor works out as it should, for 
small ¢. 


Accelerating stick’s length +» 


Consider a uniformly accelerating frame consisting of a stick, the ends 
of which have worldlines given by the curves in Fig. 14.6 (so the stick 
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has proper length b — a). At time ¢ in the lab frame, we know from 
Eqs. (14.5) and (14.6) that a point that undergoes acceleration g has 
position y 1 + (gt)?/g relative to the point P in Fig. 14.6. An observer 
in the original inertial frame sees the stick being length contracted by 
different factors along its length, because different points move with 
different speeds (at a given time in the original frame). Show, by doing 
the appropriate integral, that the inertial observer concludes that the 
stick always has proper length b — a. 


Section 14.4: Maximal-proper-time principle 


14.23. Maximum proper time x 


Show that the stationary value of the gravitational action in Eq. (14.15) 
is always a minimum (which means that the proper time is always a 
maximum). Do this by considering a function, y(t) = yo(t) + £, 
where yo is the path that yields the stationary value, and £ is a small 
variation. 


Section 14.5: Twin paradox revisited 


14.24. Symmetric twin nonparadox +» 


14.8 


14.1. 


14.2. 


Two twins travel toward each other, both at speed v (v « c) with 
respect to an inertial observer. They synchronize their clocks when they 
pass each other. They travel to stars located at positions +£, and then 
decelerate and accelerate back up to speed v in the opposite direction 
(uniformly, and in a short time compared with the total journey time). 
In the frame of the inertial observer, it is clear (by symmetry) that both 
twins age the same amount by the time they pass each other again. 
Reproduce this result by working in the frame of one of the twins. 


Solutions 


Airplane’s speed 


An observer on the ground sees the plane’s clock run slow by a factor y 1 — v? /c? 
due to SR time dilation. But he also sees it run fast by a factor (1 + gh/c”) due to GR 
time dilation. We therefore want the product of these two factors to equal 1. Using 
the standard Taylor-series approximation for slow speeds in the first factor, we find 


je ek a ja 8 ao (1) i (14.23) 
2c? e) 22 e éj | ` 


Neglecting the small 1/c* term, and canceling the 1’s, we obtain v = \/2gh. Inter- 


estingly, ,/2gh is also the answer to a standard question from Newtonian physics, 
namely, how fast do you need to throw a ball straight up so that it reaches a height A? 


Clock on a tower 


The SR time-dilation factor is /1 — v2/c? ~ 1 — v?/2c?. The clock therefore loses 
a fraction v? /2c? of the time elapsed during its motion up and down the tower. The 
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upward journey takes a time h/v, and likewise for the downward trip, so the time loss 


due to the SR effect is 
vw \ (2h vh 
(3) (=) = eee 


Our goal is to balance this time loss with the time gain due to the GR time-dilation 
effect. If the clock sits on top of the tower for a time T, then the time gain is (gh/c?)T. 

But we must not forget the increase in time due to the height gained while the clock 
is in motion. During its motion, the clock’s average height is h/2. The total time in 
motion is 2h/v, so the GR time gain while the clock is moving is (we can use the 
average height here, because the GR effect is linear in h) 


h/2 2h h? 
ES ) ( - ) = E , (14.25) 


Setting the total change in the clock’s time equal to zero gives 


h 
=0 v+gT+ © =0 T=? 


g v 
(14.26) 


Remarks: Note that we must have v > gh in order for a positive solution for T to 
exist. Ifv < gh, then the SR effect is too small to cancel out the GR effect, even if 
the clock spends no time sitting at the top. Ifv = /gh, then T = 0, and we essentially 
have the same situation as in Exercise 14.12. Note also that if v is very large compared 
with ,/gh (but still small compared with c, so that our y1 — v2/c? ~ 1 — v? /2c? 
approximation is valid), then T ~ v/g, which is independent of h. & 


14.3. Circular motion 


(a) In A’s frame, there is only the SR time-dilation effect. A sees B move at speed 
v, so B’s clock runs slow by a factor of y 1 — v? /c?. And since v < c, we may 
use the Taylor series to approximate this as 1 — v? /2¢?. 

(b) In this frame, there are both SR and GR time-dilation effects. A moves at speed 
v with respect to B in this frame, so the SR effect is that A’s clock runs slow by a 
factor \/1 — v2/c2 ~ 1—v*/2c?. ButB undergoes an acceleration of a = v? /r 
toward Á, so for all B knows, he lives in a world where the acceleration due 
to gravity is v/r. A is “higher” in the gravitational field, so the GR effect is 
that A’s clock runs fast by a factor 1 + ar/c? = 1+ 7/c?. Multiplying the SR 
and GR effects together, we find (to lowest order) that A’s clock runs fast by a 
factor 1 + v*/2c*. This means (to lowest order) that B’s clock runs slow by a 
factor 1 — v*/2c*, in agreement with the answer to part (a). 

(c) In this frame, there is no relative motion between A and B, so there is only 
the GR time-dilation effect. The gravitational field (that is, the centripetal 
acceleration) at a distance x from the center is gy = xw”. Imagine lining up a 
series of clocks along a radius, with separation dx. Then the GR time-dilation 
result tells us that each clock loses a fraction gy dx/ c? = xw* dx/c? of time 
relative to the next clock inward. Integrating these fractions from x = 0 to 
x = r shows that B’s clock loses a fraction r?w?/2c? = v?/2c?, compared 
with A’s clock. This agrees with the results in parts (a) and (b). 


Remark: Ifyou want to imagine an analogous line of clocks in part (b), you 
can imagine lining a stick with them, with B at one end of the stick. The sensible 
thing to do with the stick is to have it be motionless with respect to B’s frame 
(as the line of clocks was in part (c)). But this means that the stick doesn’t rotate 
with respect to the inertial axes, because B’s axes don’t rotate. So all of the 
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clocks feel the same acceleration rw? = v*/r, in contrast with the decreasing 


accelerations in part (c). The integral of all the fractions therefore doesn’t pick 
up the factor of 2 as it did in part (c), and so we simply end up with v? /c?. The 
SR effect then arises in part (b) because A is flying past the clock on the other 
end of the stick, whereas in part (c) A was at rest. & 


14.4. More circular motion 


(a) In the lab frame, the situation is symmetric with respect to A and B. Therefore, 


if A and B are decelerated in a symmetric manner and brought together, their 
clocks must read the same time. 

Assume (in the interest of obtaining a contradiction) that A sees B’s clock 
run slow. Then after an arbitrarily long time, A will see B’s clock an arbitrarily 
large time behind his. Now bring A and B to a stop. There is no possible way 
that the stopping motion can make B’s clock gain an arbitrarily large amount of 
time, as seen by A. This is true because everything takes place in a finite region 
of space, so there is an upper bound on the GR time-dilation effect (because 
it behaves like gh/c*, and h is bounded). Therefore, A will end up seeing 
B’s clock reading less. This contradicts the result of the previous paragraph. 
Likewise for the case where A sees B’s clock run fast. 


REMARK: Note how this problem differs from the problem where A and B move 
with equal speeds directly away from each other, and then reverse directions 
and head back to meet up again. For this new “linear” problem, the symmetry 
reasoning in the first paragraph above still holds, so A and B will indeed have 
the same clock readings when they meet up again. But the reasoning in the 
second paragraph does not hold (it had better not, because each person does 
not see the other person’s clock running at the same rate). The error is that 
in this linear scenario, the experiment is not contained in a small region of 
space, so the turning-around effects of order gh/c* become arbitrarily large 
as the time of travel becomes arbitrarily large, because grows with time 
(see Problem 14.9). & 


(b) In this frame, there are both SR and GR time-dilation effects. A moves at speed 


2v with respect to B in this frame (we don’t need to use the relativistic velocity- 
addition formula, because v < c), so the SR effect is that A’s clock runs slow 
by a factor y1 — (2v)?/c? ~ 1 — 2u7/c?. But B undergoes an acceleration 
ofa= v2 /r toward A, so the GR effect is that A’s clock runs fast by a factor 
1 + a(2r)/c? = 1+ 2v7/c? (because they are separated by a distance 2r). 
Multiplying the SR and GR effects together, we find (to lowest order) that the 
two clocks run at the same rate. 


(c) In this frame, there is no relative motion between A and B, so there is only (at 


most) the GR effect. But A and B are both at the same gravitational potential, 
because they are at the same radius. Therefore, they both see the clocks running 
at the same rate. If you want, you can line up a series of clocks along the 
diameter between A and B, as we did along a radius in part (c) of Problem 14.3. 
The clocks will gain time as you march in toward the center, and then lose 
the same amount of time as you march back out to the diametrically opposite 
point. 


14.5. Accelerator’s point of view 


(a) First soLUTION: Equation (14.5) says that the distance traveled by the rocket 


(as measured in the original inertial frame), as a function of the time in the 


inertial frame, is 
1 
ja (i + (gf)? - 1). (14.27) 
g€ 
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An inertial observer on the planet therefore measures the rocket—planet 


distance to be 
1 
x= J1+ (gt)? 1). (14.28) 
g ( 


The rocket observer sees this length contracted by a factor y. Using the 
result of Exercise 14.16, we have y = y 1 + (gt)? = cosh(gt). So the rocket- 
planet distance, as measured in the instantaneous inertial frame of the 
rocket, is 


1 
pa algan TE 1+ 


cosh(gt) a cosh(gt) ` 


(14.29) 


SECOND SOLUTION: Equation (14.21) gives the speed of the planet in the accel- 
erating frame of the rocket. Using the results of Exercise 14.16 to write v in 
terms of t, we have (with c = 1) 


dx 
ae = —(1+ gx) tanh(gt). (14.30) 


Separating variables and integrating gives 


d 
| x = f amigo ar In(1 + gx) = —In(cosh(gt)) + C 
1+ gx 


A 
cosh(gt) ` 


l+gx= (14.31) 


Since the initial condition is x = £ when t = 0, we must have A = 1 + g£, 


which gives Eq. (14.16), as desired. 
(b) Equation (14.20) says that the planet’s clock runs fast (or slow) according to 


dt = dt (1+ gx)V/1— v2. (14.32) 


The results of Exercise 14.16 yield /1 — v? = 1/cosh(gt). Combining this 
with the result for 1 + gx above, and integrating, gives 


(l+g)dt 
fe=f : gt=(1+gé)tanh(gt). (14.33) 
cosh“ (gt) 


14.6. Getting way ahead 


The explanation of why the two clocks show different times in the ground frame is the 
following. The rocket becomes increasingly length contracted in the ground frame, 
which means that the front end isn’t traveling quite as fast as the back end. Therefore, 
the time-dilation factor for the front clock isn’t as large as that for the back clock. So 
the front clock loses less time relative to the ground, and hence ends up ahead of the 
back clock. Of course, it’s not at all obvious that everything works out quantitatively 
and that the front clock eventually ends up an arbitrarily large time ahead of the back 
clock. In fact, it’s quite surprising that this is the case, because the above difference 
in speeds is very small. But let’s now show that the above explanation does indeed 
account for the difference in the clock readings. 

Let the back of the rocket be located at position x. Then the front is located at 
position x + Lv 1 — v2, due to the length contraction. Taking the time derivatives of 
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the two positions, we see that the speeds of the back and front are (with v = dx/dt)'* 
Up =v, and ve = v(1—Lyv). (14.34) 


If we assume that the back is the part that accelerates at g (it doesn’t matter which 
point we pick, to leading order), then we can just invoke the result in Eq. (14.4), 


t 
nagn (14.35) 


IFEN 


where ¢ is the time in the ground frame. Having written down v, we must now find 
the y factors associated with the speeds of the front and back. The y factor associated 
with the speed of the back, namely v, is 


1 
= ———_ = ,/1 + (g)?. 14.36 
Yb Vio y (gt) ( ) 


The y factor associated with the speed of the front, vp = v(1 — Lyd), is a bit 
more complicated. We must first calculate ù. From Eq. (14.35), we obtain ò = 
g/(14+ g707)3/?, which gives 


nent Ripe a= (: gL | (14.37) 

V1 + (gt)? lpr 
The y factor (or rather 1/y, which is what we’ll be concerned with) associated with 
this speed is given as follows. In the first line below, we ignore the higher-order (gL)? 
term, because it is really (gZ/c*)*, and we are assuming that gL/c? is small. And in 
obtaining the third line, we use the Taylor-series approximation, /1 — € ~ 1 — €/2. 


1 ge 2gL 
zyl- ~]! zz \1 272 
ve 1+ g¢t I+ g°t 


1 28L 
= 1 
1427 L +g? 
1 geL 
Se it za) (14.38) 
Fg + 


We can now calculate the time that each clock shows, at time ż in the ground frame. 
The time on the back clock changes according to dt) = dt /Y», so Eq. (14.36) gives 


(14.39) 


, i dt 
b=] =. 
0 y1 +g 


The integral of dx/v 1 +x? is sinh”! x (to derive this, make the substitution x = 
sinh 0). Letting x = gt, this gives 


gt = sinh™! (gt). (14.40) 
The time on the front clock changes according to dte = dt/y¢, so Eq. (14.38) gives 


etrLdt 


t dt t 
tf =} H / : 
0 J/l+¢22 o (1 +g?) 


(14.41) 


12 Since these two speeds aren’t equal, there is an ambiguity concerning which speed we should use 
in the length-contraction factor, y 1 — v2. Equivalently, the rocket doesn’t have a single inertial 
frame that describes all of it. But you can show that any differences arising from this ambiguity 
are of higher order in gl / c? than we need to be concerned with. 
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The integral of x? dx/(1 + x?)3/” is sinh! x — x//1 + x2 (to derive this, make the 
substitution x = sinh 6, and use f d0/ cosh? = tanh 0). Letting x = gt, this gives 


te = sinh“! (ef) + (eL) ( sinh! (ep - —&— |. 14.42 
gte = sinh‘ (gt) en (sin (gt) VETZ ( ) 


Using Eqs. (14.35) and (14.40), we may rewrite this as 
gtt = gto (l + gL) — gLv. (14.43) 


Dividing by g, and putting the c’s back in to make the units correct, we finally have 


gL Lv 
tf =t(1l+5 3 (14.44) 
c c 


as we wanted to show. If we look at this calculation from the reverse point of view, 
we see that by using only Special Relativity concepts, we have demonstrated that 
someone at the back of a rocket sees a clock at the front running fast by a factor 
(q +2L/ c?). There are, however, much easier ways of deriving this, as we saw in 
Section 14.2 and in Problem 11.25 (“Acceleration and redshift”). 


14.7. Lv/c? revisited 


Consider first the case where the rocket accelerates while you sit there. Problem 
14.6 is exactly relevant here, and it tells us that in your frame the clock readings are 
related by 


LI\ L 
n= (1+%) ei (14.45) 


c 


You will eventually see the front clock an arbitrarily large time ahead of the back 
clock. But note that for small times (before things become relativistic), the standard 
Newtonian result, v © gt, is valid, so we have 


tp © (« + =) = = = ty. (14.46) 
c c 
So in this setup where the rocket is the one that accelerates, both clocks show essen- 
tially the same time near the start. This makes sense; both clocks have essentially the 
same speed at the beginning, so to lowest order their y factors are the same, so the 
clocks run at the same rate. But eventually the front clock will get ahead of the back 
clock. 

Now consider the case where you accelerate while the rocket sits there. Problem 
14.5 is relevant here, if we let the rocket in that problem now become you, and if we 
let two planets a distance L apart become the two ends of the rocket. The times you 
observe on the front and back clocks on the rocket are then, using Eq. (14.33) and 
assuming that you are accelerating toward the rocket, 


gte = (1 + g£) tanh(gt), and għ = (1 +e +L)) tanh(gT). (14.47) 


But from Exercise 14.16, we know that your speed relative to the rocket is v = 
tanh(gt). Equation (14.47) therefore gives t = tp + Lv, or t = te + Lu/ c? with the 
c?. So in this case we arrive at the standard Lv/c? “rear clock ahead” result. 

The point here is that in this second case, the clocks are synchronized in the rocket 
frame, and this is the assumption that went into our derivation of the Lv/c? result in 
Chapter 11. In the first case above where the rocket accelerates, the clocks are not 
synchronized in the rocket frame (except right at the start), so it isn’t surprising that 
we don’t obtain the Lv/c? result. 
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14.8. Circling the earth 


This is one setup where we really need to use the correct term, stationary-proper-time 
principle. It turns out that B’s path yields a saddle point for the proper time. The value 
at this saddle point is less than A’s proper time, but this is irrelevant, because we care 
only about local stationary points, not about global extrema. 

To show that we have a saddle point, we must show that (1) the differences in the 
proper time are second order, and (2) there exist nearby paths that give both a larger 
and a smaller proper time. The first of these is true because the first-order differences 
vanish due to the fact that the path satisfies the Euler-Lagrange equations for the 
Lagrangian in Eq. (14.15), because the path is indeed a physical one. 

The second is true because the proper time can be made smaller by having B speed 
up and slow down. This will cause a net increase in the time-dilation effect as viewed 
by A, thereby yielding a smaller proper time.'? And the proper time can be made 
larger by having B take a nearby path that doesn’t quite form a great circle on the 
earth. (Imagine the curve traced out by a rubber band that has just begun to slip away 
from a great-circle position.) This path is shorter, so B won’t have to travel as fast to 
get back in a given time, so the time-dilation effect will be smaller as viewed by A, 
thereby yielding a larger proper time. 


14.9. Twin paradox 


(a) In the earth frame, the spaceship travels at speed v for essentially the whole 
time. Therefore, the traveler ages less by a fraction y 1 — v?/c? ~ 1 —v?/2c?. 
The fractional loss of time is thus v*/2c*. The time-dilation effect is different 
during the short turning-around period, but this is negligible. 

(b) Let the distance to the star be £, as measured in the frame of the earth (but the 
difference in lengths in the two frames is negligible in this problem), and let the 
turnaround take a time T. Then the given information says that T < (2¢)/v. 

During the constant-speed part of the trip, the traveler sees the earth clock 
running slow by a fraction /1 — v2/c2 ~ 1 — v?/2c*. The time for this 
constant-speed part is (essentially) 2€/v, so the earth clock loses a time of 
(v2 /2c?)(2€/v) = ve/c?. 

However, during the turnaround time, the spaceship is accelerating toward 
the earth, so the traveler sees the earth clock running fast, due to the GR time 
dilation. The magnitude of the acceleration is a = 2u/T, because the spaceship 
goes from velocity v to —v in time T. The earth clock therefore runs fast by a 
factor 1+ a€/c? = 1+ 2vé/Tc*. This happens for a time T, so the earth clock 
gains a time of (2v0/Te?)T = 2ve/c?. 

Combining the results of the previous two paragraphs, we see that the 
earth clock gains a time of Qve/c? — ve/ce = ve/c*. This is a fraction 
(vl/c?)/(2€/v) = v? /2c* of the total time, in agreement with part (a). 


14.10. Twin paradox again 


(a) The only difference between this problem and the previous one is the nature 
of the turnaround, so all we need to show here is that the traveler still sees the 
earth clock gain a time of 2v€/c? during the turnaround. 

Let the radius of the semicircle be r. Then the magnitude of the acceleration 
is a = v? /r. Let 8 be the angle shown in Fig. 14.9. Fora given 0, the earth is at 


'3 This is true for the same reason that a person who travels at constant speed in a straight line 
between two points shows a larger proper time than a second person who speeds up and slows 
down. This follows directly from SR time dilation, as viewed by the first person. If you want, 
you can imagine unrolling B’s circular orbit into a straight line, and then invoke the result just 
mentioned. As far as SR time-dilation effects from clock A’s point of view go, it doesn’t matter if 
the circle is unrolled into a straight line. 


14.8 Solutions 


a height of essentially £ cos @ in the gravitational field felt by the spaceship. The 
fractional time that the earth gains while the traveler is at an angle 0 is therefore 
ah/ e = w? /r)(€ cos 0) Je. Integrating this over the time of the turnaround, 
and using dt = r d0 /v, we see that the earth gains the desired time of 


7/2 /a2€cos0\ /rd0 Quel 
At= z m a: (14.48) 
—1/2 


(b) Let the acceleration vector at a given instant be a, and let £ be the vector from 
the spaceship to the earth. Note that since the turnaround is done in a small 
region of space, £ is essentially constant here. The earth is at a height of â - £ 
in the gravitational field felt by the spaceship; the dot product just gives the 
cosine term in the above solution in part (a). The fractional time gain, ah/c’, 
is therefore equal to |a|(a- £) Je =a- tje. Integrating this over the time of 
the turnaround, we see that the earth gains a time of 


ta.l l tf 
At= — dt = =: adt 
AR 


i i 


A 
= z 0e) 


£- (2vf) 
2 


C 


z2 (14.49) 


e’ 


as we wanted to show. The point here is that no matter how complicated the 
motion is during the turnaround, the total effect is simply to change the velocity 
from v outward to v inward. 


14.11. Twin paradox times 


(a) As viewed by A, the relation between the twins’ times is 


dtg = V1 — v? dta. (14.50) 


Assuming vo < c, we may say that v(t4) is essentially equal to vo — gt4, so 
the out and back parts of the trip each take a time of essentially vo/g in A’s 
frame. The total elapsed time on B’s clock is therefore 


vo/g 
T= f asx f y1 — v dty 
0 


vo/g 2 
zJ (: = 5) dt, 
P 2 
vo/g 1 
2f (1 = 5 (0-20?) dt 
A 2 


Q 


Q 


1 3 vo/g 
=2(1+ Ewen ) 
6g 0 
2 3 
Sea os (14.51) 
g 3gc 


where we have put the c’s back in to make the units right. The ratio of B’s 
elapsed time to A’s is therefore 


2 
TB Tg U9 


(14.52) 
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(b) As viewed by B, the relation between the twins’ times is given by Eq. (14.13), 


v? gy 
dt, =,/1- = (1 + =) dip. (14.53) 


Assuming vo < c, we may say that v(tg) is essentially equal to vo — gtg, 
and A’s height is essentially equal to votg — gt /2. The up and down parts 
of the trip each take a time of essentially vo/g in B’s frame. Therefore, the 
total elapsed time on A’s clock is (using the approximation in Eq. (14.14), and 
dropping the c’s) 


vo/g v2 
ty = f du =2 | (1-5 +e) dw 
0 


vo/g 1 . 
x2 1 (1 z% gt)? + g(vot e/2)) dt. 
0 


1 wt? gh 
2(t4 t) 4 
( ae e( 2 a 


vo/g 


0 


2 3 
TETN (14.54) 
g 3g 
where we have put the c’s back in to make the units right. We therefore have 
Ta Ta vw Tg v 
zx x14 %1 F 14.55 
Tg  2v/g 62? 7” T4 6c? (14:33) 


up to higher-order corrections. This agrees with the result found in part (a), as 
the Equivalence Principle requires. 


Appendix A Useful formulas 


A.1 Taylor series 


The general form of a Taylor series is 


f" (Xo) 2 gf OO) 23 


So +x) =f Go) +f Cox +, 3! 


+, (A.1) 


which can be verified by taking derivatives and then setting x = 0. For example, 
taking the first derivative and then setting x = 0 yields f” (xọ) on the left, and 
also f’(xo) on the right, because the first term is a constant and gives zero, the 
second term gives f” (xg), and all the rest of the terms give zero once we set x = 0 
because they all have at least one power of x left in them. Likewise, if we take 
the second derivative of each side and then set x = 0, we obtain f” (xo) on both 
sides. And so on for all derivatives. Therefore, since the two functions on each 
side of the above equation are equal at x = 0 and also have their nth derivatives 
equal at x = 0 for all n, they must in fact be the same function (assuming that 
they’re nicely behaved functions, which we generally assume in physics). 

Some specific Taylor series that come up often are listed below. They are 
all derivable via Eq. (A.1), but sometimes there are quicker ways of obtaining 
them. For example, Eq. (A.3) is most easily obtained by taking the derivative of 
Eq. (A.2), which itself is simply the sum of a geometric series. 


1o 2. 33 
{2.7 +H +>: (A.2) 
=x 
1 = 2 3 
— = 14+ 2x4 3x° 4+ 40° 4+ --- (A.3) 
d -= x)? 
an 
Ind = = A4 
n( x) x 5 3 (A.4) 
*=] rh AAS 
e= tirg a t (A.5) 


675 


676 Appendix A 


3 5 
sinx=x- + (A.6) 

x x4 
COS el gy e (A.7) 

x x 
JItx=1+5-54-- (A.8) 
Dh epee (A.9) 
I+}x 2 8 
+x" =1+nx+ (3) $ (3) per (A.10) 


A.2 Nice formulas 


The first formula here can be quickly proved by showing that the Taylor series 
for both sides are equal. 


el? 


= cos + isind (A.11) 


1, lane 
cos@ = 5c" +e), sind = ae se) (A.12) 
l 


0 1 0 0 1— 0 
ia, a o eea (A.13) 
2 2 2 2 
ia a /1 — cos = S _ sin 0 (A.14) 
2 1 + cos sin 0 1 + cosé 


sin20 =2sin@cos@, cos20 = cos?ð — sin? (A.15) 
sin(a + £) = sina cos B + cos « sin B (A.16) 
cos(a + B) = cos g cos  — sina sin £ (A.17) 

tana + tan £ 
tan(a + 6) = ——————_ (A.18) 
1 — tan g tan £ 


Useful formulas 


1 1 
cosh x = ze te), sinhx = xe —e*) 
cosh?x — sinh?x = 1 
dx 


d d 
—coshx = sinhx, — sinh x = cosh x 
dx 


A.3 Integrals 


finxae=xinx—s 


2 2 
[rinxac= Z nx- 
2 4 


[enze 1) 


dx -1 -1 
—— =tan x or —cot x 
1 +x 


f dx 1 Xe 
= -ln 
x1 +x) 2 1 +x? 
d 1 1 
i} sr a In ae or tanh~!x (x? < 1) 
1-x* 2 l-x 


d 1 1 
/ <= D in( r) or coth™!x (x? > 1) 


(vi +2 + In (x + 1+2)) 


l 2 
oe -56 +x)VI-x 


(A.19) 


(A.20) 


(A.21) 


(A.22) 


(A.23) 


(A.24) 


(A.25) 


(A.26) 


(A.27) 


(A.28) 


(A.29) 


(A.30) 


(A.31) 
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d 
/ T =In(x+Vx2+1) or sinh! x (A.32) 
d 
l Z - = ln (x +vx? — 1) or cosh™!x (A.33) 
fee 
dx i -1 
E) =sec x or —csc x (A.34) 
d l+vl 2 
= =—-—In t TX or — csch™!x (A.35) 
xv 1 + x2 x 
d 1+VJ1—x2 
f = a n( is Ž ) or — sech™!x (A.36) 
xV1—x X. 
f dx = (- + =) (437) 
cosx cos x 
ae (=) (A.38) 
sin x sin x 


d 
I 2 1 (22) (A.39) 
sin x COS X sin x 


Appendix B Multivariable, vector calculus 


This appendix gives a brief review of multivariable calculus, also known as 
vector calculus. The first three topics below (dot product, cross product, partial 
derivatives) are used often in this book, so if you haven’t seen them before, you 
should read these parts carefully. But the last three topics (gradient, divergence, 
curl) are used only occasionally, so it isn’t crucial that you master these (for this 
book, at least). For all of the topics, it’s possible to go much deeper into them, 
but I’ll present just the basics here. If you want further material, any book on 
multivariable calculus should do the trick. 


B.1 Dot product 


The dot product, or scalar product, between two vectors is defined to be 
a- b = ayby + ayby + azbz. (B.1) 


The dot product takes two vectors and produces a scalar, which is just a number. 
You can quickly use Eq. (B.1) to show that the dot product is commutative and 
distributive. That is, a - b = b-a, and (a + b) - c = a - c + b - c. Note that the 
dot product of a vector with itself is a - a = a2 + a, + a2, which is just its length 
squared, |a|? = a’. 

Taking the sum of the products of the corresponding components of two vec- 
tors, as we did in Eq. (B.1), might seem like a silly and arbitrary thing to do. Why 
don’t we instead look at the sum of the cubes of the products of the corresponding 
components? The reason is that the dot product as we’ve defined it has many nice 
properties, the most useful of which is that it can be written as 


a - b = |a||b| cos = ab cos9, (B.2) 


where 0 is the angle between the two vectors. We can demonstrate this as follows. 
Consider the dot product of the vector c = a + b with itself, which is simply the 
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square of the length of c. Using the commutative and distributive properties, we 
have 


c? = (a+b). (a+b) =a-a+2a-b+b-b 
ay +2a- b+ b?. (B.3) 
But from the law of cosines applied to the triangle in Fig. B.1, we have 
C=a +b? —2abcosy =a’ + b? + 2abcosd, (B.4) 


because y = x — 0. Comparing this with Eq. (B.3) yields a - b = ab cos 0, as 
desired. The angle between two vectors is therefore given by 


ga RF 
|a||b| 


Anice corollary of this result is that if the dot product of two vectors is zero, then 
cos = 0, which means that the vectors are perpendicular. If someone gives 
you the vectors (1, —2,3) and (4,5, 2), it’s by no means obvious that they’re 
perpendicular. But you know from Eq. (B.5) that they indeed are. 

Geometrically, the dot product a - b = abcos@ equals the length of a times 
the component of b along a. Or vice versa, depending on which length you want 
to group the cos 0 factor with. If we rotate our coordinate system, the dot product 
of two vectors remains the same, because it depends only on their lengths and the 
angle between them, and these are unaffected by the rotation. In other words, the 
dot product is a scalar. This certainly isn’t obvious from looking at the original 
definition in Eq. (B.1), because the coordinates get all messed up during the 
rotation. 


(B.5) 


Example (Distance on the earth): Given the longitude angle ¢ and the polar 
angle 6 (measured down from the north pole, so 6 is 90° minus the latitude angle) 
for two points on the earth, what is the distance between them, as measured along the 
earth? 


Solution: Our goal is to find the angle, 6, between the radii vectors to the two 
points, because the desired distance is then RB. This would be a tricky problem if 
we didn’t have the dot product at our disposal, but things are easy if we make use of 
Eq. (B.5) to say that cos £ = r1 - r2/R?. The problem then reduces to finding r1 - r2. 
The Cartesian components of these vectors are 

rı = R(sin 01 cos 1, sin 0; sing), cos 01), 


(B.6) 
r2 = R(sin 62 cos %2, sin 62 sin ġ2, cos 62). 


The desired distance is then RB = R cos7! (r1 -r2 /R?), where 
ri- r2/R? = sin 6] sin 62 (cos dj cos @2 + sin gd] sin d2) + cos 01 cos 62 


= sin 04 sin 62 cos(@2 — 1) + cos 8; cos 62. (B.7) 
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We can check some limits: If ¢; = $2, then this gives B = 02 — 0, (or 81 — 62, 
depending on which is larger), as expected. And if 6; = 62 = 90°, then it gives 
B = $2 — | (or $1 — $2), as expected. 


B.2 Cross product 


The cross product, or vector product, between two vectors is defined via a 
determinant to be 


kK y 2 
axb=/a ay a, 
by by bz 

= X(aybz 7 azby) + Î(azbx — aybz) + Z(axby = ayby). (B.8) 


The cross product takes two vectors and produces another vector. As with the 
dot product, you can show that the cross product is distributive. However, it is 
anti-commutative (that is, a x b = —b x a), which is evident from Eq. (B.8). So 
the cross product of any vector with itself is zero. 

As with the dot product, the reason why we study this particular combination 
of components is that it has many nice properties, the most useful of which are 
that its direction is perpendicular to both a and b (in the orientation determined 
by the right-hand rule; see below), and its magnitude is 


|a x b| = |a||b| sin ð = ab sin 8. (B.9) 


Let’s first show that a x b is indeed perpendicular to both a and b. We’ll do this 
by making use of the above handy fact that if the dot product of two vectors is 
zero, then the vectors are perpendicular. We have 


a- (a x b) = ax(aybz = azby) F ay(azbx — axybz) + az (axby _ dyby) = 0, 
(B.10) 


as desired. Likewise for b. There is still an ambiguity, however, because although 
we know that a x b points along the direction perpendicular to the plane spanned 
by a and b, there are two possible directions along this line. Assuming that our 
coordinate system has been chosen to be “right-handed” (that is, if you point the 
fingers of your right hand in the direction of X and then swing them to y, your 
thumb points along z), then the direction of a x b is determined by the right- 
hand rule. That is, if you point the fingers of your right hand in the direction 
of a and then swing them to b (through the angle that is less than 180°), your 
thumb points along a x b. This is consistent with the fact that Eq. (B.8) gives 
(1,0,0) x (0, 1,0) = (0,0, 1), ork x y =z. 
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Let’s now demonstrate the |a x b| = ab sin 0 result, which is equivalent to 
ja x b|? = a?b? (1 — cos*), which is equivalent to |a x b|? = a?b? — (a- b)?. 
Written in terms of the components, this last equation is 


(aybz = azby)* + (azbx — abe + (axby = Ayby)* 
= (a, + a, + a2) (by + by + b2) — (acby + ayby +azbz). (B11) 


If you stare at this long enough, you’ll see that it’s true. The three different 
types of terms agree on both sides. For example, both sides have an a2b? term, a 


y 
—2aybyazb; term, and no ab? term. 


B.3 Partial derivatives 


When dealing with a function of only one variable, there is no ambiguity when 
taking a derivative. However, with a function of many variables, we have to 
specify which one of the variables we’re differentiating with respect to. If we 
have a function of, say, two variables, f (x, y), and if we want to take the derivative 
with respect to x, then we use the terminology “partial derivative with respect 
to x,” with the notation 3f /ðx. To evaluate this partial derivative, we don’t have 
to do anything fancy. We just take a regular derivative with respect to x, while 
assuming that y is constant. For example, if f(x,y) = x — 2y + x’y°, then 
af /ax = 14 2xy>, and ðf /dy = —2 + 3x?y?. If we plot the value of f as the 
height above the x-y plane, then when we take the partial derivative with respect 
to x, we’re simply finding the slope of the curve formed by the intersection of 
the function’s surface with the vertical plane parallel to the x axis and passing 
through the point in question. Similarly for y. 

If we want to maximize or minimize a function of more than one variable, 
we need to set all the partial derivatives equal to zero. This is true because if the 
partial derivative with respect to a certain variable isn’t zero, then the slope of 
the function in that direction is nonzero, which means that the point can’t be a 
local maximum or minimum. This argument is the same as in the single-variable 
case. It’s just that now we can make the argument for each of the variables 
independently. 

Demanding that all the partial derivatives equal zero doesn’t actually guarantee 
having a local maximum or minimum. The point in question might be a saddle 
point, which means that the function is a local maximum in some directions 
and a local minimum in others (so in two dimensions the function looks like a 
saddle; hence the name). For example, consider the function of two variables, 
f (x,y) = 3x? — y*. Then the point (0,0) is a local minimum in the x direction 
and a local maximum in the y direction. 

For two variables, if the second partial derivatives have opposite signs at a 
point where the first partial derivatives are zero, then we have a saddle point, 
because there is an upward parabola in one direction and a downward parabola 
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in the other. However, we might have a saddle point even if the second partial 
derivatives have the same sign. For example, if we make the change of variables 
x=w-—zandy =w+zinf(x,y) = 3x? — y’, then it becomes f(w,z) = 
2w? + 2z? — 8wz. If we didn’t already know from the f(x,y) form that (0, 0) 
is a saddle point, we could deduce this in the following way. Imagine that z is 
given, and then solve for the w that makes f(w,z) = 0. The result of solving 
this quadratic equation is that w takes the form of some multiple of z. That is, 
w = Az, where A happens to be 2 + V3 in the present case. Since there are 
two (real) solutions for A here, there are two lines, namely w = (2 + V3)z, 
emanating from (0, 0) for which f(w,z) = 0. So (0, 0) can’t be a local maximum 
or minimum. It must therefore be a saddle point. ! 

In general, there are two real solutions for A if and only if the discriminant 
of the above quadratic equation is positive. For an arbitrary function of two 
variables, its shape in the vicinity of a point (which we will take to be (0,0) 
after a shift in the coordinates) where both first partial derivatives are zero can 
be approximated by the Taylor series to second order (which you can verify by 
taking various derivatives, just as you would do with a function of one variable), 


. 7 LOT). > 1/8ƏF\ə af 
fnsy=c+5(T4)a +3 (3) (i)er. (B.12) 


where it is understood that the partial derivatives here are evaluated at (0, 0). The 
condition that the discriminant is positive is therefore 


f N? (PF) (PF 
Eo >° B 


If this is true, then the point is a saddle point. If the left side is less than zero, then 
the point is a local maximum or minimum, because there are no nearby points 
for which f (x, y) = C; they are all either greater than C or less than C. If the left 
side equals zero, then the function looks like a trough, at least in the vicinity of 
the point in question (assuming that there is at least some quadratic dependence 
in the function). 


' This is true because along these two lines, the first partial derivatives aren’t zero, which means that 
the plane of the function’s surface is tilted there. So the function is positive on one side of each 
line and negative on the other, which is exactly what happens with a saddle. There is, however, 
the special case where the discriminant of the quadratic equation is zero (as with, for example, 
fœ, y) = œ- y)?), in which case there is only one solution for A and thus only one line for 
which the function is zero. In this case, the function looks like a (possibly upside down) trough. 
It is zero (or some given constant) along the line, at least to second order. And it curves up (or 
down) quadratically as you move away from the line (assuming that there is at least some quadratic 
dependence in the function). 


683 


684 


Appendix B 


B.4 Gradient 


Given a function f (x, y, Z) (we’ll work mainly with three variables from now on), 
we can form the vector whose components are the partial derivatives of f, namely 
(of /dx, df /dyv, Of /dz). This vector is called the gradient. If we define the dif- 
ferential vector operator V (usually called “del”) to be V = (0/dx, 0/dy, 0/02), 
then the gradient is simply 


Vf = (Žž. A z), (B.14) 
dx dy dz 

The gradient takes a function and produces a vector. For example, if f (x,y,z) = 

xy? — yz, then Vf = (y2, 2xy — 23, —3yz?). We call V an “operator” because it 

needs to operate on a function to produce the gradient vector. 

What is the physical meaning of the gradient? The gradient gives the direction 
you should march in if you want f to increase at the greatest rate. The reason for 
this is the following. Consider the value ofa function f (x, y, z) at a certain point, 
and then look at the value at a nearby point, displaced by the vector (dx, dy, dz). 
What (approximately, to first order) is the change in f between the two points? 
Well, as you march a distance dx in the x direction, the (first-order) change in f is 
(of /dx) dx, by the definition of the partial derivative (just as in the one-variable 
case). If you then march a distance dy in the y direction, the function changes by 
an additional amount of (df /dy) dy. And likewise the z direction gives a change 
of (df /dz) dz. Adding up these three changes in f , we see that the total first-order 
change in f is? 


ə 3 
df = f dx + of dy + f dz. (B.15) 
ox oy Oz 


Using the dot product, this can be written concisely as 


df = (2, of , =) - (dx, dy, dz) = Vf - dr. (B.16) 
ox dy dz 
We can now make use of Eq. (B.2) to say that the change in f is df = 
|Vf||dr| cos, where @ is the angle between Vf and dr. The meaning of this 
is the following. Consider a given point (x,y,z). The Vf gradient vector at this 
point is a particular vector. Imagine marching along little dr vectors in various 
directions and seeing how much f changes (assume that all the dr vectors have 
the same length, to be consistent). What direction should you march in, in order 
to have f change the most? Or not change at all? In the df = |V/f||dr| cos 


2 There is technically an ambiguity in where each of these partial derivative is evaluated, because the 
three little steps you took started at three different points. But these ambiguities involve first-order 
corrections to the partial derivatives, and since these partial derivatives are already being multiplied 
by the first-order dx, dy, and dz terms in Eq. (B.15), any ambiguities will result in second-order 
effects and can therefore be ignored. 
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expression, |Vf | has a definite value at the point in question, and we’re assuming 
that we pick |dr| to always be the same, so it comes down to the cos @ factor. 
Therefore, if you march directly along the Vf gradient vector at the point, then 
f increases the most. And if you march in any direction in the plane perpendic- 
ular to Vf, then f doesn’t change at all (to first order). And if you march in the 
direction anti-parallel to Vf, then f decreases the most. 

This is more easily visualized in the case of a function of only two variables, 
f(x,y), because then we can picture the value of f as being the height in the 
z direction. The graph of f is just a surface of mountains and valleys above 
(or below) the x-y plane. The gradient Vf = (df /dx, df /dy) then gives the 
direction of steepest ascent. That is, f changes at the greatest rate if you march in 
the direction of Vf in the x-y plane; the slope of the surface in this direction in the 
x-y plane is larger than in any other direction. And if you march in either direction 
along the line in the x-y plane that is perpendicular to Vf, then f has zero change. 
By continuing to march along the direction perpendicular to Vf wherever you 
are, you will form a curve in the x-y plane for which all the points give the same 
value of f. In other words, if you slice the surface of f with a horizontal plane 
whose height equals this particular value of f , and if you look at the intersection 
of this plane with the surface, then the projection of this intersection onto the x-y 
plane is the above curve you formed in the x-y plane. 


B.5 Divergence 

Consider a vector whose components are functions of the coordinates. For exam- 
ple, let F = (Fy, Fy, Fz) = (xz, 2y? + xyz, x? + 23). Then the divergence of F 
is defined to be the dot product of the V operator with F, that is, 


OF, OFy OF 
ee ee (B.17) 


The divergence takes a vector and produces a number. The above F has a 
divergence of (3z) + (4y + xz) + (327). 

What is the physical meaning of the divergence? Consider an infinitesimal 
box with sides of length dx, dy, and dz. Then the divergence measures the net 
flux of the vector field out of the box, divided by the volume of the box. (The flux 
through a surface is defined to be the integral of the area times the component 
of the vector perpendicular to the surface.) For example, if a certain vector field 
gives the velocity at each point in a fluid flow, and if the divergence is nonzero, 
then there must be a source (or sink) that creates (or destroys) fluid, because 
otherwise whatever fluid goes into the little box would have to come out of it 
somewhere, yielding zero net flux. 

Let’s see why the divergence equals the flux per volume. Consider the “left” 
dy x dz face of the little box. The amount of flux from the vector field into the 
box through this face equals the area dy dz times the Fy component. (The F, and 
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F, components are parallel to this face and therefore contribute nothing to the 
flux through it.) The amount of flux out of the “right” dy x dz face equals the area 
dy dz times the value of the Fy component there. But this value equals (to first 
order) the original Fy plus (3Fx/3x) dx, by the definition of the partial derivative. 
The Fy part of this cancels the inward flux from the left face, so the net flux out of 
the little box through these two faces is (F, %/3x) dx) dy dz. Similar calculations 
work for the other two pairs of parallel faces, so the total flux out of the box is 


OF, OF, OF. 

Net flux = 24  bedyade: (B.18) 
ox oy az 

Therefore, as promised, the net flux per volume equals the divergence. The 

integrated form of this result is the divergence theorem, or Gauss theorem, which 

takes the form, 


[vy rav= f Faa (B.19) 
V S 


The integral on the left runs over a given volume, and the integral on the right runs 
over the surface that encloses this volume. The vector dA has a magnitude equal 
to an infinitesimal piece of area of S and a direction defined to be perpendicular 
to the plane containing this piece (with the positive direction being outward from 
the volume). Dotting dA with F has the effect of picking out only the component 
of F that is perpendicular to the piece (which is what is relevant in calculating 
the flux). 

We’ll skip the fine details here, but the basic idea of the proof of Eq. (B.19) 
is to divide the volume up into many infinitesimal cubes and to look at the total 
flux through all the cubes. From Eq. (B.18), the integral of the divergence over 
one little cube (which is essentially the divergence times the volume, because the 
divergence is essentially constant over the tiny volume) equals the flux through 
that cube. The integral of the divergence over the whole volume therefore equals 
the sum of the fluxes through all the cubes. But all the faces of the cubes that 
lie in the interior of the volume are shared by two cubes, so the flux through 
these faces cancels when taking the sum (because the flux through a given face 
is counted positive for one cube and negative for the other). So we are left with 
only the flux through the faces on the boundary of the volume (because these 
faces appear only once in the total integral). We are therefore left with the flux 
through the surface S, which is what appears on the right-hand side of Eq. (B.19). 


Example (Flux through a sphere): Verify the divergence theorem in the case 
where the surface is a sphere of radius R centered at the origin, and F = (x,y,z). 


Solution: On the left-hand side of Eq. (B.19), the divergence of (x, y, z) is 1+1+1=3, 
so the integral of this over the volume of the sphere is simply 34n R? Ja) = 4r RÈ. 
On the right-hand side, the unit vector perpendicular to the surface is (x,y,z)/R, so 
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dA = (dA)(x,y,z)/R. The dot product of this with (x,y,z) is (dA)(x* + y? + 2”)/ 
R = (dA)R. The integral of this over the surface of the sphere is just (4n R?)R = 47 RÈ. 
The two sides are therefore equal, as we wanted to show. 


B.6 Curl 


Consider a vector whose components are functions of the coordinates. For exam- 
ple, let F = (Fy, Fy, Fz) = (3xz, xyz, x +z). Then the curl of F is defined to be 
the cross product of the V operator with F, that is, 


Xx y 2 
VxF=| ə/ðəx d/dy d/dz 
ir po A 


_ (oF, ƏFy OF, OF, OF, ƏFy 
~ \ ay dz” daz ax” Ox dy J` 


(B.20) 


The curl takes a vector and produces another vector. The above F has a curl of 
(=xy, 3x — 1, 2xyz). 

What is the physical meaning of the curl? Consider the infinitesimal rectangle 
shown in Fig. B.2. This rectangle lies in the x-y plane, so for the moment we will 
suppress the z component of all coordinates, for convenience. It turns out that 
the z component of the curl equals the counterclockwise integral f F - dr around 
the closed loop, divided by the area of the loop (similar statements hold for the y 
and x components and the associated little rectangles in the x-z and y-z planes). 
Let’s see why this is true. 

The total counterclockwise integral of F - dr around the loop entails moving 
to the right on segment 1 and to the left on 3, and up on segment 2 and down on 
4. On segments 1 and 3, both dy and dz are zero, so only the Fy dx term survives 
in the dot product F - dr. Likewise, F, dy is the only nonzero term on segments 
2 and 4. If we pair up the two pairs of parallel sides, the total counterclockwise 
integral is 


X+dX 
[rae (Fe(x, Y) — Fea, Y + dY)) dx 
X 


Y+dY 
+ Í (F(X + dX, y) — F(X,y)) dy. (B.21) 


Let’s approximate the differences in these parentheses. To first order, we have 


y? EED) x dy Fee) 


Fy, Y + dY)—Fy(x,Y) ed 
dy lay) ay læ» 


(B.22) 


(X+dX, Y+d 
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The first approximation here is valid due to the definition of the partial derivative. 
The second approximation (replacing x with X) is valid because our rectangle is 
small enough so that x is essentially equal to X. Any error in this approximation 
is second-order small, because we already have a factor of dY in our term. A 
similar treatment works for the F, terms, so Eq. (B.21) becomes 


Y+dY E X+dX 
[rae f Py gia H a-f ay dx. 
Y ax (X.Y) X dy (X,Y) 


(B.23) 


The integrands are constants, so we can quickly perform the integrals to obtain 


OF, F(x, 
[P-ar=avar( yy) _ 2 <P) (B.24) 
wy) 


ox oy 


As promised, the z component of the curl equals the counterclockwise integral 
JF - dr around the closed loop, divided by the area of the loop. The preceding 
analysis also works, of course, for little rectangles in the x-z and y-z planes. We 
therefore obtain the two other components of the curl. 

The generalization of the above result to tilted and wavy surfaces is Stokes’ 
theorem, which states that 


[Oxia = | Far (B.25) 
S GC 


The integral on the left runs over a given surface, and the integral on the right 
runs over the curve that is the boundary of this surface. The vector dA has a 
magnitude equal to an infinitesimal piece of area of S and a direction defined to 
be perpendicular to the plane containing this piece (with its orientation defined 
via the orientation along C and the right-hand rule). We’ll skip the details here, 
but the basic idea of the proof is similar to the idea behind the divergence theorem 
above, except with certain words replaced by other words (“volume” becomes 
“surface,” etc.). We’ll divide the surface up 
into many infinitesimal rectangles and look at the total integral around all the 
rectangles. For simplicity, let’s just deal with a flat surface in the x-y plane. 

From above, the integral of the z component of the curl over one little rectangle 
equals the counterclockwise integral of F - dr around the edges of the rectangle. 
The integral of the z component of the curl over the whole surface therefore 
equals the sum of the integrals around all the rectangles. But all the edges of the 
rectangles that lie in the interior of the surface are shared by two rectangles, so 
the integral along these edges cancels when taking the sum (because the integral 
along a given edge is counted positive for one rectangle and negative for the 
other). So we are left with only the integral along the edges on the boundary of 
the surface (because these edges appear only once in the total integral). We are 
therefore left with the integral along the curve C, which is what appears on the 
right-hand side of Eq. (B.25). 


> 


and “surface” becomes “curve, 
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Note that if the surface is closed, so that it has no boundary (in other words, 
there is no curve C), then the right-hand side of Eq. (B.25) is zero, and hence 
the left-hand side is also. Such is the case with, for example, a sphere. Having 
no boundary means that if you’re a little bug walking on the surface, you can’t 
walk off it. 


Example (Integral around a circle): Verify Stokes’ theorem in the case where 
the curve is a circle of radius R in the x-y plane, centered at the origin, and 
F = (-y,x,0). 


Solution: On the left-hand side of Eq. (B.25), the curl of (—y, x, 0) is (0, 0, 2). The 
dA vector also points in the z direction, so the dot product is just 2(dA). The integral 
of this over the interior of the circle is simply 2( R2). On the right-hand side, the dot 
product equals —y dx + x dy. Integrating this along the circumference of the circle is 
most easily done in polar coordinates. With x = Rcos@ and y = Rsin#@, we have 
dx = —Rsin6@ d0 and dy = Rcosé dé. So —y dx + x dy = R? d0. The integral of 
this as 8 ranges from 0 to 27 is R? (27x). The two sides are therefore equal, as desired. 


There are some handy facts that deal with combinations of the gradient, diver- 
gence, and curl. One is that the curl of a gradient is identically zero. That is, 
V x Vf = 0. You can verify this explicitly by using the definitions of the curl 
and the gradient, and also the fact that partial differentiation is commutative 
(that is, 02f/dx dy = 3?f /ðy dx). Alternatively, you can let F = Vf in Stokes’ 
theorem, which gives [5(V x Vf)-dA = fe Vf - dr. The right-hand side of 
this is simply the net change in the function f around the closed curve C, which 
is always zero. The integrand on the left-hand side must therefore identically 
be zero. 

Also, the divergence ofa curl is identically zero. That is, V-(V x F) = 0. Again, 
you can verify this explicitly by using the definitions of the divergence and the 
curl, and also the fact that partial differentiation is commutative. Alternatively, 
you can combine Gauss’ theorem and Stokes’ theorem to write f yV- (Vx 
F)dV = fo F- dr. The right-hand side is always zero, because the boundary 
surface S of any given volume V is closed, so there is no curve C. The integrand 
on the left-hand side must therefore identically be zero. 
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In nonrelativistic mechanics,' the equations F = ma and F = dp/dt say exactly 
the same thing ifm is constant. But ifm is not constant, then dp/dt = d(mv)/dt = 
ma + (dm/dt)v, which doesn’t equal ma. So if a system has a changing mass, 
should we use F = ma or F = dp/dt? Which equation correctly describes the 
physics? The answer to this depends on what you label as the system to which 
you associate the quantities m, p, and a. You can generally do a problem using 
either F = ma or F = dp/dt, but you must be very careful about how you label 
things and how you treat them. The subtleties are best understood through two 
examples. 


Example 1 (Sand dropping into a cart): Consider a cart into which sand is 
dropped vertically at a rate dm/dt = o. With what force must you push on the cart 
to keep it moving horizontally at a constant speed v? (This was the setup in the first 
example in Section 5.8.) 


First solution: Let m(t) be the mass of the cart-plus-sand-inside system (which 
we’ ll just call the “cart”). If we use F = ma (where a is the acceleration of the cart, 
which is zero), then we obtain F = 0, which is incorrect. The correct expression to 
use is F = dp/dt. This gives 


re preii (C.1) 

= — = ma + —v = ov. ; 
dt dt 

This makes sense, because your force is what increases the momentum of the cart, 

and this momentum increases simply because the mass of the cart increases. 


Second solution: Itis possible to solve this problem by using F = ma if we let 
our system be a small piece of mass that is being added to the cart. Your force is what 
accelerates this mass from rest to speed v. Consider a mass Am that falls into the cart 
during a time At. Imagine that it falls into the cart in one lump at the start of the At, 
and then accelerates up to speed v during the time Aż. (It accelerates due to friction. 
But if you want, you can eliminate the cart as the intermediate object and just push 
directly on the mass.) This process repeats during each successive At interval. We 


' We won’t bother with relativity in this appendix, because nonrelativistic mechanics contains all 
the critical aspects we want to address. 
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can use F = ma here because the mass of the small piece is constant. So we have 
F = ma = Am(v/At). Writing this as (Am/ Atv gives the ov result we found above. 


Third solution: As in the second solution, let’s imagine the process occurring in 
discrete steps, but now with the cart as the system. Assume that a mass Am falls into 
the cart and instantaneously decreases its speed (by conservation of momentum) to 
v’ = mv/(m+ Am), whichis Av = v—v! = vAm/(m+ Am) less than the original v. 
Assume that you then push on the cart for a time Aż (during which the mass remains 
constant at m + Am, so F = ma is the relevant expression) and bring it back up to 
speed v. The acceleration is a = Av/At = v(Am/At)/(m+ Am) = ov/(m+ Am), 
so your force is 


F = (m+ Ama = (m+ Am) (=) = ov. (C.2) 


Example 2 (Sand leaking from a cart): Consider a cart that leaks sand out 
of the bottom at a rate dm/dt = o. If you apply a force F to the cart, what is its 
acceleration? 


Solution: Let m(t) be the mass of the cart-plus-sand-inside system (the “cart’”). In 
this example, the correct expression to use is F = ma, so the acceleration is 


a= —. (C.3) 
m 

Note that since m decreases with time, a increases with time. We used F = ma here 
because at any instant, the mass m is what is being accelerated by the force F. As 
above, you can imagine the process occurring in discrete steps: You push on the mass 
for a short period of time, then a little piece instantaneously leaks out; then you push 
again on the new (smaller) mass, then another little piece leaks out; and so on. In this 
discretized scenario, it is clear that F = ma is the appropriate formula, because it 
holds for each step in the process. The only ambiguity is whether to use m or m + dm 
at a certain time, but this yields a negligible error. 


REMARKS: Itis still true that F = dp/dt in this second example, provided that you let 
F be total force, and let p be the total momentum. In this example, F is the only force. 
However, the total momentum consists of both the sand in the cart and the sand that has 
leaked out and is falling through the air.? A common mistake is to use F = dp/dt, with p 
being only the cart’s momentum. The leaked sand still has momentum. 

There is a simple example that demonstrates why F = dp/dt doesn’t work when p 
refers only to the cart. Choose F = 0, so that the cart moves with constant speed v. Cut 
the cart in half, and label the back part as the “leaked sand” and the front part as the 
“cart.” If you want the cart’s p to have dp/dt = F = 0, then the cart’s speed must double 
if its mass gets cut in half. But this is nonsense. Both halves simply continue to move 
at the same rate. & 


2 If there were air resistance, we would have to worry about its effect on the falling sand if we wanted 
to use F = dp/dt to solve the problem, where p is the total momentum. This is clearly not the best 
way to do the problem. If complicated things happen with the sand in the air, it would be foolish 
to consider this part of the sand when we don’t have to. 
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To sum up, F = dp/dt is always valid, provided that you use the total force 
and total momentum of a given system of particles. This approach, however, can 
get messy in certain situations. So in some cases it is easier to use an F = ma 
argument, but you must be careful to correctly identify the system that is being 
accelerated by the force. The asymmetry in the above two examples is that in 
the first example, the force does indeed accelerate the incoming sand. But in the 
second example, the force does not accelerate (or decelerate) the outgoing sand. 
F has nothing to do with the leaked sand. 


Appendix D Existence of principal axes 


In this appendix, we will prove Theorem 9.4. That is, we will show that an 
orthonormal set of principal axes does indeed exist for any object, and for any 
choice of origin. It isn’t crucial that you study this proof. If you want to just 
accept the fact that principal axes exist, that’s perfectly fine. But the method we 
will use in this proof is one you will see again and again in your physics stud- 
ies, in particular when you study quantum mechanics (see the remark following 
the proof). 


Theorem D.1 Given a real symmetric 3 x 3 matrix, I, there exist three 
orthonormal real vectors, @,, and three real numbers, I, with the property 
that 


Te; = rôk. (D.1) 


Proof: This theorem holds more generally with 3 replaced by N (all the steps 
below easily generalize), but we’ll work with N = 3, to be concrete. Consider a 
general 3 x 3 matrix, I (we don’t need to assume yet that it’s real or symmetric). 
Assume that Iu = Ju for some vector u and some number J.! This may be 
rewritten as 


(hy — I) Ley Le Ux 0 
De Gai 1. fie leloj: (D.2) 
Le ly (Lz — I) Uz 0 


In order for there to be a nontrivial solution for the vector u (that is, one where u Æ 
(0, 0, 0)), the determinant of this matrix must be zero.” Taking the determinant, 
we see that we get an equation for / of the form 


aP + bi? +cl+d=0. (D.3) 


' Such a vector u is called an eigenvector of I, and the associated 7 is called an eigenvalue. But don’t 
let these names scare you. They’re just definitions. 

2 If the determinant were not zero, then we could explicitly construct the inverse of the matrix, which 
involves cofactors divided by the determinant. Multiplying both sides by this inverse would yield 
u=0. 
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The constants a, b, c, and d are functions of the matrix entries J, but we won’t 
need their precise form to prove this existence theorem. The only thing we need 
this equation for is to say that there do exist three (generally complex) solutions 
for J, because the equation is a cubic. 

We will now show that the solutions for / are real. This will imply that there 
exist three real vectors u satisfying Iu = Ju, because we can plug the real /’s 
back into Eq. (D.2) and solve for the real components ux, uy, and uz, up to an 
overall constant. We will then show that these vectors are orthogonal. 


e Proof that the Is are real: This follows from the real and symmetric conditions on I. 
Start with the equation Iu = Ju, and take the dot product with u* to obtain 


u* . Iu = u* - Ju 
= Iu* -u. (D.4) 
The vector u* is the vector obtained by complex conjugating each component of u (we 
don’t know yet that u can be chosen to be real). On the right side, Z is a scalar, so we can 
take it out from between the u* and u. The fact that I is real implies that if we complex 
conjugate the equation Iu = Ju, we obtain Iu* = /*u* (we know that I is real, but we 


don’t know yet that Z is real). If we then take the dot product of this equation with u, 


we obtain 
u-Tu* = /*u-u*. (D.5) 


We now claim that if I is symmetric, then a- Ib = b - Ia, for any vectors a and b. 
(We’ll leave this for you to show by simply multiplying each side out.) In particular, 
u* - Tu = u- Iu*, so Eqs. (D.4) and (D.5) give 


q —I*)u-u* = 0. (D.6) 


And since u- u* = |u|? + |u|? + |u3|2 4 0, we must have J = I*. Therefore, I 
is real. 

e Proof that the u are orthogonal: This follows from the symmetric condition on I. Let 
Tu, = u4, and Iu? = huz. Take the dot product of the former equation with uz to 


obtain 

u - Iu; = 1w - uj, (D.7) 
and take the dot product of the latter equation with u; to obtain 

uj -Iup = by -w. (D.8) 


As above, the symmetric condition on I implies that the left-hand sides of Eqs. (D.7) 
and (D.8) are equal. Therefore, 


(Mı —J2)u, -w = 0. (D.9) 


Existence of principal axes 


There are two possibilities here: (1) If J; 4 Jp, then we are done, because uy - u2 = 0, 
which says that uq and uz are orthogonal. (2) If 7) = h = J, then we have I(au, + 
buz) = (au; + buz), for any a and b. So any linear combination of u] and uz has 
the same property that u] and uz have (namely, that applying I is the same as just 
multiplying by /). We therefore have a whole plane of such vectors, so we can pick any 
two orthogonal vectors in this plane to be called uy and u2. a 


This theorem proves the existence of principal axes, because the inertia tensor in 
Eq. (9.8) is indeed a real and symmetric matrix. 


REMARK: (Warning: This remark has nothing to do with classical mechanics. It’s simply 
an ill-disguised excuse to get a limerick on quantum mechanics into the book.) In quantum 
mechanics, it turns out that any observable quantity, such as position, energy, momentum, 
angular momentum, etc., can be represented by a Hermitian matrix, with the observed value 
being an eigenvalue of the matrix. A Hermitian matrix is a (generally complex) matrix with the 
property that the transpose of the matrix equals the complex conjugate of itself. For example, 
a2 x 2 Hermitian matrix must be of the form, 


a b+ ic 
(aa d J (D.10) 


for real numbers a, b, c, and d. Now, if observed values are to be given by the eigenvalues of 
such a matrix, then the eigenvalues had better be real, because no one (in this world, at least) 
is about to go for a jog of 4 + 3i miles, or pay an electric bill for 17 — 43i kilowatt-hours. And 
indeed, you can show, via a slightly modified version of the above “Proof that the /’s are real” 
procedure, that the eigenvalues of any Hermitian matrix are in fact real. (And likewise, the 
eigenvectors are orthogonal.) This is, to say the least, very fortunate. 


God’s first tries were hardly ideal, 

For complex worlds have no appeal. 

So in the present edition, 

He made things Hermitian, 

And this world, it seems, is quite real. &æ 
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Appendix E Diagonalizing matrices 


This appendix is relevant to Section 9.3, which covers principal axes. The process 
of diagonalizing matrices (that is, finding the eigenvectors and eigenvalues, 
defined below) has countless applications in a wide variety of subjects. We’ll 
describe the process here as it applies to principal axes and moments of inertia. 

Let’s find the three principal axes and moments of inertia for a square with 
side length a, mass m, and one corner at the origin. The square lies in the x-y 
plane, with sides along the x and y axes (see Fig. E.1). We’ll choose the given 
x, y, and z axes as our initial basis axes. Using Eq. (9.8), you can show that the 
matrix I (with respect to this initial basis) is 


fy -fx 0 1/3 -1/4 0 
I=p|—-fxy fx 0 =ma’|-1/4 1/3 0 |, 
0 0 f@?+y’) 0 0 2/3 


(E.1) 


where p is the mass per unit area, so that a7 = m. We have used the fact that 
z = 0, and we have not bothered to write the dx dy in the integrals. 

Our goal is to find the basis in which I is diagonal. That is, we want to find 
three solutions! for u (and /) for the equation Iu = /u. Letting Z = Ama? to make 
things look a little cleaner, and using the above explicit form of I, the equation 
(I — J)u = 0 becomes 


i3=k =1/4 0 Ux 0 
ma | —1/4 1/3-aA 0 uy | =| 0 (E.2) 
0 0 2/3—Aa} \uz 0 


In order for there to be a nonzero solution for the components uy, uy, uz, the 
determinant of this matrix must be zero (see Footnote D.2). The resulting cubic 


l One obvious solution is u = Z, because Iz = (2/ 3)ma?2. From the orthogonality result of Theorem 
9.4, we know that the other two vectors must lie in the x-y plane. So we could quickly reduce this 
problem to a two-dimensional one, but let’s forge ahead with the general method. 


Diagonalizing matrices 


equation for A is easy to solve, because the determinant is [(1/3 — A)* — (1/4)7] 
(2/3 — à) = 0. The solutions are A = 1/3 + 1/4, and A = 2/3. So our three 
principal moments, J = Ama’, are 

T 3 l 4 2 


= pm » h= ma, h= = ma’. (E.3) 


I = 
i 12 3 


These are the eigenvalues of I. 

What are the vectors, u1, u2, and u3, associated with each of these /’s? Plug- 
ging A = 7/12 into Eq. (E.2) gives the three equations (one for each component), 
—Ux — Uy = 0, —ux — uy = 0, and uz = 0. These are redundant equations (that 
was the point of setting the determinant equal to zero). So ux = —uy, and uz = 0. 
The vector may therefore be written as u; = (c,—c,0), where c is any con- 
stant.” If we want a normalized vector, then c = 1/./2. In a similar manner, 
plugging 4 = 1/12 into Eq. (E.2) gives u2 = (c,c,0). And finally, plugging 
à = 2/3 into Eq. (E.2) gives u3 = (0,0,c), as claimed in the above footnote. 
Our three orthonormal principal axes corresponding to the moments in Eq. (E.3) 
are therefore 


1 1 1 1 
ôi = | ~,-—.0], ®={|—~,—,0], ô= (0,0,1). E.4 
i (Fs v2 ) ° (Zs v2 ) i ii 
These are the eigenvectors of I. They are shown in Fig. E.2. In the new basis of 
the principal axes (where @; = (1,0,0), etc.), the matrix I takes the form, 


7/12 0 0 
I=ma| 0 1/12 0 |. (E.5) 
0 0 2/3 


In other words, we have “diagonalized” the matrix. The basic idea is that from 
now on we should use the principal axes as our basis vectors. We can forget that 
we ever had anything to do with the original x, y, and z axes. 


REMARKS: 


1. 4) +h = h, as the perpendicular-axis theorem demands. 

2. h is the moment around one diagonal through the center of the square, which of course 
equals the moment around the other diagonal through the center. But the latter is related 
to Jı by the parallel-axis theorem. And indeed, J; = J) + m(a//2)?. 

3. Any axis through the center of the square, in the plane of the square, has the same moment 
(by Theorem 9.5 or 9.6). So h equals the moment around an axis through the center, 
parallel to a side. But this is the same as the moment of a stick of length a around its 
center (the extent of the square in the direction of the axis is irrelevant). Hence the factor 
of 1/12inh. & 


> We can solve for u only up to an overall constant, because if Iu = Tu is true for a certain u, then 
it is also true that I(cu) = /(cu), where c is any constant. 


$ 


Fig. E.2 
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Appendix F Qualitative relativity questions 


1. Is there such a thing as a perfectly rigid object? 


Answer: No. Since information can move no faster than the speed of 
light, it takes time for the atoms in the object to communicate with each 
other. If you push on one end of a rod, then the other end won’t move right 
away. If it did move right away, then these “pushing” and “moving” events 
would be spacelike separated (see Section 11.6), which would mean that 
there would exist a frame in which the “moving” event occurred before 
the “pushing” event. This is a violation of causality, so we conclude that 
the other end doesn’t move right away. 


2. How do you synchronize two clocks that are at rest with respect to each 


other? 


Answer: One way is to put a light source midway between the two 
clocks and send out signals, and then set the clocks to a certain value 
when the signals hit them. Another way is to put a watch right next to 
one of the clocks and synchronize it with this clock, and then move the 
watch very slowly over to the other clock and synchronize that clock with 
it. Any time-dilation effects can be made arbitrarily small by moving 
the watch sufficiently slowly, because the time-dilation effect is second 
order in v. 


3. Moving clocks run slow. Does this result have anything to do with the time 


it takes light to travel from the clock to your eye? 


Answer: No. When we talk about how fast a clock is running in a given 
frame, we are referring to what the clock actually reads in that frame. It 
will certainly take time for the light from the clock to reach an observer’s 
eye, but it is understood that the observer subtracts off this transit time in 
order to calculate the time at which the clock actually shows a particular 
reading. Likewise, other relativistic effects, such as length contraction and 
the loss of simultaneity, have nothing to do with the time it takes light to 
reach your eye. They deal only with what really is, in your frame. One way 
to avoid the complication of the travel time of light is to use the lattice of 
clocks and meter sticks described at the end of Section 11.3.3. 
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4. Does time dilation depend on whether a clock is moving across your vision 
or directly away from you? 


Answer: No. A moving clock runs slow, no matter which way it is mov- 
ing. This is clearer if you think in terms of the lattice of clocks and meter 
sticks from Section 11.3.3. If you imagine a million people standing at the 
points of the lattice, then they all observe the clock running slow. Time 
dilation is an effect that depends on the frame and the speed of a clock 
with respect to it. It doesn’t matter where you are in the frame (as long as 
you’re at rest in it). 


5. Does special-relativistic time dilation depend on the acceleration of the 
moving clock? 


Answer: No. The time-dilation factor is y = 1/,/1 — v?/c?, which 
doesn’t depend on a. The only relevant quantity is the v at a given instant. 
It doesn’t matter ifv is changing. But if you are accelerating, then you can’t 
naively apply the results of Special Relativity. (To do things correctly, it is 
perhaps easiest to think in terms of General Relativity. But GR is actually 
not required; see Chapter 14 for a discussion of this.) But as long as you 
represent an inertial frame, then the clock you are viewing can undergo 
whatever motion it wants, and you will observe it running slow by the 
simple factor of y. 


6. Someone says, “A stick that is length-contracted isn’t really shorter, it just 
looks shorter.” How do you respond? 


Answer: The stick really is shorter in your frame. Length contraction 
has nothing to do with how things look. It has to do with where the ends of 
the stick are at simultaneous times in your frame. (This is, after all, how 
you measure the length of something.) At a given instant in time in your 
frame, the distance between the ends of the stick is indeed less than the 
proper length of the stick. 


7. Consider a stick that moves in the direction in which it points. Does its length 
contraction depend on whether this direction is across your vision or directly 
away from you? 


Answer: No. The stick is length-contracted in both cases. Of course, if 
you look at the stick in the latter case, then all you see is the end, which 
is just a dot. But the stick is indeed shorter in your reference frame. As in 
Question 4 above, length contraction depends on the frame, not where you 
are in It. 


8. A mirror moves toward you at speed v. You shine a light toward it and the 
light beam bounces back at you. What is the speed of the reflected beam? 


Answer: The speed is c, as always. You will observe the light having a 
higher frequency, due to the Doppler effect. But the speed is still c. 
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9. In relativity, the order of two events in one frame may be reversed in another 
frame. Does this imply that there exists a frame in which I get offa bus before I 
get on it? 


Answer: No. The order of two events can be reversed in another frame 
only if the events are spacelike separated. That is, if Ax > cAt (in other 
words, the events are too far apart for even light to get from one to the 
other). The two relevant events here (getting on the bus, and getting off 
the bus) are not spacelike separated, because the bus travels at a speed less 
than c, of course. They are timelike separated. Therefore, in all frames it 
is the case that I get off the bus after I get on it. 

There would be causality problems if there existed a frame in which I 
got off the bus before I got on it. If I break my ankle getting off a bus, then 
I wouldn’t be able to make the mad dash that I made to catch the bus in 
the first place, in which case I wouldn’t have the opportunity to break my 
ankle getting off the bus, in which case I could have made the mad dash 
to catch the bus and get on, and, well, you get the idea. 


10. You are in a spaceship sailing along in outer space. Is there any way you can 
measure your speed without looking outside? 


Answer: There are two points to be made here. First, the question is 
meaningless, because absolute speed doesn’t exist. The spaceship doesn’t 
have a speed; it only has a speed relative to something else. Second, even 
if the question asked for the speed with respect to, say, a given piece of 
stellar dust, the answer would be “no.” Uniform speed is not measurable 
from within the spaceship. Acceleration, on the other hand, is measurable 
(assuming there is no gravity around to confuse it with). 


11. If you move at the speed of light, what shape does the universe take in your 
frame? 


Answer: The question is meaningless, because it’s impossible for you 
to move at the speed of light. A meaningful question to ask is: What shape 
does the universe take if you move at a speed very close to c? The answer 
is that in your frame everything would be squashed along the direction of 
your motion, due to length contraction. Any given region of the universe 
would be squashed down to a pancake. 


12. Two objects fly toward you, one from the east with speed u, and the other 
from the west with speed v. Is it correct that their relative speed, as measured 
by you, is u + v? Or should you use the velocity-addition formula, V = 
(u + v)/(l + uv /c?)? Is it possible for their relative speed, as measured by 
you, to exceed c? 


Answer: Yes, no, yes, to the three questions. It is legal to simply add the 
two speeds to obtain u + v. There is no need to use the velocity-addition 
formula, because both speeds here are measured with respect to the same 


13. 


14. 


15. 


16. 
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thing, namely you. It’s perfectly legal for the result to be greater than c, 
but it must be less than (or equal to, for photons) 2c. 

You need to use the velocity-addition formula when, for example, you 
are given the speed of a ball with respect to a train, and also the speed of 
the train with respect to the ground, and your goal is to find the speed of 
the ball with respect to the ground. The point is that now the two given 
speeds are measured with respect to different things, namely the train and 
the ground. 


Two clocks at the ends of a train are synchronized with respect to the train. 
If the train moves past you, which clock shows the higher time? 


Answer: The rear clock shows the higher time. It shows Lu/c* more 
than the front clock, where L is the proper length of the train. 


A train moves at speed 4c/5. A clock is thrown from the back of the 
train to the front. As measured in the ground frame, the time of flight is 
1 second. Is the following reasoning correct? “The y factor between the 
train and the ground is y = 1/./1— (4/5)? = 5/3. And since moving 
clocks run slow, the time elapsed on the clock during the flight is 3/5 of a 
second.” 


Answer: No. It is incorrect, because the time-dilation result holds only 
for two events that happen at the same place in the relevant reference frame 
(the train, here). The clock moves with respect to the train, so the above 
reasoning is invalid. 

Another way of seeing why it must be incorrect is the following. A 
certainly valid way to calculate the clock’s elapsed time is to find the 
speed of the clock with respect to the ground (more information would 
have to be given to determine this), and to then apply time dilation with 
the associated y factor to arrive at the answer of (1 s)/y. Since the clock’s 
v is definitely not 4c/5, the correct answer is definitely not 3/5 s. 


Person A chases person B. As measured in the ground frame, they have speeds 
4c/5 and 3c/5, respectively. If they start a distance L apart (as measured in 
the ground frame), how much time will it take (as measured in the ground 
frame) for A to catch B? 


Answer: As measured in the ground frame, the relative speed is 4c/5 — 
3c/5 = c/5. Person A must close the initial gap of L, so the time it takes 
is L/(c/5) = 5L/c. There is no need to use any fancy velocity-addition 
or length-contraction formulas, because all quantities in this problem are 
measured with respect to the same frame. So it quickly reduces to a simple 
“(rate) (time) = (distance)” problem. 


Is the “the speed of light is the same in all inertial frames” postulate really 
necessary? That is, is it not already implied by the “the laws of physics are 
the same in all inertial frames” postulate? 
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Answer: Yes, it is necessary. The speed-of-light postulate is definitely 
not implied by the laws-of-physics postulate. The latter doesn’t imply that 
baseballs have the same speed in all inertial frames, so it likewise doesn’t 
imply it for light. 

It turns out that nearly all the results in Special Relativity can be deduced 
by using only the laws-of-physics postulate. What you can find (with 
some work) is that there is some limiting speed, which may or may not be 
infinite (see Section 11.10). But you still have to say whether this speed is 
finite or infinite. The speed-of-light postulate does the trick. 


17. Imagine closing a very large pair of scissors. It is quite possible for the point 
of intersection of the blades to move faster than the speed of light. Does this 
violate anything in relativity? 


Answer: No. If the angle between the blades is small enough, then the 
tips of the blades (and all the other atoms in the scissors) can move at a 
speed well below c, while the intersection point moves faster than c. But 
this doesn’t violate anything in relativity. The intersection point is not an 
actual object, so there is nothing wrong with it moving faster than c. 

You might be worried that this result allows you to send a signal down 
the scissors at a speed faster than c. However, since there is no such thing 
as a rigid body, it is impossible to get the far end of the scissors to move 
right away, when you apply a force at the handle. The scissors would have 
to already be moving, in which case the motion is independent of any 
decision you make at the handle to change the motion of the blades. 


18. Two twins travel away from each other at relativistic speed. The time-dilation 
result says that each twin sees the other twin’s clock running slow, so each 
says the other has aged less. How would you reply to someone who asks, 
“But which twin really is younger?” 


Answer: It makes no sense to ask which twin really is younger, because 
the two twins aren’t in the same reference frame; they are using different 
coordinates to measure time. It’s as silly as having two people run away 
from each other into the distance (so that each person sees the other become 
small), and then asking: Who is really smaller? 


19. A particular event has coordinates (x,t) in one frame. How do you use a 
Lorentz transformation to find the coordinates of this event in another frame? 


Answer: You don’t. Lorentz transformations have nothing to do with 
single events. They deal only with pairs of events and the separation 
between them. As far as a single event goes, its coordinates in another 
frame can be anything you want, simply by defining your origin to be 
wherever and whenever you please. But for pairs of events, their separation 
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is a well-defined quantity, independent of any definition of origin. It is 
therefore a meaningful question to ask how the separations in two different 
frames are related, and the Lorentz transformations answer this question. 


20. When using the Lorentz transformations, how do you tell which frame is the 
moving “primed” frame? 


Answer: You don’t. There is no preferred frame, so it doesn’t make 
sense to ask which frame is moving. We used the “primed” notation in 
the derivation in Section 11.4.1 for ease of notation, but don’t take it to 
imply that there is a preferred frame S and a less fundamental frame S’. In 
general, a better notation is to use subscripts that describe the two frames, 
such as “g” for ground and “t” for train. For example, if you know the 
values of Af, and Ax; on a train (which we’ll assume is moving in the 
positive x direction with respect to the ground), and if you want to find 
the values of Af, and Axg on the ground, then you can write down: 


Axg = y (Ax + v At), 
5 (F.1) 
Atg = y (At +v Axt/c ). 


The sign is a “+” because the frame associated with the left side of the 
equation (the ground) sees the frame associated with the right side (the 
train) moving to the right. If instead you know the intervals on the ground 
and you want to find them on the train, then you just need to switch the 
subscripts “g” and “t” and change the sign to “—”, by the reasoning in the 
previous sentence. 


21. The momentum of an object with mass m and speed v is p = ymv. “A photon 
has zero mass, so it should have zero momentum.” Correct or incorrect? 


Answer: Incorrect. True, m is zero, but the y factor is infinite because 
v = c. Infinity times zero is undefined. A photon does indeed have momen- 
tum, and it equals E/c (which happens to equal hv/c, where v is the 
frequency of the light). 


22. It is not necessary to postulate the impossibility of accelerating an object 
to speed c. It follows as a consequence of the relativistic form of energy. 
Explain. 


Answer: E = ymc’, so ifv = c then y = 00, and the object must have 
an infinite amount of energy (unless m = 0, as for a photon). All the energy 
in the universe, let alone all the king’s horses and all the king’s men, can’t 
accelerate something to speed c. 
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Appendix G Derivations of the Lv/c? result 


In Section 11.3.1, we showed that if a train with proper length L moves at speed 
v with respect to the ground, then in the ground frame the rear clock reads Lu/c? 
more than the front clock (assuming that the clocks are synchronized in the train 
frame). There are various other ways to derive this result, so for the fun of it 
I’ve listed here all the derivations I can think of. The explanations are terse, but I 
refer you to the specific problem or section in the text where things are discussed 
in more detail. Many of these derivations are slight variations on each other, so 
perhaps they shouldn’t all count as separate ones, but here’s my list: 


1. Light source on train: Put a light source on a train, at distances de = L(c — v)/2c 
from the front and dp = L(c + v)/2c from the back. You can show that the photons 
hit the ends of the train simultaneously in the ground frame. But they hit the ends 
at different times in the train frame; the difference in the readings on clocks at the 
ends when the photons run into them is (d, — dp)/c = Lu/c?. Therefore, at a given 
instant in the ground frame (for example, the moment when the clocks at the ends are 
simultaneously illuminated by the photons), a person on the ground sees the rear clock 
read Lu/ c? more than the front clock. (See the example in Section 11.3.1.) 

2. Lorentz transformation: The second of Eqs. (11.17) is Atg = y (At + v Axt/c?), 
where the subscripts refer to the ground and train frames. If two events (for exam- 
ple, two clocks flashing their times) located at the ends of the train are simultaneous 
in the ground frame, then we have Atg = 0. And Axt = L, of course. The above 
Lorentz transformation therefore gives At = —Lv/c?. The minus sign here means 
that the event with the larger x; value has the smaller 4 value. In other words, 
the front clock reads Lu/c? less time than the rear clock, at a given instant in the 
ground frame. 

3. Invariant interval: This is actually just a partial derivation, because it determines 
only the magnitude of the Lv/ c? result, and not the sign. The invariant interval says 
that PAR = Ax? = At = Axe, where the subscripts refer to the ground and 
train frames. If two events (for example, two clocks flashing their times) located at 
the ends of the train are simultaneous in the ground frame, then we have Atg = 0. 
And Axt = L, of course. And we also know from length contraction that Axg = 
L/y. The invariant interval then gives c?(0)? — (L/y)* = agra — LŽ, which yields 
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cA? =Ľ(-— L/y?) => cA? = Lv? /c? => At = +Lv/c?. As mentioned 
above, the sign isn’t determined by this method. 

. Minkowski diagram: The task of Exercise 11.63 is to use a Minkowski diagram to 
derive the Lu/ c? result. The basic goal is to determine how many cf’ units fit in the 
segment BC in Figure 11.27, and also how many ct units fit in the segment BE in 
Figure 11.28. 

. Walking slowly on a train: In Exercise 11.58, a person walks very slowly at speed 
u from the back of a train of proper length L to the front. In the frame of the train, 
the time-dilation effect is second order in u/c and therefore negligible (because the 
total time is only first order in 1/1). But in the frame of the ground, the time-dilation 
effect is (as you can show) first order in u/c and therefore has a nonzero effect; an 
observer on the ground sees the person’s clock advance by less than a clock that is fixed 
on the train. Now, the person’s clock agrees with clocks at the rear and front at the 
start and finish, because of the negligible time dilation in the train frame. Therefore, 
since less time elapses on the person’s clock than on the front clock (in the ground 
frame), the person’s clock must have started out reading more time than the front 
clock (in the ground frame). This then implies that the rear clock must show more 
time than the front clock. A quantitative analysis shows that this excess time is in 
fact Lv/c?. 

. Consistency arguments: There are many setups (a few examples are Problems 11.2, 
11.3, 11.8, and Exercise 11.35) where the Lv/c? result is an ingredient in explaining a 
result. Without it, you would encounter a contradiction, such as two different frames 
giving two different answers to a frame-independent question. So if you wanted to, you 
could work backwards (under the assumption that everything is consistent in relativity) 
and let the rear-clock-ahead effect be some unknown time T (which might be zero, for 
all you know), and then solve for the 7 that makes everything consistent. You would 
arrive at T = Lu/c*. 

. Gravitational time dilation: The task of Exercise 14.13 is to derive (for small v) the 
Lv/ c? result by making use of the fact that Lu/ c looks a lot like the gh/ c? term in the 
GR time-dilation result. If you stand on the ground near the front of a train of length L 
and then accelerate toward the back with acceleration g, you will see a clock at the back 
running faster by a factor (1 + gL/c*), which will cause it to read (gL/c*)t = Lv/c? 
more than a clock at the front. (Assume that you accelerate for a short period of time, 
so that v © gt and that the distance in the gl / c? term remains essentially L.) 

. Accelerating rocket: The task of Problem 14.6 is to show that if a rocket with proper 
length Z accelerates at g and reaches a speed v, then in the ground frame the readings on 
the front and rear clocks are related by te = t)(1 + gL/c?°) —Lv/c?. In other words, the 
front clock reads 4,(1 + gL/ c?) —Lv/c* simultaneously with the rear clock reading ths 
in the ground frame. But in the rocket frame, gravitational time dilation tells us that 
the front clock reads 4 (1 + gL/ c?) simultaneously with the rear clock reading ty. The 
difference in clock readings (front minus rear) is therefore smaller in the ground frame 


than in the rocket frame, by an amount equal to Lv/c. This is the desired result. 
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Appendix H Resolutions to the twin paradox 


The twin paradox appeared in Chapters 11 and 14, both in the text and in various 
problems. To summarize, the twin paradox deals with twin A who stays on the 
earth,! and twin B who travels quickly to a distant star and back. When they 
meet up again, they discover that B is younger. This is true because A can use the 
standard special-relativistic time-dilation result to say that B’s clock runs slow 
by a factor y. 

The “paradox” arises from the fact that the situation seems symmetrical. That 
is, it seems as though each twin should be able to consider herself to be at rest, 
so that she sees the other twin’s clock running slow. So why does B turn out 
to be younger? The resolution to the paradox is that the setup is in fact not 
symmetrical, because B must turn around and thus undergo acceleration. She is 
therefore not always in an inertial frame, so she cannot always apply the simple 
special-relativistic time-dilation result. 

While the above reasoning is sufficient to get rid of the paradox, it isn’t quite 
complete, because (a) it doesn’t explain how the result from B’s point of view 
quantitatively agrees with the result from A’s point of view, and (b) the paradox 
can actually be formulated without any mention of acceleration, in which case 
slightly different reasoning applies. 

Below is a list of all the complete resolutions I can think of. The descriptions 
are terse, but I refer you to the specific problem or section in the text where things 
are discussed in more detail. As with the Lv/c* derivations in Appendix G, many 
of these resolutions are slight variations on each other, so perhaps they shouldn’t 
all count as separate ones, but here’s my list: 


1. Rear-clock-ahead effect: Let the distant star be labeled as C. Then on the outward 
part of the journey, B sees C’s clock ahead of A’s by Lv/c, because C is the rear clock 
in the universe as the universe flies by. But after B turns around, A becomes the rear 
clock and is therefore now ahead of C. This means that A’s clock must jump forward 
very quickly, from B’s point of view. (See Section 11.3.1 and Problem 11.2.) 


l We should actually have A floating in space, to avoid any GR time-dilation effects from the earth’s 
gravity. But if B travels quickly enough, the SR effects will dominate the gravitational ones. 


Resolutions to the twin paradox 


. Looking out of the portholes: Imagine many clocks lined up between the earth and the 
star, all synchronized in the earth—star frame. And imagine looking out of the portholes 
of the spaceship and making a movie of the clocks as you fly past them. Although 
you see each individual clock running slow, you see the “effective” clock in the movie 
(which is really many successive clocks) running fast. This effect is just a series of 
small applications (see Problem 11.2) of the rear-clock-ahead effect mentioned above. 
. Minkowski diagram: Draw a Minkowski diagram with the axes in A’s frame per- 
pendicular. Then the lines of simultaneity (that is, the successive x axes) in B’s frame 
are titled in different directions for the outward and inward parts of the journey. The 
change in the tilt at the turnaround causes a large amount of time to advance on A’s 
clock, as measured in B’s frame. (See Section 11.7 and Figure 11.68.) 

. General-relativistic turnaround effect: The acceleration that B feels when she turns 
around may equivalently be thought of as a gravitational field. Twin A on the earth is 
high up in the gravitational field, so B sees A’s clock run very fast during the turnaround. 
This causes A’s clock to show more time in the end. (See Problem 14.9.) 

. Doppler effect: By equating the total number of signals one twin sends out with the 
total number of signals the other twin receives, we can relate the total times on their 
clocks. (See Exercise 11.67.) 
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Appendix! Lorentz transformations 


In this appendix, we will give an alternate derivation of the Lorentz 
transformations in Eq. (11.17). The goal here is to derive them from scratch, 
using only the two postulates of relativity. We will not use any of the results 
derived in Section 11.3. Our strategy will be to use the relativity postulate (“all 
inertial frames are equivalent’’) to figure out as much as we can, and to then invoke 
the speed-of-light postulate at the end. The main reason for doing things in this 
order is that it will allow us to derive a very interesting result in Section 11.10. 
As in Section 11.4, consider a coordinate system S’ moving relative to another 
system S (see Fig. I.1). Let the constant relative speed between the frames be 
v. Let the corresponding axes of S and S’ point in the same direction, and let 
the origin of S’ move along the x axis of S, in the positive direction. As in 
Section 11.4, we want to find the constants, A, B, C, and D, in the relations, 


Ax =AAx’+BAr, 
At=CAt+DAx’. 


(1.1) 


The four constants will end up depending on v (which is constant, given the 
two inertial frames). Since we have four unknowns, we need four facts. The 
facts we have at our disposal (using only the two postulates of relativity) are the 
following. 


1. The physical setup: S’ travels with velocity v with respect to S. 

2. The principle of relativity: S should see things in S” in exactly the same way as S’ sees 
things in S (except perhaps for a minus sign in some relative positions, but this just 
depends on our arbitrary choice of directional signs for the axes). 

3. The speed-of-light postulate: A light pulse with speed c in S’ also has speed c in S. 


The second statement here contains two independent bits of information. (It con- 
tains at least two, because we will indeed be able to solve for our four unknowns. 
And it contains no more than two, because then our four unknowns would be 
over-constrained.) The two bits that are used depend on personal preference. 
Three that are commonly used are: (a) the relative speed looks the same from 
either frame, (b) time dilation (if any) looks the same from either frame, and 


Lorentz transformations 


(c) length contraction (if any) looks the same from either frame. It is also common 
to recast the second statement in the form: The Lorentz transformations are the 
same as their inverse transformations (up to a possible minus sign). We’ll choose 
to work with (a) and (b). Our four independent facts are then: 


1. S’ travels with velocity v with respect to S. 


. S travels with velocity —v with respect to S’. The minus sign here is due to the 


convention that we picked the positive x axes of the two frames to point in the same 


direction. 


. Time dilation (if any) looks the same from either frame. 
. A light pulse with speed c in S’ also has speed c in S. 


Let’s see what these imply, in the above order.! 


(1) says that a given point in S” moves with velocity v with respect to S. Letting x’ = 0 
(which is understood to be Ax’ = 0, but we’ll drop the A’s from here on) in Eqs. (1.1) 
and dividing them gives x/t = B/C. This must equal v. Therefore, B = vC, and the 


transformations become 


x = Ax’ + Cr’, 
t= Ct + Dx. a 


(2) says that a given point in S moves with velocity —v with respect to S’. Letting x = 0 
in the first of Eqs. (1.2) gives x’ /t/ = —vC/A. This must equal —v. Therefore, C = A, 


and the transformations become 


x = Ax’ + vAfť', 
t = Ať + Dx’. = 


Note that these are consistent with the Galilean transformations, which have A = 1 and 
D=0. 

(3) can be used in the following way. How fast does a person in S see a clock in S’ 
tick? (The clock is assumed to be at rest with respect to S’.) Let our two events be two 
successive ticks of the clock. Then x’ = 0, and the second of Eqs. (1.3) gives 


t= At’. (1.4) 


In other words, one second on S’’s clock takes a time of A seconds in S’s frame. 
Consider the analogous situation from S’’s point of view. How fast does a person in 

S’ see a clock in S tick? (The clock is now assumed to be at rest with respect to S, in 

order to create the analogous setup. This is important.) If we invert Eqs. (1.3) to solve 


In what follows, we could obtain the final result a little quicker if we invoked the speed-of-light 
fact prior to the time-dilation one. But we’ll do things in the above order so that we can easily carry 
over the results of this appendix to the discussion in Section 11.10. 
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Appendix | 


for x’ and ¢’ in terms of x and t, we find 


, x — ut 

* =I- Dv’ 

y_ At Dx (5) 
A(A — Do) 


Two successive ticks of the clock in S satisfy x = 0, so the second of Eqs. (1.5) gives 
foot 
A- Dv’ 


(1.6) 


In other words, one second on S’s clock takes a time of 1/(4 — Dv) seconds in S’’s 
frame. 

Both Eqs. (1.4) and (1.6) apply to the same situation (someone looking at a clock 
flying by). Therefore, the factors on the right-hand sides must be equal, that is, 


1 1 1 
A= = D=-|A--). L7 
A— Dv ={ 3) (7) 


Our transformations in Eq. (1.3) therefore take the form 


x = A(x + vt’), 


1 1 
t=A (+ -(1-—)s! . 
v A2 


These are consistent with the Galilean transformations, which have A = 1. 


(1.8) 


e (4) may now be used to say that ifx’ = ct’, then x = ct. In other words, if x’ = ct’, then 


/ / 
pas A((ct’) + vt’) _ c+uv (1.9) 


A (r 4. “(1 a pe) Pi “(1 = =) 


a (1.10) 


V1 —v2/c? 


We have chosen the positive square root so that the positive x and x’ axes point in the 


Solving for A gives 


same direction. The transformations are now no longer consistent with the Galilean 


transformations, because c is not infinite, which means that A is not 1. 


The constant A is commonly denoted by y, so we may finally write our 
Lorentz transformations, Eqs. (1.8), in the form, 
x=y(' +t’), 
(1.11) 
t=y(ť + vx’ /c*), 
where 
1 


V1 —v2/c? 


y (1.12) 


in agreement with Eq. (11.17). 


Appendix J Physical constants and data 


Earth 


Mass 

Mean radius 

Mean density 

Surface acceleration 
Mean distance from sun 
Orbital speed 

Period of rotation 
Period of orbit 


Moon 


Mass 

Radius 

Mean density 

Surface acceleration 
Mean distance from earth 
Orbital speed 

Period of rotation 

Period of orbit 


Sun 


Mass 

Radius 

Mean density 
Surface acceleration 


Fundamental constants 


Speed of light 
Gravitational constant 


Mg = 5.97 - 1074 kg 
Rg = 6.37- 10° m 
5.52 g/cm? 

g = 9.81 m/s* 

1.5- 10!! m 

29.8 km/s 


23 h 56 min 4 s = 8.6164 - 104 s 
365 days 6 h = 3.16 - 107 s ~ x - 10” s 


My = 7.35 - 10? kg 
Ry = 1.74 - 10° m 

3.34 g/cm? 

1.62 m/s? ~ g/6 

3.84 - 108 m 

1.0 km/s 

27.3 days = 2.36 - 106 s 
27.3 days = 2.36 - 106 s 


Ms = Mo = 1.99 - 10% kg 
Rs = 6.96 - 108 m 

1.41 g/cm? 

274 m/s? ~ 28g 


c = 2.998 . 108 m/s 
G = 6.674 - 107!! m? /kg s? 
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Planck’s constant 


Electron charge 
Electron mass 
Proton mass 
Neutron mass 


h = 6.63 - 10734 Js 

h = h/2r = 1.05 - 107°4 Js 

—e = —1.602 - 107!? C 

me = 9.11 - 1073! kg = 0.511 MeV/c? 
mp = 1.673 - 10-77 kg = 938.3 MeV/c? 
mn = 1.675 - 10777 kg = 939.6 MeV/c” 
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accelerating frame, 326, 457-477 
relativistic, 653—656 
action, 221, 658 
relativistic, 592 
stationary, 221-227 
air resistance, 7, 17, 63—65, 72, 80 
amplitude, 105, 113, 123, 225 
angular frequency, 5, 69, 105 
angular momentum, 233—234, 238, 
281-282, 309-313, 376-381, 
465, 592 
angular speed, 69, 311 
angular velocity vector, 372-376, 
400-403 
Atwood’s machines, 58-59 
azimuthal force, 461, 469-470, 478 


billiards, 166, 176, 191, 597 
body frame, 396-398 
boundary conditions, 226, 245 
brachistochrone, 250 


catenary, 32 
causality, 548, 700 
Cavendish experiment, 154—156 
center of mass (CM), 293, 312-315, 
325-326, 380-383 
calculation of, 317-318 
frame, 161-163 
center of percussion, 347 
central force, 151, 281-285 
centrifugal force, 220, 461-464 
centripetal acceleration, 70, 74, 471 
Chandler wobble, 397 
Chasles’ theorem, 371 
coin flipping, 426 


collisions, 164—167 
relativistic, 596-598 
rotational, 328-331 
complex numbers, 107, 109-112, 225, 
695 
Compton scattering, 612 
conservation of 
angular momentum, 233-234, 
282-283, 292 
energy, 138-141, 148-149, 234-236, 
586-588, 595-596 
momentum, 156-159, 233, 584-586, 
595-596 
in Noether’s theorem, 237 
string, 58—59 
conservative forces, 142—143, 150-151 
constraint forces, 227—229 
cookies, relativistic, 551 
coolest answer in the book, 192 
Coriolis force, 70, 220, 461, 464-469 
coupled oscillators, 115—120 
critical damping, 108—109 
cross product, 681—682 
cross section, 296, 298 
curl, 150-151, 687—689 
cyclic coordinates, 232—234 
cycloid, 82, 251-252, 326 


damped oscillations, 107—109 

and driven, 111-115 
decays, 598—600 

muon, 518, 521 
del operator, 684 
diagonalizing matrices, 696—697 
differential equations, 60—65, 101-105 
dimensional analysis, 4—6, 94 
divergence, 685—687, 689 


Doppler effect 

longitudinal, 539-541, 557, 607, 
611, 646, 651, 707 

transverse, 541—543 

dot product, 679—681 

double pendulum, 249 

drag force, 7, 17, 63—65, 72, 80, 107, 

177, 191 
driven oscillations, 109-115 


eccentricity, 288, 291, 568 

Eddington, Arthur, 51 

effective gravity, 463, 477 

effective potential, 283—285 

eigenvalue, 136-137, 693, 695-697 

eigenvector, 136—137, 693, 696—697 

Einstein, 462, 502-504, 508, 649 

elastic collision, 164-167, 329, 
588, 597 

energy, see kinetic energy, potential 
energy 

relativistic, 586—588 

energy—momentum 4-vector, 596, 636, 
639-640 

equation of motion, 12, 219 

equinoxes (precession of), 481 

Equivalence Principle, 649-650 

escape velocity, 174, 186 

ether, 505-509 

Euler angles, 399-400 

Euler’s equations, 393-396 

Euler-Lagrange equation, 218 

relativistic, 592 
event, 523 


Fermi problems, 3 
Feynman, Richard, 225 
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fictitious forces, 457—458, 460-471 
force 
central, 281-285 
common types, 23—24 
conservative, 142-143, 150-151 
constraint, 227, 229 
drag, 107 
driving, 111 
fictitious, 457-458, 460-471 
4-vector, 637 
gravitational, 24, 152-154 
Hooke’s law, 105, 147 
in differential equations, 60—65 
in free-body diagrams, 55—59 
in Newton’s laws, 51—54 
ma vs. dp/dt, 690-692 
radial, tangential, 69 
relativistic, 601-606 
tidal, 471-477 
transformation of, 604—606, 640-641 
Foucault’s pendulum, 467—469 
4-vector 
acceleration, 636, 641-643 
definition, 634 
energy—momentum, 596, 636, 
639-640 
force, 637, 640-641 
velocity, 636 
Fourier analysis, 110—111, 261 
fractal dimension, 356 
free symmetric top, see top, free 
free-body diagrams, 55-59 
frequency 
beats, 133 
circular pendulum, 70 
circular perturbation, 302 
cycloidal pendulum, 252 
Doppler, longitudinal, 540 
Doppler, transverse, 541, 542 
double pendulum, 273-274 
earth rotation, 463 
extended pendulum, 342 
fast, slow precession, 440 
Foucault’s pendulum, 468 
free top precession, body frame, 396 
free top precession, fixed frame, 398 
GR red/blue-shift, 652 
harmonic oscillator, 105 
heavy top precession, 408 
inverted pendulum, 260 


normal modes, via determinant, 
116-118 

nutation, 412 

pendulum, 106 

small oscillations, 147 

underdamped oscillator, 108 

zucchini, 106 


Galilean invariance, 503 
Galilean transformations, 502—505, 524, 
526, 548 
y factor, 515 
Gauss’ theorem, 686—687 
General Relativity, see Chapter 14 
golden ratio, 42, 80, 126, 191, 255, 343, 
558, 617, 631 
gradient, 151, 153, 684-685, 689 
gravity, 24, 141-142, 152-156, 287-289, 
463 
Equivalence Principle, 649-650 
tides, 471-477 


Hamilton’s principle, 224 

harmonic oscillator, 105—106, 225—226 
relativistic, 612 

heavy symmetric top, see top, heavy 

Heisenberg’s uncertainty principle, 72 

Heron’s formula, 621 

homogeneous equation, 101, 104, 117 

Hooke’s law, 105, 147 

hyperbolic functions, 43, 103, 543, 677 


ideal chain, 170 
impact parameter, 291, 299 
impulse 
angular, 331-333 
linear, 157, 331 
inelastic collision, 164, 167-172, 
330-331 
inertia tensor, 376-380 
inertial frame, 52, 509-510 
inhomogeneous equation, 101, 109 
initial conditions, 60, 105—106, 110, 
412, 417 
invariance 
Galilean, 503 
of (As), 533 
of E? — p*, 595 
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of inner product, 638 

of Lagrangian, 236 

of mass, 590-591 

of speed of light, 509 

rotation, 233 

space translation, 233 

time translation, 235 
invariant interval, 533—536, 591 
inverted pendulum, 247 


Kepler’s laws, 291-292 
with reduced mass, 295-296 
kinetic energy, 139-141, 163-164 
relativistic, 589 
rotational, 311, 313, 380-382 


Lagrange multiplier, 229 
Lagrangian 
definition, 218 
for heavy top, 405 
other applications, 242 
relativistic, 592 
with reduced mass, 294 
least action, 225 
length contraction, 519-522, 529, 
538-539, 665, 699 
no transverse, 549 
light, 501, 504—510, 611 
lightlike separation, 535 
limiting cases, 7—11 
linear differential equations, 101—105 
linear momentum, 52, 54, 156-159, 233 
relativistic, 584—586 
linearity, 103, 595, 638 
Lorentz transformations, 504, 523-529, 
543, 634, 702, 708-710 
Lorentz—FitzGerald contraction, 507 
loss of simultaneity, see simultaneity, 
loss of 


Maxwell’s equations, 503—505 

Michelson—Morley experiment, 505-509 

minimal surface, 242, 250 

Minkowski diagram, 536—539, 707 

moment of inertia, 311, 318-322 

momentum, see angular momentum, 
linear momentum 

multivariable calculus, 679-689 

muon, 518, 521, 601 
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Newton’s laws, 51—54 

Noether’s theorem, 236-239 

noninertial frame, see accelerating frame 
normal coordinates, 116 

normal modes, 118 

numerical solution, 11—13 

nutation, 411—414 


orbits, 289-292 

oscillations, see coupled oscillators, 
damped oscillations, driven 
oscillations, harmonic oscillator, 
small oscillations 

overdamping, 108 


parallel-axis theorem, 314-315, 321-322, 
382-383, 415 

partial derivative, 682—683 
particle-physics units, 600—601 
particular solution, 110 
pendulum 

circular, 10, 70, 81, 416, 483 

cycloidal, 252 

double, 249 

extended, 342 

Foucault’s, 467-469 

inverted, 247 

on inclined plane, 249 

plane, 4, 17, 81, 106, 121, 124 

spring, 219 

with free support, 248 

with oscillating support, 246 
period 

beats, 133 

equinox precession, 482 

extended pendulum, 342 

harmonic oscillator, 105 

minimum for planet, 185 

pendulum, 106 

pendulum corrections, 124 

planetary orbits, 292 

relativistic oscillator, 612 
perpendicular-axis theorem, 316 
phase, 112, 114-115, 225 
photon, 589, 611-612, 703 
Planck’s constant, 73, 225, 611 
planetary motion, 287—296 
polar coordinates, 68—70 
potential energy 


effective, 283-285 

in 1-D, 138-143 

in 3-D, 148-151 

near a minimum, 147-148 

vs. work, 144-147 
precession 

Chandler wobble, 397 

equinoxes, 48 | 

Foucault’s pendulum, 468 

free top 

body frame, 397 
fixed frame, 398 

heavy top, 402, 406—408 
principal axes, 383-388 

calculation of, 696-697 

existence of, 693-695 
principal moments, 383, 386, 696—697 
principle of 

correspondence, 588 

equivalence, 649—650 

Galilean invariance, 503 

least action, 225 

least time, 250 

maximal proper time, 656—658 

relativity, 510 

stationary action, 224 

superposition, 103 
projectile motion, 65—68 
proper length, 520, 535 
proper time, 534, 591, 636 

maximal, 656—658 
Pythagorean triple, 560 


quadrupole, 423, 480 
quantum mechanics, 73, 225, 227, 695 


raindrop, 180 

rapidity, 543-546 

redshift, 540, 555, 607 
gravitational, 652, 664 

reduced mass, 294 

relativistic mass, 590-591 

relativistic strings, 609-611 

relativity, see Chaps. 11—14 

resonance, 113—114 

rigid body motion, 371—372, 388-393, 

396-399, 403-414 
Rindler space, 656 
Roche limit, 486 


rocket motion, 159-160, 606-609 
rolling without slipping, 326, 420 
rotation matrix, 543 

Rutherford scattering, 297 


saddle point, 222, 247, 277, 672, 
682-683 

scalar product, see dot product 

scaling argument, 321-322 

separation of variables, 60—62 

simple harmonic motion, 105—106 

simultaneity 

line of, 538, 580 
loss of, 511-514, 528 

small oscillations, 147—148, 198, 
239-242 

solid angle, 297 

space elevator, 187 

spacelike separation, 535, 700 

spacetime interval, 534 

Special Relativity, see Chapters 11—13 

speed of light, 501, 504, 507, 509, 511, 
549, 701 

springs, 6, 105—109, 111-115, 118-119 

statics, see Chap. 2 

stationary action, 221—227 

steradian, 297 

Stokes’ theorem, 150, 688—689 

superball, 337, 348-349 

superposition principle, 103 

symmetric top, see top 

symmetry, 236, 238 


Taylor series, 9-10, 675—676 
tennis racket theorem, 417, 421 
tensor, see inertia tensor 
terminal velocity, 64 
threshold energy, 611 
tidal force, 471-477 

general, 473-477 

longitudinal, 471—472 

transverse, 472—473 
time dilation 

GR, 650-652 

SR, 514-519, 528, 535, 699 
timelike separation, 534, 700 
top 

free, 396-399 

heavy, 399-414 


torque, 27—30, 322-327 
transformation 

Galilean, 502-505, 524, 526, 548 

Lorentz, 523-529, 543, 702, 

708-710 

of acceleration, 641—643 

of E and p, 594-596, 640 

of force, 604—606, 640-641 

of 4-vector, 634 

of sphere into itself, 415 
translation force, 457-458, 461—462 
turning points, 141 


twin paradox, 517, 553, 555, 564, 
658-660, 662—663, 706-707 


uncertainty principle, 72 
underdamping, 107—108 
units, see dimensional analysis 


vector calculus, see multivariable 
calculus 
vector product, see cross product 
velocity addition 
longitudinal, 529-532, 553-554, 561, 
645 
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transverse, 532—533 


very important relation, 589 


weakly coupled oscillators, 
121 
work, 140 
in 3-D, 149 
work—energy theorem, 140 
general, 145 
worldline, 537 
hyperbolic, 654 
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The Hamiltonian method 
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and Solutions © 2007, by David Morin, Cambridge University Press. 


The text in this version is the same as in Version 1, but some new problems and exercises have 
been added. 


More information on the book can be found at: 


http://www.people.fas.harvard.edu/~djmorin/book.html 


At present, we have at our disposal two basic ways of solving mechanics problems. In 
Chapter 3 we discussed the familiar method involving Newton’s laws, in particular 
the second law, F = ma. And in Chapter 6 we learned about the Lagrangian 
method. These two strategies always yield the same results for a given problem, of 
course, but they are based on vastly different principles. Depending on the specifics 
of the problem at hand, one method might lead to a simpler solution than the other. 

In this chapter, we’ll learn about a third way of solving problems, the Hamil- 
tonian method. This method is quite similar to the Lagrangian method, so it’s 
debateable as to whether it should actually count as a third one. Like the La- 
grangian method, it contains the principle of stationary action as an ingredient. 
But it also contains many additional features that are extremely useful in other 
branches of physics, in particular statistical mechanics and quantum mechanics. 
Although the Hamiltonian method generally has no advantage over (and in fact is 
invariably much more cumbersome than) the Lagrangian method when it comes to 
standard mechanics problems involving a small number of particles, its superiority 
becomes evident when dealing with systems at the opposite ends of the spectrum 
compared with “a small number of particles,” namely systems with an intractably 
large number of particles (as in a statistical-mechanics system involving a gas), or 
systems with no particles at all (as in quantum mechanics, where everything is a 
wave). 

We won’t be getting into these topics here, so you’ll have to take it on faith 
how useful the Hamiltonian formalism is. Furthermore, since much of this book is 
based on problem solving, this chapter probably won’t be the most rewarding one, 
because there is rarely any benefit from using a Hamiltonian instead of a Lagrangian 
to solve a standard mechanics problem. Indeed, many of the examples and problems 
in this chapter might seem a bit silly, considering that they can be solved much more 
quickly using the Lagrangian method. But rest assured, this silliness has a purpose; 
the techniques you learn here will be very valuable in your future physics studies. 

The outline of this chapter is as follows. In Section 15.1 we’ll look at the sim- 
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ilarities between the Hamiltonian and the energy, and then in Section 15.2 we'll 
rigorously define the Hamiltonian and derive Hamilton’s equations, which are the 
equations that take the place of Newton’s laws and the Euler-Lagrange equations. 
In Section 15.3 we'll discuss the Legendre transform, which is what connects the 
Hamiltonian to the Lagrangian. In Section 15.4 we’ll give three more derivations of 
Hamilton’s equations, just for the fun of it. Finally, in Section 15.5 we’ll introduce 
the concept of phase space and then derive Liouville’s theorem, which has countless 
applications in statistical mechanics, chaos, and other fields. 


15.1 Energy 


In Eq. (6.52) in Chapter 6 we defined the quantity, 


N 
ðL. 
E= (>: 5G, a) Ts (15.1) 
a 


which under many circumstances is the energy of the system, as we will see below. 
We then showed in Claim 6.3 that dE/dt = —OL/0t. This implies that if ƏL/ðt = 0 
(that is, if t doesn’t explicitly appear in L), then E is constant in time. In the 
present chapter, we will examine many other properties of this quantity Æ, or more 
precisely, the quantity H (the Hamiltonian) that arises when E is rewritten in a 
certain way explained in Section 15.2.1. 

But before getting into a detailed discussion of the actual Hamiltonian, let’s first 
look at the relation between E and the energy of the system. We chose the letter 
E in Eq. (6.52/15.1) because the quantity on the right-hand side often turns out to 
be the total energy of the system. For example, consider a particle undergoing 1-D 
motion under the influence of a potential V(x), where x is a standard Cartesian 
coordinate. Then L = T — V = mx?/2 — V(x), which yields 

OL . Ko 
= gt b= (mt)è-L=2T-(T-V)=T+V, (15.2) 
which is simply the total energy. By performing the analogous calculation, it like- 
wise follows that E is the total energy in the case of Cartesian coordinates in N 
dimensions: 


1 1 
— (Fmt Sma) — Venen) 
2. 39 (P AV) 


In view of this, a reasonable question to ask is: Does E always turn out to be the 
total energy, no matter what coordinates are used to describe the system? Alas, 
the answer is no. However, when the coordinates satisfy a certain condition, Æ is 
indeed the total energy. Let’s see what this condition is. 

Consider a slight modification to the above 1-D setup. We’ll change variables 
from the nice Cartesian coordinate x to another coordinate q defined by, say, x(q) = 
Kq°, or equivalently q(x) = (x/K)‘/°. Since « = 5Kq‘*q, we can rewrite the 
Lagrangian L(x, t) = m&?/2 — V(x) in terms of q and ġ as 


Lad) = (PRE)? —V(wl)) = FOP -Valad, 54 
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where F(q) = 25K?mgq®/2 (so the kinetic energy is T = F'(q)q?). The quantity E 
is then 


= į- L= (2F(q)4)¢-L=2T -(T-V)=T+YV, (15.5) 


which again is the total energy. So apparently it is possible for (at least some) 
non-Cartesian coordinates to yield an E equaling the total energy. 

We can easily demonstrate that in 1-D, E equals the total energy if the new 
coordinate q is related to the old Cartesian coordinate x by any general functional 
dependence of the form, x = x(q). The reason is that since « = (dx/dq)q by the 
chain rule, the kinetic energy always takes the form of ġ? times some function of q. 
That is, T = F(q)q’, where F(q) happens to be (m/2)(dx/dq)?. This function F(q) 
just goes along for the ride in the calculation of E, so the result of T + V arises in 
exactly the same way as in Eq. (15.5). 

What if instead of the simple relation x = x(q) (or equivalently q = q(x)) we 
also have time dependence? That is, x = x(q,t) (or equivalently q = q(x,t))? The 
task of Problem 15.1 is to show that L(q,q,t) yields an E that takes the form, 


E=T+V n ((5) (F) (E) (15.6) 


which is not the total energy, T + V, due to the ðz(q,t)/ðt Æ 0 assumption. So a 
necessary condition for E to be the total energy is that there is no time dependence 
when the Cartesian coordinates are written in terms of the new coordinates (or vice 
versa). 

Likewise, you can show that if there is ġ dependence, so that x = x(q,q), the 
resulting E turns out to be a very large mess that doesn’t equal T + V. However, 
this point is moot, because as we did in Chapter 6, we will assume that the trans- 
formation between two sets of coordinates never involves the time derivatives of the 
coordinates. 

So far we’ve dealt with only one variable. What about two? In terms of Carte- 
sian coordinates, the Lagrangian is L = (m/2)(4? + t2) — V(a1, 22). If these co- 
ordinates are related to new ones (call them qı and q2) by xı = 21(q1,q@2) and 
z2 = ©2(q1, q2), then we have #1 = (0x1/0q1)q + (021 /0q2)q2, and similarly for tə. 
Therefore, when written in terms of the q’s, the kinetic energy takes the form, 


T= z (Adi + Binds + Caz), (15.7) 


where A, B, and C are various functions of the q’s (but not the q’s), the exact forms 
of which won’t be necessary here. So in terms of the new coordinates, we have 


OL , OL . 
Ogi ae Og2 2 
= m(Adi +(B/2åġ2)i +m((B/2å1 + Coe) de - L 

= m(Adi + Barge + Caz) -L 

= 2T-(T-V) 

= T+V, (15.8) 


B= L 


which is the total energy. This reasoning quickly generalizes to N coordinates, qi. 
The kinetic energy has only two types of terms: ones that involve ġ? and ones that 


Figure 15.1 
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involve q;q;. These both pick up a factor of 2 (as either a 2 or a 1 + 1, as we just 
saw in the 2-D case) in the sum )>(0L/04;)q;, thereby yielding 2T. 

As in the 1-D case, time dependence in the relation between the Cartesian 
coordinates and the new coordinates will cause E to not be the total energy, as 
we saw in Eq. (15.6) for the 1-D case. And again, g; dependence will also have 
this effect, but we are excluding such dependence. We can sum up all of the above 
results by saying: 


Theorem 15.1 A necessary and sufficient condition for the quantity E to be the 
total energy of a system whose Lagrangian is written in terms of a set of coordinates 
qi is that these qi are related to a Cartesian set of coordinates x; via expressions of 
the form, 


Tı Gd q2,- --), 


LN = £N(q1,q2,---)- (15.9) 


That is, there is no t or q; dependence. 


In theory, these relations can be inverted to write the q; as functions of the 2;. 


REMARK: It is quite permissible for the number of q;’s to be smaller than the number of 
Cartesian x;’s (N in Eq. (15.9)). Such is the case when there are constraints in the system. 
For example, if a particle is constrained to move on a plane inclined at a given angle 0, 
then (assuming that the origin is chosen to be on the plane) the Cartesian coordinates 
(x,y) are related to the distance along the plane, r, by x = rcos@ and y = r sin 0. Because 
6 is given, we therefore have only one qi, namely qi = r. The point is that even if there 
are fewer than N q;’s, the kinetic energy still takes the form of (m/2) $` t? in terms of 
Cartesian coordinates, and so it still takes the form (in the case of two q;’s) given in Eq. 
(15.7) once the constraints have been invoked and the number of coordinates reduced (so 
that the Lagrangian can be expressed in terms of independent coordinates, which is a 
requirement in the Lagrangian formalism). So E still ends up being the energy (assuming 
there is no t or ġi dependence in the transformations). 

Note that if the system is describable in terms of Cartesian coordinates (which means 
that either there are no constraints, or the constraints are sufficiently simple), and if we 
do in fact use these coordinates, then as we showed in Eq. (15.3), E is always the energy. 


h 


Example 1 (Particle in a plane): A particle of mass m moves in a horizontal 
plane. It is connected to the origin by a spring with spring constant k and relaxed 
length zero (so the potential energy is kr?/2 = k(x? +y°)/2), as shown in Fig. 15.1. 
Find L and E in terms of Cartesian coordinates, and then also in terms of polar 
coordinates. Verify that in both cases, E is the energy and it is conserved. 


Solution: In Cartesian coordinates, we have 


r? +y’), (15.10) 


1A more trivial example is a particle constrained to move in the x-y plane. In this case, the 
Cartesian coordinates (x,y,z) are related to the “new” coordinates (q1, q2) in the plane (which we 
will take to be equal to x and y) by the relations: x = q1, y = q2, and z = 0. 
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and so E ar i 
= >i : O M2 dy | EE 
which is indeed the energy. 
In polar coordinates, we have 
: kr? 
E272 Ss +r- Z, (15.12) 
2 2 
and so P 
OL : OL m,.2 2A2 kr 
B= -9—-L= tr @) 4 ; 15.1 
ar” + 36 3 (t + r03) 5 (15.13) 


which is again the energy. As mentioned above, the Cartesian-coordinate E is always 
the energy. The fact that the polar-coordinate E is also the energy is consistent with 
Eq. (15.9), because the Cartesian coordinates are functions of the polar coordinates: 
x =rcos@ and y = rsin. In both cases, Æ is conserved because L has no explicit 
t dependence (see Claim 6.3 in Chapter 6). 


Example 2 (Bead on a rod): A bead of mass m is constrained to move along 
a massless rod that is pivoted at the origin and arranged (via an external torque) 
to rotate with constant angular speed w in a horizontal plane. A spring with spring 
constant k and relaxed length zero lies along the rod and connects the mass to the 
origin, as shown in Fig. 15.2. Find L and F in terms of the polar coordinate r, and 
show that E is conserved but it is not the energy. 


Solution: The kinetic energy comes from both radial and tangential motion, so 
the Lagrangian is 


2 
(Pee. (15.14) 
2 2 
E is then i 
A OL k _ Mm ,.2 22 kr 
E ar” L 5 (° — rw") 4 a (15.15) 


Since L has no explicit t dependence, Claim 6.3 tells us that E is conserved. However, 
E is not the energy, due to the minus sign in front of the r?w* term. This is consistent 
with the above discussion, because the relations between the Cartesian coordinates 
and the coordinate r (namely x = rcoswt and y = rsinwt) involve t and are 
therefore not of the form of Eq. (15.9). 


Example 3 (Accelerating rod): A bead of mass m is constrained to move on a 
horizontal rod that is accelerated vertically with constant acceleration a, as shown 
in Fig. 15.3. Find L and EF in terms of the Cartesian coordinate x. Is Æ conserved? 
Is E the energy? 


Solution: Since y = at and y = at?/2, the Lagrangian is 


m7 .2 2 at? 
L= zÊ + (at)”) — mg (=) À (15.16) 
E is then 3 
Si OL í — M/.2 2 i at 
E= zt Lsa (a? — (at)”) 4 ma ( 5 Ji (15.17) 


E is not conserved, due to the explicit t dependence in L. Also, E is not the energy, 
due to the minus sign in front of the (at)? term. This is consistent with the fact 
that the transformations from the single coordinate along the rod, x, to the two 
Cartesian coordinates (x,y) are x = x and y = 0-2 + at, and the latter involves t. 


Figure 15.2 


Figure 15.3 


XV-6 CHAPTER 15. THE HAMILTONIAN METHOD 


REMARKS: In this third example, the t dependence in the transformation between the 
two sets of coordinates (which caused E to not be the energy) in turn brought about the 
t dependence in L (which caused E to not be conserved, by Claim 6.3). However, this 
“bringing about” isn’t a logical necessity, as we saw in the second example above. There, 


the t dependence in the transformation didn’t show up in L, because all the t’s canceled 


out in the calculation of t? + y?, leaving only 7? + r?w?. 


The above three examples cover three of the four possible permutations of E equalling 
or not equalling the energy, and E being conserved or not conserved. An example of 
the fourth permutation (where E is the energy, but it isn’t conserved) is the Lagrangian 
L = mż?/2 — V (x,t). This yields E = mt?/2 + V (x,t), which is the energy. But E isn’t 
conserved, due to the t dependence in V (x,t). & 


15.2 Hamilton’s equations 


15.2.1 Defining the Hamiltonian 


Our goal in this section is to rewrite FE in a particular way that will lead to some very 
useful results, in particular Hamilton’s equations in Section 15.2.2 and Liouville’s 
theorem in Section 15.5. To start, note that L (and hence F) is a function of q 
and q. We’ll ignore the possibility of t dependence here, since it is irrelevant for the 
present purposes. Also, we’ll work in just 1-D for now, to concentrate on the main 
points. Let’s be explicit about the q and q dependence and write E as 


oL(g, å) 


E(q,ġ) 4 


q— L(q,4q). (15.18) 


Our strategy for rewriting E will be to exchange the variable ġ for the variable p, 
defined by 


2 (15.19) 


We already introduced p in Section 6.5.1; it is called the generalized momentum or 
the conjugate momentum associated with the coordinate q. It need not, however, 
have the units of standard linear momentum, as we saw in the examples in Section 
6.5.1. Once we exchange q for p, the variables in E will be q and p, instead of 
q and ġ. Unlike q and q (where one is simply the time derivative of the other), 
the variables q and p are truly independent; we’ll discuss this below at the end of 
Section 15.4.1. 

In order to make this exchange, we need to be able to write q in terms of q and 
p. We can do this (at least in theory) by inverting the definition of p, namely p = 
OL /0q, to solve for ġ in terms of q and p. In many cases this inversion is simple. For 
example, the linear momentum associated with the Lagrangian L = ma?/2— V (x) 
is p = OL/0« = mz, which can be inverted to yield t = p/m (which happens to 
involve p but not x), as we well know. And the angular momentum associated with 
the central-force Lagrangian L = m(+? + r26?)/2 — V(r) is pọ = OL/06 = mr?6 
(we’re including the subscript 6 just so that this angular momentum isn’t mistaken 
for a standard linear momentum), which can be inverted to yield Ê = pg /mr? (which 
involves both pg and r). However, in more involved setups this inversion can be 
complicated, or even impossible. 

Having written ġ in terms of q and p (that is, having produced the function 
q(q,p)), we can now replace all the q’s in E with q’s and p’s, thereby yielding a 
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function of only q’s and p’s. When written in this way, the accepted practice is to 
use the letter H (for Hamiltonian) instead of the letter E. So it is understood that 
H is a function of only q and p, with no q’s. Written explicitly, we have turned the 
E in Eq. (15.18) into the Hamiltonian given by 


H(q, p) = p4(q,p) — Lla, 4(4,))- (15.20) 


This point should be stressed: a Lagrangian is a function of q and q, whereas a 
Hamiltonian is a function of q and p. This switch from q to p isn’t something we’ve 
done on a whim simply because we like one letter more than another. Rather, there 
are definite motivations (and rewards) for using (q, p) instead of (q,q), as we'll see 
at the end of Section 15.3.3. 

If we have N coordinates q; instead of just one, then from Eq. (15.1) the Hamil- 
tonian is 


N 
H(q,p) = (do ita) — L(q,ġ(q,p)), (15.21) 


where 
=. (15.22) 


The arguments (q, p) above are shorthand for (q1,.-.,@n,P1,---,pn). (And likewise 
for the q in the last term.) So H is a function of (in general) 2N coordinates. And 
possibly the time t also (if L is a function of t), in which case the arguments (q, p) 
simply become (q, p, t) 


Example (Harmonic oscillator): Consider a 1-D harmonic oscillator described 
by the Lagrangian L = mt? /2 — kz? /2. The conjugate momentum is p = ðL/ðt = 
mg, which yields = p/m. So the Hamiltonian is 


k 
H=pł-L = pt (Ze a) 
2 
m 2 \m 
2 2 
p ka 
= ++ —_, 15.23 
Sia. 2° ( ) 


where we now have H in terms of x and p, with no «’s. H is simply the energy, 
expressed in terms of x and p. 


Example (Central force): Consider a 2-D central-force setup described by the 
Lagrangian L = m(t? + r?6?)/2 — V(r). The two conjugate momenta are 


= 2L = mrt i= Pe 
r = & =e 
pe = A =mr6 => 6= — (15.24) 


So the Hamiltonian is 


H = (ora) =z 
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H expresses the energy (it is indeed the energy, by Theorem 15.1, because the 
transformation from Cartesian to polar coordinates doesn’t involve t) in terms of r, 
pr, and pe. It happens to be independent of 0, and this will have consequences, as 
we'll see below in Section 15.2.3. 

Note that in both of these examples, the actual form of the potential energy is 
irrelevant. It simply gets carried through the calculation, with only a change in sign 
in going from L to H. 


15.2.2 Derivation of Hamilton’s equations 


Having constructed H as a function of the variables q and p, we can now derive two 
results that are collectively known as Hamilton’s equations. And then we’ll present 
three more derivations in Section 15.4, in the interest of going overboard. As in the 
previous section, we’ll ignore the possibility of t dependence (because it wouldn’t 
affect the discussion), and we’ll start by working in just 1-D (to keep things from 
getting too messy). 

Since H is a function of q and p, a reasonable thing to do is determine how 
H depends on these variables, that is, calculate the partial derivatives 0H /Oq and 
OH/Op. Let’s look at the latter derivative first. From the definition H = pq-— L, 


we have 
dH(q,p) _ O(pa(a.p)) AL (a,4(4,p)) (15.26) 
Op Op Op l 
where we have explicitly indicated the q and p dependences. In the first term on the 
right-hand side, p4q(q,p) depends on p partly because of the factor of p, and partly 
because ġ depends on p. So we have ô(pġ(q, p))/3p = ġ + p(ðġ/ðp). In the second 


term, L(q, q(q, p)) depends on p only through its dependence on q, so we have 


OL(q,4(4,P)) _ ƏL(q, å) 4(4,?) 
Op Og Op ` 


But p = OL(q,q)/0q. If we substitute these results into Eq. (15.26), we obtain 
(dropping the (q, p) arguments) 


OH (4 st) Og 
= |tz] Pa 
p 


‘Op 
= å. (15.28) 


(15.27) 


This nice result is no coincidence. It is a consequence of the properties of the 
Legendre transform (as we’ll see in Section 15.3), which explains the symmetry 
between the relations p = OL/0q and å = 0H/Op. 
Let’s now calculate 0H/Oq. From the definition H = pq — L, we have 
JH(q,p) _ O(pad(a.p)) — OL(4,4(a,p)) 
m . (15.29) 
ôq ôq ôq 

In the first term on the right-hand side, pq(q,p) depends on q only through its 
dependence on ġ, so we have ô(på(q,p))/ðq = p(ðġ/ðq). In the second term, 
L(q,4(q,p)) depends on q partly because q is the first argument, and partly because 
q depends on q. So we have 


ƏL(a, la, p)) _ AL(q,4) 
ôq © q 


ƏLla, å) Əla, p) 


15. 
ae (15.30) 
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As above, we have p = OL(q,q)/0q. But also, the Euler-Lagrange equation (which 
holds, since we’re looking at the actual classical motion of the particle) tells us that 


d (OL(q,q)\ _ ƏL, å) . _ ƏL(q, å) 


If we substitute these results into Eq. (15.29), we obtain (dropping the (q, p) argu- 


ments) 
dH ag, ð 


ôq 
= E (15.32) 


Note that we needed to use the E-L equation to derive Eq. (15.32) but not Eq. 
(15.28). Putting the two results together, we have Hamilton’s equations (bringing 
back in the (q, p) arguments): 

_3H(q, p) 


inl (q, p) + 
= OH See 15. 
q F and p (15.33) 


In the event that L (and hence H) is a function of t, the arguments (q, p) simply 
become (q, p, t). 


REMARKS: 


1. Some of the above equations look a bit cluttered due to all the arguments of the 
functions. But things can get confusing if the arguments aren’t written out explicitly. 
For example, the expression 0L/0q is ambiguous, as can be seen by looking at Eq. 
(15.30). Does 0L/0q refer to the ƏL(q, ålq, p))/3q on the left-hand side, or the 
OL(q, 4)/ðq on the right-hand side? These two partial derivatives aren’t equal; they 
differ by the second term on the right-hand side. So there is no way to tell what 
OL/0q means without explicitly including the arguments. 


2. Hamilton’s equations in Eq. (15.33) are two first-order differential equations in the 
variables q and p. But you might claim that the first one actually isn’t a differential 
equation, since we already know what ġ is in terms of q and p (because we previously 
needed to invert the expression p = OL(q,q)/Oq to solve for the function q(q,p)), 
which means that we can simply write g(q,p) = OH(q, p)/Op for the first equation, 
which has only q’s and p’s, with no time derivatives. It therefore seems like we can 
use this to quickly solve for q in terms of p, or vice versa. 


However, this actually doesn’t work, because the equation q(q,p) = OH(q,p)/Op 
is identically true (equivalent to 0 = 0) and thus contains no information. The 
reason for this is that the function ġ(q, p) is derived from the definitional equation 
p = OL(q,q)/0qg. And as we'll see in Section 15.3, this equation contains the same 
information as the equation q(q,p) = OH(q,p)/Op, due to the properties of the 
Legendre transform. So no new information is obtained by combining them. This 
will be evident in the examples below, where the first of Hamilton’s equations always 
simply reproduces the definition of p. 

Similarly, if you want to use the E-L equation, Eq. (15.31), to substitute OL(q, q)/0q 
for p in the second of Hamilton’s equations (and then plug in g(q,p) for q af- 
ter taking this partial derivative, because you’re interested in OL(q,q)/Oq and not 
OL(q, (q, p))/Oq), then you will find that you end up with 0 = 0 after simplifying. 
This happens because the equation OL(q, q)/0q = —ƏH (q, p)/Oq is identically true 
(see Eq. (15.43) below). & 
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Example (Harmonic oscillator): From the example in Section 15.2.1, the Hamil- 
tonian for a 1-D harmonic oscillator is 


2 2 
p x 
AS Se 15.34 
orn + 5 (15.34) 
Hamilton’s equations are then 
T on i=? : and 
Op m 
OH 
j = j = —kz. 15. 
p=-> p= —ke (15.35) 


The first of these simply reproduces the definition of p (namely p = OL/0z = mz). 
The second one, when mz is substituted for p, yields the F = ma equation, 
d(mz) 
dt 


ka më ka. (15.36) 


Many variables 


As with the Lagrangian method, the nice thing about the Hamiltonian method is 
that it quickly generalizes to more than one variable. That is, Hamilton’s equations 
in Eq. (15.33) hold for each coordinate q; and its conjugate momentum p;. So if 
there are N coordinates q; (and hence N momenta p;), then we end up with 2N 
equations in all. 

The derivation of Hamilton’s equations for each q; and p; proceeds in exactly 
the same manner as above, with the only differences being that the index 7 is tacked 
on to q and p, and certain terms become sums (but these cancel out in the end). 
We’ll simply write down the equations here, and you can work out the details in 
Problem 15.4. The 2N equations for the Hamiltonian in Eq. (15.21) are: 


. _ OH(q, p) _3H(q, p) 
qi Op; ? Ogi ’ 


The arguments (q,p) above are shorthand for (q,...,¢N,P1,---,Ppn). And as in 
the 1-D case, if L (and hence H) is a function of t, then a t is tacked on to these 
2N variables. 


and = pj = for 1<i<N. (15.37) 


Example (Central force): From the example in Section 15.2.1, the Hamiltonian 
for a 2-D central force setup is 


H= P 4 ve). (15.38) 


The four Hamilton’s equations are then 


t gii t Pr 
Op, m’ 
Pr Ər Pr rS V (r), 
OH À po 
0 0 
Ope mr?’ 
. OH : 
po = poe = 0. (15.39) 


00 
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The first and third of these simply reproduce the definitions of py and pọ. The 
second equation, when mř is substituted for pr, yields the equation of motion for r, 
namely 
2 
mi = 2 —V'(r). (15.40) 


mr3 


This agrees with Eq. (7.8) with pọ > L. And the fourth equation is the statement 
that angular momentum is conserved. If mr?6 is substituted for pg, it says that 


d(mr?0) 


g =? (15.41) 


Conservation of angular momentum arises because @ is a cyclic coordinate, so let’s 
talk a little about these... 


15.2.3 Cyclic coordinates 


In the Lagrangian formalism, the E-L equation tells us that 


(15.42) 


da (OL\ _ OL ._ OL 
dt \ a4 aq P= Aq" 


Therefore, if a given coordinate q is cyclic (that is, it doesn’t appear) in L(q, q), 
then OL(q,q)/Oq = 0, and so p = 0. That is, p is conserved. In the Hamiltonian 
formalism, this conservation comes about from the fact that q is also cyclic in 
H(q,p), which means that OH (q,p)/0q = 0, and so the Hamilton equation p = 
—OH(q,p)/0¢q yields p = 0. 

To show that everything is consistent here, we should check that a coordinate 
q is cyclic in H(q,p) if and only if it is cyclic in L(q,q). This must be the case, of 
course, in view of the fact that both the Lagrangian and Hamiltonian formalisms 
are logically sound descriptions of classical mechanics. But let’s verify it, just to 
get more practice throwing partial derivatives around. 

We’ll calculate 0H (q, p)/Oq in terms of OL(q, G)/Oq. This calculation is basically 
a repetition of the one leading up to Eq. (15.32). We’ll stick to 1-D (the calculation 
in the case of many variables is essentially done in Problem 15.4), and we’ll ignore 
any possible t dependence (since it would simply involve tacking a t on to the 
arguments below). From the definition H(q,p) = på(q, p) — L(q, 4(q,P)), we have 


OH(q,p) _ A(pa(q.p)) _ ƏL(a å4la p)) 
ôq ôq ôq 
_ „34l p) (maa OL(q, å) ian) 
ôq ôq Og ôq 
7 pil») (Mea p) 
= ae), (15.43) 
ôq 


Therefore, if one of the partial derivatives is zero, then the other is also, as we 
wanted to show. 


REMARK: Beware of the following incorrect derivation of the 0H/Oq = —ƏL/ðq result: 
From the definition H = pq—L, we have 0H/0q = 0(pq)/O0q—OL/0Oq. Since neither p nor 
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q depends on q, the first term on the right-hand side is zero, and we arrive at the desired 
result, 0H /0q = —OL/0q. 

The error in this reasoning is that q does depend on q, because it is understood 
that when writing down H, q must be eliminated and written in terms of q and p. So 
the first term on the right-hand side is in fact not equal to zero. Of course, now you 
might ask how we ended up with the correct result of 0H/0q = —OL/0q if we made 
an error in the process. The answer is that we actually didn’t end up with the cor- 
rect result, because if we include the arguments of the functions, what we really derived 
was OH(q,p)/0q = —dL(q,4(q,p)) /Oq, which is not correct. The correct statement is 
OH (q,p)/0q = —OL(q, q)/Oq. These two ƏL/ðq partial derivatives are not equal, as men- 
tioned in the first remark following Eq. (15.33). When the calculation is done correctly, 
the “extra” term in aL(q, q(q,p)) /Oq (the last term in the third line of Eq. (15.43)) can- 
cels the 0(p q)/0q contribution that we missed, and we end up with only the OL(q, q)/0q 
term, as desired. Failure to explicitly write down the arguments of the various functions, 
especially L, often leads to missteps like this, so the arguments should always be included 
in calculations of this sort. & 


15.2.4 Solving Hamilton’s equations 


Assuming we’ve written down Hamilton’s equations for a given problem, we still 
need to solve them, of course, to obtain the various coordinates as functions of time. 
But we need to solve the equations of motion in the Lagrangian and Newtonian 
formalisms, too. The difference is that for n coordinates, we now have 2n first- 
order differential equations in 2n variables (n coordinates q;, and n momenta p;), 
instead of n second-order differential equations in the n coordinates q; (namely, the 
n E-L equations or the n F = ma equations). 

In practice, the general procedure for solving the 2n Hamilton’s equations is to 
eliminate the n momenta p; in favor of the n coordinates q; and their derivatives 
di. This produces n second-order differential equations in the coordinates q; (as we 
found in the harmonic-oscillator and central-force examples above), which means 
that we’re basically in the same place we would be if we simply calculated the n E-L 
equations or wrote down the n F = ma equations. So certainly nothing is gained, 
at least in the above two examples, by using the Hamiltonian formalism over the 
Lagrangian method or F = ma. 

As mentioned in the introduction to this chapter, this is invariably the case with 
systems involving a few particles; nothing at all is gained (and usually much is lost, 
at least with regard to speed and simplicity) when using the Hamiltonian instead 
of the Lagrangian formalism. In the latter, you write down the Lagrangian and 
then calculate the E-L equations. In the former, you write down the Lagrangian, 
and then go through a series of other steps, and then finally arrive back at the 
E-L (or equivalent) equations. However, in other branches in physics, most notably 
statistical mechanics and quantum mechanics, the Hamiltonian formalism ranges 
from being extremely helpful for understanding what is going on, to being absolutely 
necessary for calculating anything useful. 

When solving ordinary mechanics problems with the Hamiltonian formalism, 
you should keep in mind that the purpose is not to gain anything in efficiency, but 
rather to become familiar with a branch of physics that has numerous indispensable 
applications to other branches. Having said all this, we’ll list here the concrete steps 
that you need to follow when using the Hamiltonian method: 


1. Calculate T and V, and write down the Lagrangian, L = T — V, in terms of 
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whatever coordinates q; (and their derivatives q;) you find convenient. 
2. Calculate p; = ƏL/ðq; for each of the N coordinates. 


3. Invert the expressions for the N p; to solve for the N q; in terms of the q; and 


pi? 


4. Write down the Hamiltonian, H = (X` piġi) — L, and then eliminate all the ġ; 
in favor of the q; and p;, as indicated in Eq. (15.21). 


5. Write down Hamilton’s equations, Eq. (15.37); two of them for each of the N 
coordinates. 


6. Solve Hamilton’s equations; the usual goal is to obtain the N functions q;(t). 
This generally involves eliminating the p; in favor of the q;. This will turn the 
2N first-order differential Hamilton’s equations into N second-order differen- 
tial equations. These will be equivalent, in one way or another, to what you 
would have obtained if you had simply written down the E-L equations after 
step (1) above. 


15.3 Legendre transforms 


15.3.1 Formulation 


The definition of E in Eq. (15.1 
bit arbitrary to you. However 


The H defined in Eq. 
(15.20) is called the Legendre transform of L, and we’ll now give a review of this 
topic. 

To isolate the essentials, let’s forget about H and L for now and consider a 
function of just one variable, F(x). The derivative of F(x) is dF(a)/dx = F'(z), 
but for notational convenience let’s relabel F’(x) as s(x) (s for “slope”). So we 
have dF'(x) /dx = s(x). Note that we can (at least in principle; and at least locally, 
assuming s isn’t constant) invert the function s(x) to solve for x as a function of s, 
yielding z(s). For example, if F(x) = x°, then s(x) = F’(x) = 327. Inverting this 
gives z(s) = (s/3)*/?. 

The purpose of the Legendre transform is to construct another function (call it 
G) that reverses the roles of x and s. That is, the goal is to construct a function 
G(s) with the property that 


dG 
Cer) (15.44) 
ds 
G(s) is then called the Legendre transform of F(x). Now, the most obvious way 


to find G(s) is to simply integrate this equation. In the case of F(x) = x3, we’re 


looking for a function G(s) whose derivative is (s/3)!/?. Integrating this, we quickly 
see that the desired function is G(s) = 2(s/3)°/?. So this is the Legendre transform 
of F(a) = x3. However, it turns out that there is another method that doesn’t 
involve the task of integrating. The derivation of this method is as follows. 

If such a G exists, then we can add the two equations, dF = s dx and dG = z ds, 
to obtain d(F + G) = sdu+ads = d(F + G) = dsz). This implies that 


?In some cases the expressions for the p; may be “coupled” (that is, a given q; may appear 
in more than one of them), thereby making the inversion difficult (or impossible). But in most 
cases the equations are uncoupled and easily invertible, as they were in the harmonic-oscillator 
and central-force examples above. 
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The Legendre transform of F is therefore 
G= sx — F = F'z— F. (15.45) 


This function G can be written as a function of either x or s (since each of these can 
be written as a function of the other). But when we talk about Legendre transforms, 
it is understood that G is a function of s. So to be precise, we should write 

G(s) = s2(s) — F(a(s)). (15.46) 


3 example. We have 


s(s/3)!/? — F((s/3)'/?) 

= (3-8/3) -(s/3)/? — (s/3)°/? 

(3 = 1)(s/3)9/ 

2(5/3)""*, (15.47) 


Let’s see what this gives in the F(x) = x 
G(s) = s2(s) — F(2(s)) 


in agreement with the above result obtained by direct integration. The advantage 
of this new method is that it involves the straightforward calculation in Eq. (15.46), 
whereas the process of integration can often be tricky. 

As a double check on Eq. (15.46), let’s verify that the G(s) given there does 
indeed have the property that dG(s)/ds equals x. Using the product and chain 
rules, along with the definition of s, we have 


dG(s) dx dF daz dx da 
= . { . . = | . . = 1 A 
ds eee =) de ds |. ds ds” ae) 
as desired. 


What is the Legendre transform of G(s)? From Eq. (15.45) we see that the rule 
for calculating the transform is to subtract the function from the product of the 
slope and the independent variable. In the case of G(s), the slope is z and the 
independent variable is s. So the Legendre transform of G(s) is 


G's -G=as—-G=F, (15.49) 


where the second equality follows from Eq. (15.45). Since F is a function of xz, we 
should make this clear and write xs(x) — G(s(x)) = F(a). We therefore see that 
two applications of the Legendre transform bring us back to the function we started 
with. In other words, the Legendre transform is the inverse of itself. 

Let’s calculate the Legendre transform of a few functions. 


Example 1: F(x) = 2: 
We have s = F’ (x) = 2x, so x(s) = s/2. The Legendre transform of F(x) is then 


G(s) = sa(s) — F((a(s)) = s- (s/2) — (s/2)? = s?/4. (15.50) 
Therefore, dG(s)/ds = s/2, which equals x, as expected. 
Example 2: F(x) = Inz: 
We have s = F’(x) = 1/z, so x(s) = 1/s. The Legendre transform of F(x) is then 
G(s) = sa(s) — F(a(s)) =s-(1/s) —In(1/s) = 1 + Ins. (15.51) 


Therefore, dG(s)/ds = 1/s, which equals x, as expected. 
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Example 3: F(x) 


= ev 
We have s = F’(x) = e”, so z(s) = Ins. The Legendre transform of F(x) is then 


G(s) = sx(s) — F(a(s)) =s-Ins—s. (15.52) 


Therefore, dG(s)/ds = (s -(1/s)+1-In s) — 1 = lns, which equals x, as expected. 


15.3.2 Geometric meaning 


It is easy to demonstrate geometrically the meaning of the Legendre transform given 
in Eq. (15.45). And a byproduct of this geometrical explanation is a quick way of 
seeing why dG/ds = x, as we verified algebraically in Eq. (15.48). Consider the 
function F(x) shown in Fig. 15.4, and draw the tangent line at the point (x, F(x)). 
Let the intersection of this line with the y axis be labeled as point A. And label 
point B as shown. From the definition of the slope s of the tangent line, the distance 
AB is sx. Therefore, since the distance OB is F(x), the distance AO is sx — F(x). 
Note that this distance is defined to be positive if point A lies below the origin. 
In other words, sx — F(x) equals the negative of the y value of point A. In view 
of Eq. (15.45), we therefore have our geometric interpretation: 


However, recall that the Legendre transform is understood to be a function of 
the slope s. So the step-by-step procedure for geometrically obtaining the Legendre 
transform is the following. Given a function F(x) and a particular value of x, we 
can draw the tangent line at the point (x, F(a)). Let the slope be s(x). We can 
then find the y intercept of this line. A given coordinate x produces a value of this y 
intercept, so the y intercept may be thought of as a function of x. However, we can 
(generally) invert s(x) to obtain the function 2(s), so the y intercept (the Legendre 
transform) may also be considered to be a function of s. In other words, 


G(s) = (y intercept) = s a(s) — F(x(s)). (15.53) 
As an example, Fig. 15.5 shows a tangent line to the function F(x) = x”. The slope 
is s(x) = F’(x) = 2z, so the distance AO is G = sx — F(x) = (2x)x — x? = x°. 


Writing this as a function of s, we have G(s) = x(s)? = (s/2)? = s?/4, as in the 
first example above. 
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REMARK: Can two different values of s yield the same value of G(s)? Yes indeed, as shown 
in Fig. 15.6. The tangent lines with slopes sı and s2 both yield the same y intercept, so 
G(s1) = G(s2). Note that in the special case where F(x) is an even function of x, both s 
and —s yield the same y intercept, so G(s) is also an even function of s (assuming it is well 
defined; see the next paragraph). Can two different values of s with the same sign yield 
the same y intercept? Yes again, as shown in Fig. 15.7. Can two different values of s (of 
any signs) yield the same y intercept if the two associated values of x have the same sign? 
Still yes, as shown in Fig. 15.8. The equality of the two G(s) values is shown schematically 
in Fig. 15.9. The task of Problem 15.14 is to shown that F(x) must have an inflection 
point (a point where the second derivative is zero) for this last scenario to be possible. 
Can a given value of s yield two different values of G(s) (which would make the function 
G(s) not well defined)? Yes, as shown in Fig. 15.10. The same slope yields two different y 
intercepts. The double-valued nature of G(s) is shown schematically in Fig. 15.11. The task 
of Problem 15.15 is to shown that F(x) must have an inflection point (a point where the 
second derivative is zero) for this scenario to be possible. Therefore, if there is no inflection 
point (that is, if F(x) is concave upward or downward), then G(s) is well defined. & 


(SPACE LEFT BLANK) 
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Armed with the above y-intercept interpretation of the Legendre transform, we slope=s 
can now see geometrically why dG(s)/ds = x. First, note that if we are given the 
function G(s), we can reconstruct the function F(x) in the following way. For a x 
given value of s, we can draw the line with slope s and y intercept G(s), as shown G(s) — 


in Fig. 15.12. We can then draw the line for another value of s, and so on. The 
envelope of all these lines is the function F(x). This follows from the geometric 
construction of G(s) we discussed above. Fig. 15.13 shows the parabolic F(x) = x? 
function that results from the function G(s) = s?/4. 

In Fig. 15.14 consider two nearby values of s (call them sı and s2), and draw 
the lines with slopes s; and y intercepts G(s;). Due to the above fact that the y slope=s—~ 
function F(x) is the envelope of the tangent lines, these two lines will meet at 
the point (x, F(x)), where x is the coordinate that generates the coordinate s via 
the Legendre transformation (where s = sı = sz in the limit where sı and sz are 
infinitesimally close to each other). 

Define the points P, B, Aı, and A> as shown. Using triangles PBA, and 
PBAg, along with the definition of the slope s, we see that the distances BA, 
and BA» are simply the slopes multiplied by x. So they are sx, and sx. But 
from our geometric interpretation of the Legendre transform, these distances are 
also BA; = F(x) + G(s1) and BA2 = F(x) + G(s2), where this is the same F(x) 
appearing in both of these equations. We therefore have Figure 15.13 


Figure 15.12 


sıx = F(x) + G(s1), and soz = F(x) + G(s2). (15.54) 


Subtracting these gives 


N (x,F(x)) 


x 


(sg — 81)a = G(so) —G(s1) => w= jo) Bcd) ; (15.55) 


52 — S1 


In the limit sı —> s2, this becomes « = dG(s)/ds, as desired. Basically, the point is slope=s; 


that multiplying x by the difference in slopes gives the difference in the y intercepts > slope=s2 
(the G(s) values). That is, rAs = AG x = dG/ds. This is the “mirror” 
statement of the standard fact that multiplying the slope of a function by the 
difference in x values gives the difference in the F(x) values (to first order, at Figure 15.14 


least). That is, sAx = AF => s=dF/dz. 


15.3.3 Application to the Hamiltonian 


Let’s now look at how the Legendre transform relates to the Hamiltonian, H. The 
Lagrangian L is a function of q and q (and possibly t, but we’ll ignore this depen- 
dence, since it is irrelevant for the present discussion). We will choose to form the 
Legendre transform of L with respect to the variable ġ. That is, we will ignore 
the q dependence and consider L to be a function of only g. (The terminology is 
then that q is the “active” variable and q is the “passive” variable.) So q is the 
analog of the variable x in the preceding discussion, and p = 0L/0q is the analog 
of s = dF/dz. Following Eq. (15.46), the Legendre transform of L (with ġ as the 
active variable) is defined to be the Hamiltonian, 


H(q, p) = påla, p) — L(4,4(4,P)) (15.56) 


This is simply Eq. (15.46) with G, F,2,s replaced by H, L,q,p, respectively, and 
with q (the passive variable) tacked on all the arguments. 

Having defined H as the Legendre transform of L in this way, the statement 
analogous to the « = dG/ds result in Eq. (15.44) is (with G, x, s replaced by H, ġ, p, 
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respectively, and with q tacked on all the arguments) 


G(q,P) = ae (15.57) 


This is the first of Hamilton’s equations in Eq. (15.33). As mentioned in the second 
remark following Eq. (15.33), this equation comes about simply by the definition of 
the Legendre transform. There is no physics involved. 


REMARK: In practice, when solving for q as a function of t in a given problem by combining 
Eq. (15.57) with the second Hamilton’s equation, the usual strategy is not to think of ġ 
as a function of q and p as indicated in Eq. (15.57), but rather to eliminate p by writing 
it in terms of q and q (see the examples in Section 15.2.2). & 


We’ve thus far ignored the variable q. But since it is indeed an argument of H, 
it can’t hurt to calculate the partial derivative OH/Og. But we already did this in 
Eq. (15.43), and the result is 

OH(q,p) _ _ OL(q, å) 
= : 15.58 
dq Dg (15.58) 
(Note that any t dependence in L and H wouldn’t affect this relation.) So far we’ve 
used only the definition of the Legendre transform; no mention has been made of 
any actual physics. But if we now combine Eq. (15.58) with the E-L equation, 
d(0L/0q)/dt = ƏL/ðq = > p= OL/0q, we obtain the second Hamilton’s equation, 


(15.59) 
This has basically been a repeat of the derivation of Eq. (15.32). 


Many variables 


How do we form a Legendre transform if there is more than one active variable? To 
answer this, consider a general function of two variables, F (x1, x2). The differential 


of F is aF aP 
dF = dxı + dz = sıdzı + sodzo, (15.60) 
xı Ox 


where sı = OF /0x, and sz = OF /0z2 are functions of zı and z2. 

Let’s assume we want to construct a function G(s1, s2) with the properties that 
OG/0s, = xı and 0OG/0s2 = x2 (so zı and zz are functions of sı and s2). If these 
relations hold, then the differential of G is 


dG = OF as, + Ga = zıdsı + L2d82. (15.61) 
ðsı Oso 
Adding the two previous equations gives 
d(F + G) = (sıdzı = s2dx2) ap (xıdsı oa x2ds2) 
= (sıdzı T zıdsı) he (sodxg ae X2d82) 
— d(s1x1 T $202). (15.62) 


The Legendre transform of F'(a1,22), with both xı and x as active variables, is 
therefore G = sızı + 82%2 — F. But since it is understood that G is a function of 
sı and s2, we should write 


G(s1, s2) = 8&1: zılsı, s2) + 89° x2(S1, s2) = F(2(s1, 82),@2(S1, 82)). (15.63) 
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You can quickly check that this procedure generalizes to N variables. So if we 
have a function F(x), where x stands for the N variables x;, then the Legendre 
transform, with all the x; as active variables, is 


G(s) = OS sizi) ~ F(2(s)), (15.64) 


where s; = OF /0xz;, and s stands for the N variables s;. This is exactly the 
form of the Hamiltonian in Eq. (15.21), where H, L, p,q take the place of G, F, s, x, 
respectively. So in the case of many variables, H is indeed (if you had any doubts) 
the Legendre transform of L, with the q; as the active variables. 


Motivation 


Let’s now take a step back and ask why we should consider applying a Legendre 
transform to the Lagrangian in the first place. There are some deeper reasons for 
considering a Legendre transform, but a practical motivation is the following. In 
the Lagrangian formalism, we have two basic equations at our disposal. First, there 
is the simple definitional equation, 


OL 
DEES (15.65) 
q 
And second, there is the E-L equation, 
OL 
>= —— 15. 
B= 5 (15.66) 


which involves some actual physics, namely the principle of stationary action. These 
two equations look a bit similar; they both have a p, q, L, and a “dot.” However, 
one dot is in the numerator, and the other is in the denominator. So a reasonable 
goal is to try to make the equations more symmetrical. It would be nice if we could 
somehow switch the locations of the p and q in the first equation and end up with 
something along the lines of ġ = OL/Op. Now, we of course can’t just go around 
switching the placements of variables in equations. But what we can do is recognize 
that the basic effect of a Legendre transform is precisely this switching. That is, it 
takes the equation s = dF/dx and produces the equation x = dG'/ds. In the case of 
the Legendre transform from the Lagrangian to the Hamiltonian, we can turn Eq. 
(15.65) into Eq. (15.57), 

. OH 

q= ðp ` 
We now have the problem that there is an H here instead of an L, so this equation 
has lost its symmetry with Eq. (15.66). But this is easy to remedy by exchanging 
the L in Eq. (15.66) for an H. Again, we can’t just go switching letters around, but 
Eq. (15.58) shows us how to do it legally. The only “penalty” is a minus sign, and 
we end up with 


(15.67) 


(15.68) 


The two preceding equations now have a remarkable amount of symmetry. The 
variables q and p are (nearly) interchangeable. The only nuisance is the minus sign 
in the second equation. However, two points: First, this minus sign is simply the 
way it is, and there’s no getting around it. And second, the minus sign is actually 
much more of a blessing than a nuisance, for various reasons, one of which we’ll see 
in Section 15.5 when we derive Liouville’s theorem. 
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Our goal of (near) symmetry has therefore been achieved. Of course, you might 
ask why this should be our goal. But we’ll just take this as an a priori fact: When a 
physical principle (which for the subject at hand is the principle of stationary action, 
because that is the critical ingredient in both the E-L and Hamilton’s equations) 
is written in a form where a symmetry is evident, interesting and useful results are 
invariably much easier to see. 


15.4 Three more derivations 


We now present three more derivations of Hamilton’s equations, in addition to the 
one we gave in Section 15.2.2. So we’ll call them the 2nd, 3rd, and 4th derivations. 
The first of these shows in detail how the equations follow from the principle of sta- 
tionary action, so it is well worth your attention, as is the “Discussion of coordinate 
independence, symmetry” that follows. However, the other two derivations aren’t 
critical, so feel free to skip them for now and return later. As you'll see, all four 
of the derivations essentially come down to the same two ingredients (the principle 
of stationary action, plus the definition of H via the Legendre transform), so it’s 
debatable whether they should be counted as distinct derivations. But since each 
one is instructive in its own right, we’ll include them all. 

In the following three derivations, we'll deal only with the 1-D case, lest the main 
points get buried in a swarm of indices. You can easily extend the reasoning to many 
dimensions; see the discussion at the end of Section 15.2.2. Also, time dependence 
is irrelevant in the 2nd and 3rd derivations, so we won’t bother including t in the 
arguments. But it is relevant in the 4th derivation, so we’ll include it there. 


15.4.1 Second derivation 


The derivation of Eq. (15.32), 6 = —OH/0q, made use of the Euler-Lagrange equa- 
tion. Therefore (as we’ve noted before), just as the E-L equation is a consequence 
of the principle of stationary action, so are Hamilton’s equations (or at least the 
p = —OH/0q one). However, the issue of stationarity got a bit buried in the earlier 
derivation, so let’s start from scratch here and derive Hamilton’s equations directly 
from the principle of stationary action. The interesting thing we'll find is that both 
of the equations arise from this principle. 

From the definition H = pq — L, we have L = pq—H. So the action S may be 
written as 


t2 ta t2 
s= Lat= | (pg Hat = | (pdq — H dt). (15.69) 
ty ty 


ti 


If we want to be explicit with all the arguments, then the integrand should be 
written as p(t)dq(t) — H (q(t), p(t)) dt. In the end, everything is a function of t. As 
t marches along, the integral is obtained by adding up the tiny changes due to the 
dq and dt terms, until we end up with the total integral from a given tı to a given 
tg. 

The integrand depends on the functions p(t) and q(t). We will now show that 
Hamilton’s equations are a consequence of demanding that the action be stationary 
with respect to variations in both p and q. It turns out that we will need the variation 
in q to vanish at the endpoints of the path (as was the case in Chapter 6), but there 
will be no such restriction on the variation in p. 
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Let the variations in p(t) and q(t) yield the functions p(t)+dp(t) and ¢(t)+dq(t).? 
Using the product rule, the S in Eq. (15.69) acquires the following first-order change 
due to these variations (dropping all the arguments, for ease of notation): 


t2 
6S =| (spain + võtan) Gaz | 5 tet) | ; (15.70) 
t ðq Op 


In the second term here, we can rewrite pd(dq) as pd(déq). That is, the variation 
in the change (between two nearby times, due to the traversing of the path) in q 
equals the change (due to the traversing) in the variation in q. 


REMARK: This is probably better explained in plain math than plain English. Let’s 
change the notation so that f(t) is the original function q(t), and g(t) is the modified 
function q(t) + dq(t). Then dq(t) = g(t) — f(t). (To repeat, so there isn’t any confusion: 
the 6 symbol is associated with the change from the function f(t) to the function g(t), and 
the d symbol is associated with changes due to time marching along.) If we consider two 
nearby times tı and t2, then what is 6(dq)? Well, df = f(t2)—f(t1) and dg = g(tz) —g(t1). 
Therefore, 6(dq) (which is the change in dq in going from f to g) is 


5(dq) = (g(t) — g(t1)) — (fF (t2) — f(t). (15.71) 


And what is d(éq)? Well, dq(t) = g(t) — f(t). Therefore, d(ôq) (which is the change in 
g(t) — f(t) in going from tı to t2) is 


d(5q) = (g(t2) — f(t2)) — (gta) — f(t). (15.72) 


The two preceding results are equal, as promised. & 


If we now integrate the p d(ôq) term by parts, we obtain f pd(5q) = pdq—f dp dq. 
The first of these terms vanishes, assuming that we are requiring ôq to be zero at 
the endpoints (which we are). So Eq. (15.70) becomes 


t2 
6S =| dp | dq ae dt ôq | dp4 GH dt) |. (15.73) 
tı Op ôq 


If we assume that the variations dp and dq are independent (see the following discus- 
sion for more on this assumption), then the terms in parentheses must independently 
vanish if we are to have 6S = 0. Dividing each term by dt, we obtain Hamilton’s 
equations, 


dq OH 
E jee Soe ee 15.74 
T ; and Ti Dg’ (15.74) 


as desired. 


Discussion of coordinate independence, symmetry 


In view of the fact that two equations popped out of this variational argument, while 
only one (the E-L equation) popped out of the variational argument in Chapter 6, 
you might wonder if we performed some sort of cheat. Or said in another way: to 
obtain the second Hamilton’s equation, we needed to integrate by parts and demand 
that the variation in q vanish at the endpoints. But we didn’t need to do anything 
at all to obtain the first Hamilton’s equation. This seems a little fishy. Or said in 
yet another way: there was no restriction on the variation in p (as long as it was 


3The dq(t) variation here is the same as the a(t) variation we used in Section 6.2. The present 
notation makes things look a little nicer. 
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infinitesimal, since we kept only the first-order changes in Eq. (15.70)); the variation 
didn’t need to vanish at the endpoints. So we seem to have gotten something for 
nothing. 

The explanation of all this is that we did get something for nothing. Or more 
precisely, we got something by definition. As we saw in Section 15.3.3, the first 
Hamilton’s equation, ¢ = O0H/Op is true by definition due to the fact that H is 
defined to be the Legendre transform of L. So it’s no surprise that we didn’t need 
to make any effort in deriving this equation, and no surprise that there weren’t any 
restrictions on the variation in p. The first term in parentheses in Eq. (15.73) is 
simply zero by definition. 

Having said this, we should note that there are (at least) two possible points of 
view concerning the stationary property of S when written in terms of H (and thus 
written in terms of q and p, instead of the q and ġ in the Lagrangian formalism). 


e FIRST POINT OF VIEW: The facts we will take are given are (1) the action 
if L is stationary with respect to variations in q that vanish at the endpoints, 
and (2) H is the Legendre transform of L. We then find: 


Looking at the second term in Eq. (15.73), fact (1) implies that Hamilton’s 
second equation holds. Looking at the first term in Eq. (15.73), we see that 
it is zero by definition (that is, Hamilton’s first equation holds by definition), 
because of fact (2).4 It then follows that 6S = 0, no matter how we choose to 
vary p. So we conclude that we can vary q and p independently and have $ 
be stationary, as long as Hamilton’s equations are satisfied. 


e SECOND POINT OF VIEW: The facts we will take are given are (1) we start 
with the Hamiltonian, H(p,q), and pretend that we don’t know anything 
about Legendre transforms or the Lagrangian (and in particular the definition 
p = OL/0q), and (2) the integral? {(pq— H)dt is stationary with respect to 
independent variations in the variables q and p, We then find: 


Upon deriving Eq. (15.73) above, facts (1) and (2) immediately imply that 
both of Hamilton’s equations must be satisfied. 


Note that in both of these points of view, the variables q and p are independent. 
In the first point of view, this independence is reached as a conclusion (by using the 
properties of the Legendre transform); while in the second, it is postulated. But 
in either case the result is the same: q and p are independent variables, unlike the 
(very dependent) variables q and q in the Lagrangian formalism. 

However, in the end this independence is fairly irrelevant. The important issue 
is the symmetry between q and p in Hamilton’s equations (the minus sign aside).” 
This fundamental difference between the Hamiltonian formalism (where q and p are 
symmetric) and the Lagrangian formalism (where q and ġ are not) leads to many 
added features in the former, such as Liouville’s theorem which we'll discuss in 
Section 15.5 below. 

4Note that this reasoning makes no mention of dp variations, in particular their independence 
of ôq variations. 

5 Yes, this integral comes out of the blue, but so does the fT — V) integral in the Lagrangian 
formalism. 

6Note that the q in the integral, being the time derivative of q, is very much dependent on q; 
the variation in qg is simply the time derivative of the variation in q. The ġ just stays as q here; 
we are not writing it as ġ(q, p), because this functional form is based on the definition p = 0L/0q, 
which we’re pretending we don’t know anything about. 

TOf course, if you take the second point of view above, you would say that this symmetry 


between q and p is a consequence of their independence. But in the first point of view, the 
symmetry is based on the Legendre transform. 
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15.4.2 Third derivation 


The strategy of this derivation will be to calculate the differential of H (q, p) by first 
calculating the differential of L(q,q), and to then read off the partial derivatives, 
OH/0q and 0H/Op, from the result. 

The Lagrangian L(q,q) is a function of q and q, so let’s consider the change in 
L(q,q) brought about by changes in q and q. If we label these changes as dq and 
dq, then the change in L(q, q) is, by definition, 

_ OL(a,@) 


dL(q, å) = dq + 


34 dq. (15.75) 


REMARK: As noted in Footnote 7 in Chapter 6, we are not assuming that the changes 
dq and dq are independent. They are in fact quite dependent; if q becomes q + €, then q 
becomes g + €. So we have dq = ¿ = de/dt = d(dq)/dt. The dependence here is clear, in 
that one change is the derivative of the other. This is perfectly fine, because in writing 
down the change in L(q,q) in terms of the partial derivatives in Eq. (15.75), nowhere is 
it assumed that these partials are independent. Equation (15.75) arises simply by letting 
q — q + dq and g — q+ dg in L(q,q), and by then looking at what the first order change 
is. For example, if L = Aq + Bq+Cq’q’, then you can quickly use the binomial expansion 
to show that the substitutions q — q + dq and q — q+ dg yield a first order change in L 
equal to 
dL = Adq+Bdgq+C(2qq° dq +38? dq) 
= (A+ 2Cqq°) dq + (B +304’) då 
OL OL 


= Z d+ då 15. 
F I+ 3; å, (15.76) 


as desired. Even if we have a function f(x,y) of two variables that are trivially dependent 
(say, y = 5x), the relation df = (Of /Ox) dx + (Of /Oy) dy still holds. & 


Recalling the definition p = OL(q,q)/0q in Eq. (15.19), we can rewrite Eq. 
(15.75) as 
OL(q, å : 
dL(q,4) = POD ag + pdi, (15.77) 
Our goal here is to say something about the Hamiltonian, which is function of q 
and p, but not of q, so let’s get rid of the dq term by using 


d(pq) =qdp+pdq => pdq=d(pq)— į dp. (15.78) 


This is basically the integration-by-parts trick that we’ve used many times in the 
past, which in the end is nothing more than the product rule for derivatives. Equa- 
tion (15.77) can now be written as 


OL(a, å Zs 
dL(q,4q) = “ee ag dlpd) — ġdp 
OL(q, g 
= d(pq-L) = ENED ag + gdp. (15.79) 


If we now use the definition p = OL(q, q)/Oq to invert and solve for q in terms of q 
and p, we can write the pq — L(q,q) on the left hand side in terms of only q and p, 
with no q’s, as pq(q,p) — L(q,q(q,p)). But this is simply the Hamiltonian H (q, p) 
defined in Eq. (15.20), so we have 


OL(q, å) 


dH (q, p) = — ag 


dq + ġ dp. (15.80) 
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But by definition, we also have 


oH (q, p) OH (q, p) 
dH = dq + dp. 15.81 
(ap) = 9 peep (15.81) 
Comparing the two previous equations, we therefore find 
UG SOE ps, a (15.82) 
ôq ôq Op 


The inclusion of the arguments of the functions is very important. In particular, 
the OL /0q term here is OL(q, q)/Oq and not OL(q, å(q, p))/3q; see the first remark 
following Eq. (15.33). 

Up to this point we have done nothing except make definitions and manipulate 
mathematical relations. We haven’t done any physics. Given the definitions of p and 
H we have made, the equations in Eq. (15.82) are are simply identically true. But 
we will now invoke some actual physics and use the fact that the E-L equation states 
that (d/dt)((OL(q, ¢)/0qg) = OL(q, 4)/Əq. Using the definition p = OL(q, ¢)/04, this 
becomes p = OL(q,q)/Oq. Substituting this in the first of Eqs. (15.82), we have 
Hamilton’s equations, 

_ _9H(q,p) 


OLE cg ge ONO) (15.83) 
ôq Op 
The second of these equations is an identically true mathematical statement, 
given the definitions of p and H in terms of L (with the definition of H being 
motivated by the Legendre transform); there is no physics is involved. But the first 
equation uses the E-L equation, which arose from the physical statement that the 
action is stationary. 


15.4.3 Fourth derivation 


In this derivation, we will use the fact that H is the Legendre transform of L. And 
we will also use the fact that dH /dt = —OL/0t (see below), which in turn depends 
on the Euler-Lagrange equation being satisfied, or equivalently on the action being 
stationary. The t argument of the functions will be important in this derivation, so 
we'll include it throughout. 

Because H is the Legendre transform of L, the definition p = OL(q, q,t)/0q 
implies the “mirror” statement, 


__ OH(q,p,t 
po CRG. 


D: (15.84) 


This is the first of Hamilton’s equations. But as we have already noted many times, 
this equation isn’t a physical statement, but rather a mathematical one arising 
from the definition of H as the Legendre transform of L. Alternatively, you can 
just derive it from scratch as we did in Eq. (15.28) (although this still relies on the 
fact that H is the Legendre transform of L). Therefore, as far as the first Hamilton’s 
equation is concerned, this fourth derivation is basically the same as the first one. 

Now let’s derive the second Hamilton’s equation. In Chapter 6, we showed in Eq. 
(6.54) that the E defined in Eq. (6.52/15.1) satisfies dE (q, q, t)/dt = —ƏL(q, å, t)/Ot. 
Therefore, since H and E are the same quantity, we also have 

dH(q,p,t) _ _ OL(q,4,t) 


ae a (15.85) 
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The fact that H’s arguments (q,p,t) are different from E’s arguments (q,4q,t) is 
irrelevant when taking the total time derivative, because in the end, H and E are 
the same function of t when all the coordinates are expressed in term of t. So perhaps 
we should write H(q(t), p(t), t), etc. here, but that gets to be rather cumbersome. 

We’ll need one more fact, namely 3H (q, p, t)/3t = —OL(q,q,t)/Ot. This can be 
shown as follows. Since H(q,p,t) = på(q, p, t) — L(q, q(q, p, t),t), we have 


ƏH(q, p.t) _ 34 _ (OL, 4,t) 0g | Lla åt) 
at Pat aq. at at 


(15.86) 


But p = OL(q, q, t)/Ot, so the first and second terms on the right-hand side cancel, 


and we are left with ; 
OH(q,p,t) _ OL(q, 4, t) 


Ot at 
as desired. Eqs. (15.85) and (15.87) therefore imply that dH/dt = 0H/dt. 
The second Hamilton equation now quickly follows. The chain rule yields 


(15.87) 


dH ƏH., ðH. OH 
dt ðq i 


(15.88) 


Using dH/dt = OH/dt, along with ¢ = OH/Op from Eq. (15.84), the preceding 
equation reduces to 


ðH. figs .[(0H <= 5 OH 
Opg ra = i(G +8) =0 => p=-—, (15.89) 
which is the second of Hamilton’s equations, as desired. (We’re ignoring the trivial 
case where ġ is identically zero.) 

As in the other three derivations of Hamilton’s equations, the first equation is 
simply a mathematical statement that follows from the properties of the Legendre 
transform, while the second equation is a physical statement that can be traced to 
the principle of stationary action. 


REMARK: The main ingredient in this derivation of Hamilton’s second equation was the 
dH/dt = —OL/0t fact in Eq. (15.85). In the special case where 0L/Ot = 0, this gives 
dH/dt = 0. That is, energy is conserved. So Hamilton’s second equation is closely related 
to conservation of energy (although it should be stressed that the equation holds even 
in the general case where 0L/0t # 0). This shouldn’t be a surprise, because we already 
know that Hamilton’s equations contain the same information as Newton’s F = ma law, 
and we originally derived conservation of energy in Section 5.1 by integrating the F = ma 
statement. & 


15.5 Phase space, Liouville’s theorem 


15.5.1 Phase space 


Consider a particle undergoing motion (let’s just deal with 1-D) governed by a given 
potential (or equivalently a given Lagrangian, or equivalently a given Hamiltonian). 
At each point in the motion, the particle has a certain value of q and a certain value 
of p. That is, each point in the particle’s motion is associated with a unique point 
(q, p) in the q-p plane. The space spanned by the q and p axes is called phase space. 
The coordinates q and p are often taken to be a standard Cartesian coordinate x 
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Figure 15.16 
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and the associated linear momentum mz, or perhaps an angular coordinate 0 and 
the associated angular momentum mr26. But it should be stressed that all of the 
following results hold for a generalized coordinate q and the conjugate momentum 
p = OL/04. 

If a particle is located at a particular point (qo, po) at a given instant, then its 
motion is completely determined for all time. This is true because given the values 
of qo and po at a given instant, Hamilton’s equations, Eqs. (15.33), yield the values 
of ġ and p at this instant. These derivatives in turn determine the values of q and p 
at a nearby time, which in turn yield the values of ġ and p at this nearby time, and 
so on. We can therefore march through time, successively obtaining values of (q, p) 
and (q,p). So the initial coordinates (qo, po), together with Hamilton’s equations, 
uniquely determine the path of the particle in phase space. 


Example 1 (Constant velocity): Consider a particle that moves with constant 
velocity in 1-D. Since there is no force, the potential energy is constant (which we 
will take to be zero), so the Hamiltonian is H = p? /2m. You can quickly show 
that Hamilton’s equations are ġ = p/m and p = 0. The second of these is the 
statement that p doesn’t change, as expected. So the possible “curves” in phase 
space are horizontal lines, as shown in Fig. 15.15. The actual line the particle is 
on is determined by the initial coordinates (qo, po). Note that the lines associated 
with negative p are traced out leftward (that is, in the direction of decreasing q), 
as should be the case. Note also that although all the lines look basically the same 
on the page, the particle traverses the ones with larger |p| more quickly, due to the 
above ġ = p/m equation; the larger the |p|, the greater the rate at which q changes, 
as expected. 


Example 2 (Falling balls): With positive q defined to be downward (which 
will reduce the number of minus signs in the expressions below), the Hamiltonian 
for a vertically thrown ball is H = p?/2m — mgq. You can quickly show that 
Hamilton’s equations are ġ = p/m and p = mg. Eliminating p gives the familiar 
equation of motion, g = g. Integrating twice then gives the standard result, q(t) = 
qo + vot + gt? /2. However, if we want to see what the path looks like in phase space, 
the best thing to do in general is to solve for q in terms of p (or p in terms of q, 


whichever is easier). Since p = mg po — p = mgt t = (p — po)/mg, we 
can eliminate t from the above expression for q(t) to obtain 
po\ (p—po\ , 9 (p—po\” p= pi 
= t = t ; 15.90 
q(p) = qo + (2) ( r ) 5 ( a ) 90 + mg (15.90) 


We see that q is a positive quadratic function of p, which means that the curves in 
phase space are rightward-opening parabolas, as shown in Fig. 15.16. Note that the 
curves are traced out leftward below the q axis (where p is negative), and rightward 
above the q axis (where p is positive), as expected. 


REMARK: Another way of obtaining Eq. (15.90) is to use the standard kinematic result for 
constant acceleration, v? = v? + 2a(x_¢ — x;). You can verify that this does yield the correct 
q(p). And yet another way is to simply note that H is constant (because it has no explicit 
time dependence), so p?/2m — mgq = C, where C is determined by the initial conditions. 


So C must equal pe /2m — mgqo, and we obtain Eq. (15.90). & 


The q in Eq. (15.90) takes the form of q(p) = Ap? + B, where B = qo — po/2m7g 
depends on the initial coordinates po and qo, but where A = 1/2m?g doesn’t. Be- 
cause A is the same for every curve, the curves are all parts of parabolas with the 
same curvature. The only difference between the curves is the q intercept (namely 
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B), and also the location along the underlying parabola where the motion starts. 
This is shown in Fig. 15.17. 


What is the slope of a curve at a given point (q,p)? From Eq. (15.90) we have 
dq/dp = p/m*g, so the slope of the curve in phase space is dp/dq = m7g/p. This 
has the properties of being infinite when p = 0 and being independent of q, both of 
which are evident in the above figures. This slope can also be obtained by dividing 
the two Hamilton’s equations from above: 


dp _ dp/dt p _ mg _ mg 
dq dq/dt q p/m p` 


(15.91) 


Example 3 (Harmonic oscillator): The Hamiltonian for a harmonic oscillator is 
H = p?’ /2m + kq?/2. You can quickly show that Hamilton’s equations are ġ = p/m 
and p = —kq. To solve for q in terms of p, you could solve for q in terms of t 
(the result would be a trig function) and then differentiate to obtain p (another trig 
function) and then eliminate t. You would find that p? /2m+kq?°/2 equals a constant 
that depends on the initial conditions (namely, pô /2m + kqĝ/2). Or you could do 
it the easy way and just note that H is constant (because it has no explicit time 
dependence), which immediately gives the same result. Either way, we have 


pkg _ po, ko 
2m 2 m 2° 


(15.92) 


This is the equation for an ellipse in the q-p plane. The (q, p) coordinates of the 
particle keep going around and around a particular ellipse as time goes by, as shown 
in Fig. 15.18. 

From Eq. (15.92), the q intercept (that is, where p = 0) of a given ellipse is located 
at q? = po/km + qê. And the p intercept (that is, where q = 0) is located at 
p? = pê + kmq. The curves are traced out leftward below the q axis (where p 
is negative), and rightward above the q axis (where p is positive), as expected. 
From Hamilton’s equations, the slope of a curve at a given point is dp/dq = p/q = 
—kq/(p/m) = —kmq/p. This is infinite when p = 0 and zero when q = 0, in 
agreement with Fig. 15.18. 

Although it isn’t obvious from the figure, each ellipse is traced out in the same 
amount of time (as we know from Chapter 4). But this is at least believable, because 
the larger ellipses have points with larger values of |p|, so the curves are traversed 
more quickly there, allowing (apparently) the larger total distance to be covered in 
the same amount of time. 


In the above examples, none of the paths cross each other. This is a general 
result; paths in phase space never cross. The proof is simple: if two paths did 
cross, then there would be two different velocity vectors (q,p) at a given point. 
However, Hamilton’s equations uniquely determine the velocity vector (q,p) = 
(OH /Op, —OH/0q) at a given point (q,p). So there must in fact be only one curve 
through any given point, and hence no crossing. Another fact along these lines is 
that a path in phase space can never branch into two paths, even if the velocities 
of the two branches are equal at the branching point. The demonstration of this is 
the task of Problem 15.17. 

Note that there is no mention of time in the above phase-space plots. So as we 
mentioned, you can’t just look at a curve and tell how fast the particle is traversing 
it. But you can easily determine this by using Hamilton’s equations (assuming 
you’ve been given the Hamiltonian) to write down the particle’s velocity vector in 


Figure 15.17 


(4o,Po) 


Figure 15.18 
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Figure 15.19 
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phase space: 
Son: OH OH 
(å, p) = (= =). (15.93) 


Note also that there is no mention of absolute time. If the Lagrangian (and hence 
Hamiltonian) doesn’t depend on time, so that the partial derivatives in Eq. (15.93) 
depend only on q and p, then if a particle has initial coordinates (qo, po) at, say, 
3:14 pm, it will trace out exactly the same path in phase space as another particle 
with the same initial coordinates that started earlier at, say, 1:59 pm. 

In the case of N dimensions, there are N coordinates q; and N momenta p;, so 
phase space has 2N dimensions. This is clearly a bit difficult to draw for N > 1, 
but the scenario is the same as above: a particle starts at a given location in phase 
space, and then the 2N Hamilton’s equations dictate the ensuing path. 

If you wanted, you could make a plot of q vs. q instead of p vs. q, but it wouldn’t 
be very useful. The advantage of q-p phase space is that the motion is governed by 
Hamilton’s equations which are (nearly) symmetric in q and p, whereas there is no 
such symmetry with q and q. This symmetry leads to many useful results, the most 
notable of which is Liouville’s theorem... 


15.5.2 Liouville’s theorem 


Loosely stated, Liouville’s theorem states that a given initial region in phase space 
keeps the same area (even though the shape invariably changes) as it moves though 
phase space due to the (q,p) = (OH/Op, -OH/0q) velocity vector of each point in 
the region. Before giving a proof of the theorem, let’s look at two examples to get 
a feel for it. 


Example (Constant velocity): Consider the setup in the first example (“Con- 
stant velocity” ) from Section 15.5.1, and consider the rectangle in phase space shown 
in Fig. 15.19. This rectangle (including its interior) may be thought of as represent- 
ing the (q, p) coordinates of a very large number of particles (so that we may view 
them as being essentially continuously distributed in phase space) at a given time.’ 
What shape does this rectangle get carried into at a later time? What is the area 
of this shape? 


Solution: Let the initial rectangle span a height from pı to p2 and a width from 
qi to q2. As time goes by, the points on the top side of the rectangle move faster to 
the right than the points on the bottom side, due to the different p (which equals 
mq) values. Therefore, at a later time we have a parallelogram as shown. For the 
sake of drawing the figure we have arbitrarily assumed that p2 = 2p1, which means 
that points on the top side move twice as far to the right as points on the bottom 
side. Note that the left and right sides do indeed remain straight lines, because the 
speeds with which points move to the right in phase space grow linearly with p. So 
the desired shape is a parallelogram, and it becomes more and more sheared as time 
goes by. 

What is the area of this parallelogram? The top and bottom sides always have the 
same lengths as the top and bottom sides of the initial rectangle (namely qi — q2), 
because points with the same value of p move with the same speed (namely p/m) to 
the right, so the relative distances don’t change. And the height of the parallelogram 


8In particular, the corners of the rectangle represent four objects at a given instant. Two start 
with momentum pj at positions qi and q2, and two start with momentum p2 at positions qı and 
q2. 
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is still p2 — pı, because none of the speeds change which means there is no vertical 
motion in phase space. Therefore, the area is (q2 — q1)(p2 — pi) which is the same 
as the area of the original rectangle. 


This “conservation of area” result actually holds for any arbitrarily-shaped initial 
region in phase space, not just a rectangle. This follows from the fact that we can 
break up the initial region into a large number of infinitesimal rectangles, and we 
can then use the reasoning in the previous paragraph for each little rectangle. So 
we see that the area in phase space is conserved. That is, if we look at any region 
in phase space and then look at the area that this region gets mapped into at any 
later time, the final area will equal the initial area. 


For all we know, this “conservation of phase space area” rule might hold only for 
the preceding specific scenario involving constant velocity. So let’s look at another 
example and see what happens to the area of an evolving phase-space region. 


Example (Falling balls): Consider the setup in the second example (“Falling 
balls”) from Section 15.5.1, and consider the rectangle in phase space shown in 
Fig. 15.20. What shape does this rectangle get carried into at a later time? What 
is the area of this shape? 


Solution: It turns out that we again end up with a parallelogram, and it again has 
the same area as the initial rectangle. The parallelogram shape follows from two 
facts. First, the top and bottom sides always have equal lengths (namely q2 — q1), 
because points with the same value of p move with the same speed (namely p/m) 
to the right in phase space.’° And second, the left and right sides remain straight 
lines, because the speeds with which points move to the right in phase space grow 
linearly with p, and also because all points move vertically with the same speed, 
namely p = mg from the second of Hamilton’s equations. (See Problem 15.16 for 
the general condition under which straight lines in phase space remain straight.) 
What is the area of the parallelogram? Since all points move vertically in phase 
space with the same speed (namely p = mg), the height of the parallelogram always 
remains p2 — pi. And since the “base” sides always have length q2 — qi, the area is 
always (q2 — q1)(p2 — pi), which is the same as the area of the original rectangle. 
Again, as in the preceding example, this result holds for any arbitrarily-shaped initial 
region in phase space, because we can break up this region into a large number of 
infinitesimal rectangles for which we know the result holds. 


Problem 15.19 deals with the third example (“Harmonic oscillator” ) from Section 
15.5.1, and the result is the same: the area of a given region in phase space doesn’t 
change as it moves though phase space.'! This “conservation of area” result is a 
general one; it holds for any system, not just for the simple setups above. It is 
known as Liouville’s theorem, and the proof is as follows. 


9Physically, the corners of the rectangle represent four balls at a given instant. Two are thrown 
downward with momentum pı from positions qi and q2, and two are thrown downward with 
momentum pz from positions qı and q2. 

10Tf you want to picture balls falling, the two balls associated with, say, the two bottom corners 
of the rectangle starts out a distance q2 — qi apart with the same speed. Therefore, since the 
acceleration is independent of position, they always have the same speed and hence always remain 
a distance q2 — qi apart. 

11The parallelogram result is actually the same too, but this doesn’t hold in general for other 
setups; see Problem 15.16. 


Figure 15.20 
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Theorem 15.2 (Liouville’s Theorem) Given a system of N coordinates qi, the 
2N-dimensional “volume” enclosed by a given (2N — 1)-dimensional “surface” in 
phase space is conserved (that is, independent of time) as the surface moves through 
phase space. 


Proof: Let’s restrict ourselves to the N = 1 case for simplicity. After proving the 
theorem for this case, it is fairly straightforward to generalize to higher dimensions 
(the task of Problem 15.18). For one-dimensional motion, phase space is two- 
dimensional (one q axis and one p axis), so the “volume” in the statement of the 
theorem is simply an area, and the “surface” is a closed curve that bounds this area. 
Our goal is therefore to show that if we look at the area bounded by a given closed 
curve in the q-p plane at a given time, and if we consider where this curve ends up 
at an arbitrary later time and look at the area it bounds, then these two areas are 
equal. 

We’ll demonstrate this equality by considering an arbitrary curve and its image 
an infinitesimal time dt later. If we can show that these two curves enclose the 
same area,!? then we can march forward through time in small steps, with the 
area remaining the same after each step. It then follows that the final region at an 
arbitrary later time has the same area as the initial region. 

The velocity vector of a given point (corresponding to a given object — a ball, 
atom, or whatever) as it moves through phase space is by definition v = (å, p). 
So the displacement vector in the small time dt is (dq,dp) = vdt = (q,p) dt. 
Fig. 15.21 shows how these displacement vectors take the points on the initial curve 
Co and carry them into the points on the new curve Cq;.'° Our goal is to show that 
the area of Cg, equals the area of Co. 

How might the area change as the curve C moves (which in general involves 
translation, rotation, and distortion) through pase space? It increases due to the 
area it picks up in the right part of the figure (at least in the scenario we’ve shown), 
and it decreases due to the area it loses in the left part. To get a handle on the 
increase in the right part, consider the motion of two nearby points, A and B, in 
Fig. 15.21. The corresponding increase in the area bounded by C is the tiny area 
of the shaded region shown. If A and B are sufficiently close together and if dt 
is sufficiently small, then this shaded region is essentially a parallelogram (because 
the two displacement vectors are essentially equal). 

A close-up view of the shaded region is shown in Fig. 15.22. Its area equals 
dé times the height dh, where dh is the component of the displacement vector, 
v dt = (q,p) dt, that is perpendicular to the curve. Using Eq. (B.2) from Appendix 
B, we can take advantage of the dot product to write dh as dh = n- (v dt), where n 
is defined to be the (outward) unit vector perpendicular to the curve. (The cos @ in 
the dot product picks out the component perpendicular to the curve.) The area of 
the shaded region is therefore dé dh = d£ (n -v dt). Note that this automatically gets 
the sign of the area correct. For example, in the left part of Fig. 15.21, v dt points 
inward and n points outward (by definition), so the area of a tiny parallelogram 
there is negative. That is, it signifies a decrease in the area bounded by the curve, 
as it should. 


12Technically, we need to show only that the areas are the same to first order in dt here, but 
the end result will be that they are equal to all orders. 

13 An infinitesimal dt wouldn’t yield the fairly large difference between Co and Cat that is shown 
in the figure; we’ve exaggerated the difference so that the displacement vectors are discernable. 
Also, note that since the velocity vector (ġ, p) is in general a function of the position (q, p), the 
displacement vectors’ lengths and directions can vary depending on the location on the curve, as 
shown. 
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The total change in area, dA, in going from Co to Ca is the sum of the little 
changes arising from all the little parallelograms. In other words, it is the integral 
of the parallelogram areas over the whole curve, which is 


dA 

dA= | (n-vat) at => <= [vs (nde) (15.94) 
C dt Jo 

But the right-hand side of this equation is exactly the type of integral that appears 

in the divergence theorem (or “Gauss’ theorem”). In the case of a 2-D surface 

bounding a 3-D volume, this theorem is stated in Eq. (B.19) as 


[v-pav= f F-aa, (15.95) 
y S 


but it holds generally for any (n — 1)-D “surface” bounding an n-D “volume.” 
In (reasonably) plain English, the divergence theorem says that the integral of 
the divergence of a vector field over a “volume” equals the flux of the vector field 
through the bounding “surface” (which is obtained by integrating the component of 
the vector perpendicular to the surface). The proof in the general n-D case proceeds 
by exactly the same reasoning we used in the 3-D “proof” in Section B.5.14 

The right-hand side of Eq. (15.94) is the flux of the vector v through the “sur- 
face” (which is just the 1-D curve C here), with ndé being analogous to the dA 
in Eq. (15.95).1° So in the 2-D case of a curve bounding an area, the divergence 
theorem says 


[v-vea= f vaan: (15.96) 


This actually holds for any vector field v, but we’re concerned only with the velocity 
vector v here. Combining Eq. (15.94) with Eq. (15.96) gives 


dA 
dt JA 


Og 4 
—+— J] dA. 15.97 
ib C Op ve 
This is valid for any general velocity field v = (q,p). But we will now invoke the 


fact that we are actually dealing with the motion of points in phase space, which 
means that the motion is governed by Hamilton’s equations, 


ee eggs eee (15.98) 
Op 


MBasically (in the language of the 3-D case), the divergence measures the flux through an 
infinitesimal cube, so if we break up the volume V into many little cubes, and if we add up (that 
is, integrate) the fluxes through all the little cubes throughout V (which equals the left-hand side 
of Eq. (15.95)), then the fluxes cancel in pairs in the interior of V (because whatever flows out of 
one cube flows into another), so the only fluxes that don’t get canceled are the ones through the 
faces of the cubes that lie on the boundary surface C. We therefore end up with the flux through 
C (which equals the right-hand side of Eq. (15.95)). In the present 2-D case, analogous reasoning 
holds if we break up the area bounded by C into many little rectangles. 

15 Recall that in the 3-D case, dA is defined to be the vector perpendicular to the surface, with its 
magnitude equal to the tiny area element. In the present 2-D case, n dé is the vector perpendicular 
to the curve, with its magnitude equal to the tiny arclength element. 
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Plugging these into Eq. (15.97) gives 


aw ~ L,(a5 (a) +35 (ay) 
7 [aa aan) 4 


= 0, (15.99) 


where we have used the fact that partial derivatives commute. W 


REMARKS: 


1. In the case where there are N coordinates qi, so that phase space has 2N dimensions, 
the proof of Liouville’s theorem is basically the same as above. The only difference 
is that now there are N pairs of the cancelations in Eq. (15.99), as you will see in 
Problem 15.18. 


2. As mentioned earlier, the (near) symmetry in Hamilton’s equations is what allows 
Liouville’s theorem to hold. It is this symmetry that leads to the two terms in Eq. 
(15.99) looking very similar. And it is the “nearness” of the symmetry (with the 
one minus sign) that leads to the two terms actually canceling. 


3. Liouville’s theorem is sometimes stated tersely as: the density in phase space remains 
constant. But you should be careful not to interpret this as the density at a given 
fixed point in phase space remaining constant. This is certainly not always true. For 
example, in the “Constant velocity” example above, the density at a given point 
inside the initial rectangle is nonzero. But after the parallelogram flows to the right, 
the density at the given point becomes zero. 


The correct interpretation is that if you follow a point as it moves through phase 
space, then the density in the near vicinity of this point remains constant. That is, if 
you look at the density of particles (balls, atoms, or whatever) inside a small region 
around the point, then this density remains constant as the point and surrounding 
region move through phase space. This is true because the area of the region remains 
constant due to Liouville’s theorem, and also because the number of particles remains 
constant (they don’t magically appear or disappear). & 
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15.6 Problems 


Section 15.1: Energy 


15.1. Time dependence * 


Consider a Cartesian coordinate x and a Lagrangian L = m&?/2 — V(x). 
Show that if another coordinate q depends on both x and t, or equivalently 
if x depends on both q and t (that is, x = x(q,t)), then L(q,q,t) yields an 
E = (OL/0q)q — L that takes the form, 


B=T+V—m (5) (3) ġ+ (57) . (15.100) 


15.2. Conservation of E x 


Consider the Lagrangian, L = mg? /2 — V (x). Show that the statement that 
the E in Eq. (15.1) is conserved is essentially the same statement as the Euler- 
Lagrange equation. 


Section 15.2: Hamilton’s equations 
15.3. Poisson brackets 
Consider a function f(q, p) of the coordinates q and p. Use Hamilton’s equa- 
tions to show that the time derivative of f can be written as 
df OfoH Of0H 
dt ðq Op Op ðq ` 


(15.101) 


Note: this combination of partial derivatives comes up often enough to warrant 
a name, so the Poisson bracket of two functions, fı and f2, is defined to be 


_ Of, Of2 ofif 


(15.102) 


With this definition, the time derivative of f takes the nice compact form, 
df /dt = { f, H}. 
15.4. Hamilton’s equations for many variables ** 


Show that for a system with N degrees of freedom, the 2N Hamilton’s equa- 
tions are 


oH OH 
=p and Pi=-Z-, for LSiSN. (15.103) 


15.5. Equivalent Lagrangians ** 


Given a Lagrangian L(q,q,t) that describes a certain system, is it possible 
to find another Lagrangian that describes the same system? That is, can we 
construct another L’(q,q,t) that yields the same equation of motion? The 
answer is certainly “yes,” in the trivial sense that we can add on a constant to 
L, which of course doesn’t change the equation of motion (which involves only 
derivatives of L). But are there any nontrivial changes to L we can make? 
The answer is again “yes.” Show that if we construct a new Lagrangian L’ 
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15.6. 


15.7. 


15.8. 


15.9. 


15.10. 


15.11. 


by adding to L the time derivative of an arbitrary function F'(q,t) depending 
on q and t (but not 4), 


dF (q,t 
L'(q,4,t) = L(q,4,t) + d (15.104) 


then L’ has the same equation of motion.!® 


Atwood’s 1 x 


Consider the Atwood’s machine shown in Fig. 15.23. Let x be the vertical posi- 
tion of the left mass, with upward taken to be positive. Find the Hamiltonian 
in terms of x and its conjugate momentum, and then write down Hamilton’s 
equations. 


Atwood’s 2 *x 


Consider the Atwood’s machine shown in Fig. 15.24 Let x and y be the vertical 
positions of the middle mass and right mass, respectively, with upward taken 
to be positive. Find the Hamiltonian in terms of x and y and their conjugate 
momenta, and then write down the four Hamilton’s equations. 


Two masses and a spring ** 


Two beads of mass m are connected by a spring (with spring constant k and 
relaxed length £) and are free to move along a frictionless horizontal wire. Let 
the position of the left bead be x, and let the stretch of the spring (relative 
to equilibrium) be z. Find the Hamiltonian in terms of x and z and their 
conjugate momenta, and then write down the four Hamilton’s equations. (See 
Exercise 15.28 for a slightly easier variation of this problem.) 


y = f(x) constraint ** 


A particle of mass m is constrained to move on a curve whose height is given by 
the function y = f(x). Find the Hamiltonian in terms of x and its conjugate 
momentum, and then write down Hamilton’s equations. Show that the p = 
—0OH/Ox equation reproduces the E-L equation. 


Spring and moving wall «x 


A mass m is connected to a wall by a horizontal spring with spring constant 
k and relaxed length o. The wall is arranged to move back and forth with 
position Xwa = Asinwt. Let z measure the stretch of the spring. Find the 
Hamiltonian in terms of z and its conjugate momentum, and then write down 
Hamilton’s equations. Is H the energy? Is H conserved? (See Exercise 15.26 
for a slightly easier variation of this problem.) 


Bead on a rotating hoop ** 


A bead is free to slide along a frictionless hoop of radius R. The hoop rotates 
with constant angular speed w around a vertical diameter (see Fig. 15.25). Find 
the Hamiltonian in terms of the angle 0 shown and its conjugate momentum, 
and then write down Hamilton’s equations. Is H the energy? Is H conserved? 


Section 15.3: Legendre transforms 


16This problem probably belongs in Chapter 6, since it deals only with Lagrangians. However, 
the existence of equivalent Lagrangians of the form in Eq. (15.104) is the basis of the so-called 
canonical transformations which lead to the Hamilton-Jacobi extension of Hamiltonian mechanics. 
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15.12. F(x) =a" * 
Find the Legendre transform of F(x) = x”, and verify that dG(s)/ds = a. 


15.13. Alternate definition of G(s) * 
Given a concave function F(x), an alternate definition of the Legendre trans- 
form G(s) is that it equals the maximum value of the quantity sa — F(z), 


where s is assumed given and z is allowed to vary.!” Explain why this agrees 
with the y-intercept definition of G(s) given in Eq. (15.53). 


15.14. Equal G(s) values ** 


Let G(s) be the Legendre transform of F(x). Show that if two different values 
of s yield the same value of G(s), and if additionally the two associated values 
of x have the same sign, then F(x) must have an inflection point (a point 
where the second derivative is zero). 


15.15. Double-valued G(s) * 


Let G(s) be the Legendre transform of F(x). Show that if G(s) is double 
valued (that is, if a given value of s yields two different values of G(s)), then 
F(x) must have an inflection point (a point where the second derivative is 
zero). 


Section 15.5: Phase space, Liouville’s theorem 
15.16. Straight lines ** 
Under what conditions do straight lines in phase space always remain straight 
lines as time evolves? 
15.17. No branching ** 


Show that a path in phase space can never branch into two paths, even if the 
velocities of the two branches are equal at the branching point, as shown in 
Fig. 15.26. 


15.18. Liouville for higher N * 
Prove Liouville’s theorem for the general case of N coordinates q;, with phase 
space being 2.N-dimensional. 

15.19. Harmonic oscillator ** 


Consider a harmonic oscillator governed by the Hamiltonian H = p?/2m + 
kx? /2. If a; and p; are the initial position and momentum, then from the 
results of Chapter 4, we know that x and p at a general later time are given 
by 


(x, p) = («i cos wt + om sinwt, —mw2j;sinwt + pi cos wt) ; (15.105) 
Mw 


Consider the rectangle shown in Fig. 15.27, with one corner at the point (£o, po) 
and sides of lengths Az and Ap. Show that at any later time, the image of 
the rectangle is a parallelogram with area Az Ap. 


17Note that no mention of the slope is made in this definition; it is not assumed that s represents 
the slope for the coordinate x (which is allowed to vary, whereas s is fixed). 


Figure 15.26 
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15.20. 


15.21. 


Harmonic oscillator, easier method * 


This problem demonstrates the result of the previous problem in a simpler 
way. To make Eq. (15.105) more symmetrical, consider the variable z = mwa. 
The coordinates (z,p) as functions of time are then 


(z,p) = (zi coswt + pj sinwt, —z sin wt + pi cos wt) . (15.106) 


Describe the motion of a given initial region in the z-p plane, and then use the 
result to show that the area of a given initial region in the x-p plane doesn’t 
change. 


Liuoville’s theorem **x* 


This problem gives another proof of Liouville’s theorem. Consider an infinites- 
imal rectangle in the z-p plane with side lengths Ax and Ap. In an infinites- 
imal time dt, this rectangle gets mapped into another region, as shown in 
Fig. 15.28. Show that this new region is a parallelogram and that its area is 
(to leading order in Ax, Ap, and dt) 


A= AzAp({1+ eae dt), (15.107) 
ðx Op 


where (t, p) is the velocity vector field. 


This is actually a general mathematical result, independent of how « and ù 
depend on position. That is, it need not have anything to do with physics and 
phase space. But if we now restrict ourselves to motion in phase space, which 
means that it is governed by a Hamiltonian H (x, p), then Hamilton’s equations 
quickly tell us that the two partial derivatives cancel (because « = OH /Op and 
p = —OH/0zx, and because partial derivatives commute), so we end up with 
A = AzxAp. That is, the area remains the same, in agreement with Liouville’s 
theorem. 
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15.7 Exercises 


Section 15.2: Hamilton’s equations 


15.22. 


15.23. 


15.24. 


15.25. 


15.26. 


15.27. 


15.28. 


General potential in 1-D 


Given the Hamiltonian, H = p?/2m + V(x), show that Hamilton’s equations 
are equivalent to F = ma. 


Atwood’s 1 x 


Consider the Atwood’s machine shown in Fig. 15.29. Let x be the vertical posi- 
tion of the right mass, with upward taken to be positive. Find the Hamiltonian 
in terms of x and its conjugate momentum, and then write down Hamilton’s 
equations. 


Atwood’s 2 *x 


Consider the Atwood’s machine shown in Fig. 15.30. Let x and y be the 
vertical positions of the left and right masses, respectively, with upward taken 
to be positive. Find the Hamiltonian in terms of x and y and their conjugate 
momenta, and then write down the four Hamilton’s equations. 


r = k0 spiral ** 


A particle of mass m is constrained to move on a spiral described by r = kð. 
The spiral lies in a horizontal plane. Find the Hamiltonian in terms of 0 and 
its conjugate momentum, and then write down Hamilton’s equations. Show 
that the p = —OH/00 equation reproduces the E-L equation. 


Spring and moving wall x 


A mass m is connected to a wall by a horizontal spring with spring constant 
k and relaxed length o. The wall is arranged to move back and forth with 
position Xwan = Asinwt. Let x be the position of the mass relative to the 
location of the wall at t = 0. Find the Hamiltonian in terms of x and its 
conjugate momentum, and then write down Hamilton’s equations. Is H the 
energy? Is H conserved? (See Problem 15.10 for a slightly harder variation 
of this exercise.) 


Spring and rolling wheel * 


The axle of a wheel of mass m and radius r is connected to a horizontal 
spring with spring constant k, and the spring is connected to a wall, as shown 
in Fig. 15.31. All of the mass of the wheel is on the rim (so its moment of 
inertia is J = mr”), and it rolls without slipping. Let @ be the angle relative 
to equilibrium through which the cylinder has rolled. Find the Hamiltonian 
in terms of 0 and its conjugate momentum, and then write down Hamilton’s 
equations. 


Two masses and a spring * 


Two beads of mass m are connected by a spring (with spring constant k and 
relaxed length /) and are free to move along a frictionless horizontal wire. 
Let their positions are zı and xg. Find the Hamiltonian in terms of x; and 
x2 and their conjugate momenta, and then write down the four Hamilton’s 
equations. (See Problem 15.8 for a slightly harder variation of this exercise.) 


mı 


my 


Figure 15.29 


m 2m 


4m 


Figure 15.30 


Figure 15.31 
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15.29. Atwood with a spring * 


A spring with spring constant k and relaxed length zero is inserted in a stan- 
dard Atwood’s machine to create the setup shown in Fig. 15.32. Let x1 and 
mi x2 be the positions of the two masses (with downward taken to be positive) 
relative to an arbitrary configuration where the spring has no stretch. Find 
the Hamiltonian in terms of xı and x2 and their conjugate momenta, and 
then write down the four Hamilton’s equations. (Assume that the spring is 
always stretched a positive amount.) 


Figure 15.32 


15.30. Bead on a uniformly moving rod * 


A bead is free to move along a frictionless horizontal rod which initially lies 
along the y axis in the horizontal plane. The rod is arranged to move with 
constant speed v in the x direction, and the bead is connected to the origin by 
a spring with spring constant k and relaxed length zero. Find the Hamiltonian 
in terms of y and its conjugate momentum, and then write down Hamilton’s 
equations. Is H the energy? Is H conserved? 


Section 15.5: Phase space, Liouville’s theorem 
15.31. Harmonic oscillator, quarter cycle * 


Consider a harmonic oscillator governed by the Hamiltonian H = p?/2m + 
kx? /2. If a; and p; are the initial position and momentum, then from the 

p results of Chapter 4, we know that x and p at a general later time are given 
by 


wo 
(a, p) = («i cos wt + Pi sin wt, —mw2x;sinwt + pi cos wt) : (15.108) 
Ax mw 


(x0.Po) 
Consider the rectangle shown in Fig. 15.33, with one corner at the point (29, po) 


and sides of lengths Az and Ap. After a time t = 7/2w, what is the image of 


Figure 15.33 this rectangle? Show that the area is still AxAp. 
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15.8 Solutions 


15.1. 


15.2. 


15.3. 


15.4. 


Time dependence 


Since x(q,t) depends on both q and t, we have « = (0x/0q)q + Ox/Ot. So the 
Lagrangian L = m&z?/2 — V (x) becomes 


a (7 Pe (=) (F) a ($) V(«(q.t)). (15.109) 


Therefore, 


2 
_ OL . = Ox Lh Ox Ox 
a ee m (32) a em ($2) (3) 4 ‘ 
2 
_ m(ðr\ z m (dry? | 
2 a) erg (S) +V(x(q,t)). (15.110) 


In view of the expression for the kinetic energy (the term in parentheses) in Eq. 
(15.109), we can write E as 


E=T+V n( (2) (F)a+(F)). (15.111) 


Conservation of E 
Equation (15.1) gives E = (OL/0«)¢@ — L = mt? /2 + V (x), and so 


dE w. (NNa ., a 
eT mee (Z) is (mi H =): (15.112) 


as desired. 


Ignoring the trivial case where & is identically zero,!8 conservation of E is equivalent 


to the statement that më + dV/dx = 0. But the E-L equation is (d/dt)(OL/0z) = 
OL/dxz => më = —dV/dz, so the two statements are indeed equivalent (and also 
equivalent to the F = ma statement). 


Poisson brackets 
Using Eq. (15.33) and the chain rule, we have 


df _— Of. Of. 
dt apo aq? 
_ OfOH Of OH 
by Oe. Be OY (15.113) 


as desired. 


Hamilton’s equations for many variables 
Let’s look at the 0H/Op; derivative first. Using H = (So pe år) — L, we have 


OH(q,p) Əl prila p)) ƏL(a dla p)) 
a ay pe (15.114) 


The arguments (q, p) above are shorthand for (q@,...,qN,p1,---,pN). And likewise 
for the q in the last term. We’ll ignore any possible t dependence, since it wouldn’t 
affect the discussion. 


18This clearly makes for a constant E, although it has nothing to do with the actual physical 
motion, except in the case of constant V(x) and an initial velocity of zero. Hence the word 
“essentially” in the statement of the problem. 
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In the first term on the right-hand side of Eq. (15.114), XO pe x(q, p) depends on 
pi partly because of the factor of p, when k = i, and partly because all the dz 
(possibly) depend on p;. So we have a(o pe Gx(q,P)) /Opi = Gi + >> pe (Ogn/Opi). 
In the second term, L(q, å(a, p)) depends on p; because all the q’s (possibly) depend 
on p;i, so we have 


aL(q -X OL(q, 4 = (q, p) (15.115) 


åk Pi 


But pr = OL(q, ¢)/Ədk, so if we substitute these results into Eq. (15.114), we obtain 
(dropping the (q, p) arguments) 


= (a4 Dott) -Dnit 


= hi, (15.116) 


as desired. 
Let’s now calculate OH /0q:. Using H = (ope år) — L, we have 


dH(q,p) AlE prårlap)) ƏL(q,åla, P)) 
Oa aqi aqi 


(15.117) 


In the first term on the right-hand side, X pr dk(q, p) depends on q; because all 
the q’s (possibly) depend on qi, so we have a(S pe år (q, p)) /3qi = Yo prôår/ðqi. 
In the second term, L(q, lq, p)) depends on q; partly because of the (possible) q; 
dependence in the first argument, and partly because all the q’s (possibly) depend 
on qi. So we have 


dL(4,4(4,P)) _ AL(q, å) >> OL (4,4) åk (q, p) 
aqi aqi 3k OG 


(15.118) 


As above, we can use pr = OL(q, q)/Odx in the second term here. But also, for the 
first term, the Euler-Lagrange equation for the ith coordinate (which holds, because 
we’re looking at the actual classical motion of the particle) tells us that 


d ( OL(4,4)\ _ Lla å) . _ OL(q,4) 
+ ( T Ba Di ane (15.119) 


If we substitute these results into Eq. (15.117), we obtain (dropping the (q, p) argu- 


ments) 
OH OG: |. dk 
E (n+ Dade) 


S he (15.120) 


as desired. Putting everything together, we have the 2N Hamilton’s equations: 


Ss r a, 2 OH 
qi = Op; , Pi = Ogi , 


for 1<i<N. (15.121) 


Equivalent Lagrangians 
FIRST SOLUTION: With L’ = L+dF/dt, our goal is to show that the E-L equation, 


a (F ) ae (15.122) 


Og ðq ’ 
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is exactly the same equation as the original E-L equation, 
d (ƏL OL 
—|—)=—. 15.123 
dt ( Og ) ôq ( ) 
We can demonstrate this by simply plugging L’ = L+dF/dt into Eq. (15.122). First 


note that the chain rule gives 


dF(q,t) _ OF(q,t) we OF (q,t) 
dt ôq g ðt ` 


(15.124) 


(The possible (OF /0q)g term is missing due to the assumption of no ġ dependence.) 
So we have (0/0q)(dF'/dt) = OF /Oq. Using this in Eq. (15.122) gives 


d (22 +dF/dt) ) _ Ə(L+dF/dt) 


dt oå Oq 
d (ðL OF 0 dF 
$ & ' E) a (z ! x): (15.125) 


This equation is the same as Eq. (15.123) if the terms involving F' cancel, that is, if 


d ( OF(q,t) Ə ( dF(qt) 
= . 15.12 
dt ( ôq ôq dt (igalag) 
Using the chain rule on the left side and substituting Eq. (15.124) into the paren- 
theses on the right, this is equivalent to 


PF. F F., F 
ag 1 toqg Og lT dqdt’ 


(15.127) 


which is indeed true, due to the commutativity of partial derivatives. 


SECOND SOLUTION: A general function F(q,t) is the sum of terms of the form 
f(q,t) = Aq”t”. So it will suffice to show explicitly that the function f(q,t) = 
Aq"t™ yields no effect on the equation of motion. This is straightforward but 
tedious. First, we have d(Aq"t’)/dt = Ang”~'qt™ + Aq’mt™—'. The new E-L 
equation, 


d ({ O0(L+df/dt)\ | 0(L + df/dt) 
T ( RE ) Da ; (15.128) 


is the same as the old E-L equation, (d/dt)(OL/0q) = ƏL/ðq, if the terms involving 
f cancel, that is, if 


d ( (df /dt) a(df /dt) 
= 15.12 
dt ( ôq ôq (19:120) 
d (Ə(Ang tit” + Aq mt™ t) = lAng åt” + Aq’mt™—") 
dt Og ôq 
nm—lym 
aang” ik ee An(n yg" 200" + Ang” 'mt™ 1, 


which is indeed true, as you can quickly verify. 


THIRD SOLUTION: The above two mathematical solutions probably weren’t too 
enlightening, so we’ll present a solution here that actually gets to the underlying 
physics of why L’ yields the same E-L equation as L. The E-L equation is derived 
from the principle of stationary action, so it might behoove us to back up and start 
our reasoning there. 
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15.6. 


15.7. 


The critical point to realize is that the addition of dF(q,t)/dt doesn’t affect the 
stationary property of the action, because the action for the new Lagrangian is 


a L' dt = T (L+dF/dt) dt = m Ldt+ (F (alts), t2)-F (at1)+t1)); (15.130) 


tı ty ty 


and the last terms here involving F don’t change when we vary the path q(t), because 
it is understood that the variation vanishes at the boundary points.'® So we have 
effectively added on a constant (assuming tı and tz are given), as far as the action 
is concerned. Therefore, since L and L’ differ by a constant, if a given path yields 
a stationary action for L, then it also yields a stationary action for L’. In other 
words, L and L’ describe the same system. And since the E-L equation is derived 
from the principle of stationary action, the path that satisfies the E-L equation for 
L also satisfies the E-L equation for L’. 


Atwood’s 1 
If the left mass moves up by x, then the right mass moves down by x, so the 
Lagrangian is 


1 1 
L= gmi? + gmi? — (mı — ma)a. (15.131) 


The momentum conjugate to x is p = OL/0x = (mı + m2)#, so the Hamiltonian is 


H=pi-L = =(mı + mə) + (mı — mə)x 


= + (mi — ma)a. (15.132) 


Hamilton’s equations are then 


ices é p 
Op (mı + m2) 
oH 
T p=—(m1 — m2)g. (15.133) 


The first of these simply reproduces the definition of p, while the second is identical 
to the Euler-Lagrange equation, 


d (OL OL dp OL f 
dt (sz) Ox dt Oz p (mi — m2)g. (15.134) 
Atwood’s 2 


If the right two masses move up by x and y, then the left mass moves down by 
(x + y)/2. This follows from conservation of string, because the average height 
of the right two masses always remains the same distance below the center of the 
right pulley, and the displacement of the left mass is equal and opposite to the 
displacement of the right pulley. The Lagrangian is therefore 


hu a Troarn i 2 (222) 
L = 5 (2m) (ASB) 4 smi? + 5(2m)p? + (2m)g (7) - mgs — (2m)gy 
od B 
go? + gmit + amy” — mgy. (15.135) 


The conjugate momenta are 


Pz = oe To 


ƏL 3mz my and 3 OL mi , dmy 
y = . 


a (15.136) 


19These terms depend on tı and t2, of course, but the point is that they don’t depend on the 
path, because q(t1) is assumed to be the same for all paths, and likewise for q(t2). 
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Inverting these to solve for t and y gives 
EN eer A Ge go ea) (15.137) 
t= Tm Px — Py an Y= Tm Px Py). . 


15.8. 


Using these expressions to eliminate t and y in favor of pz and py, you can verify 
that the Hamiltonian is 


H =psġ+pġ-L = Jm?’ 4 amy mg? + mgy 
ee ea RE 
Ip ŠP» — 2PzPy + 3p,) +mgy. (15.138) 
The four Hamilton’s equations are then 
OH 1 
; aoe l 
z Ope x Fm. P. Py) 
. OH , 
Pa Ox Pz =0, 
OH 1 
j j z + 3py), 
oe op, Y= 7 (Pe + Bpy) 
; OH : 
Py = ay Py = —mg (15.139) 


The first and third of these simply reproduce Eq. (15.137), while the second and 
fourth are identical to the Euler-Lagrange equations, 


dpr ƏL 


d ( OL ) i 
dt \ ðt Ox dt Ox Px , 

d (OL OL dpy ðL 

dt ( Oy ) Oy dt ay Py mg (15.140) 


REMARKS: The conservation of pz is a consequence of the fact that x is a cyclic coordinate, 
which in turn is a consequence of our specific choice of the masses. For randomly chosen 
masses, in general neither coordinate is cyclic. 

Using the pz in Eq. (15.136), ps = 0 implies that 7 = —3% at all times. That is, the 
downward acceleration of the right mass is three times the upward acceleration of the 
middle mass. If we plug this relation into the fourth of Hamilton’s equations, which says 
that (using the py in Eq. (15.136)) mz#/2 + 5mij/2 = —mg, we find that the accelerations 
of the right two masses are ë = g/7 and 4 = —3g/7. And then the acceleration of the left 
mass is the negative of the average of these, which equals g/7. If you wish, you can verify 
these accelerations by solving this problem with F = ma. 


If you want to demonstrate how the Hamiltonian method can be monumentally more cum- 
bersome than the Lagrangian method, you can try to solve this problem in the case of three 
general masses, m1, m2, m3. & 
Two masses and a spring 


The position of the right mass is x + z, so the Lagrangian is 


1 
L = sma” + sm(é 4 2)? she (15.141) 
The conjugate momenta are 
Dx = bL = Wma + mz and pz- = ðL =m(t+ż) (15.142) 
ðt O) 
Inverting these to solve for t and ż gives 
Oe. ath Aal edo (15.143) 
es Px — Pz = ae Px Pz : 
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Using these expressions to eliminate t and z in favor of pz and pz, you can verify 
that the Hamiltonian is 


H = prt +pz-ż- L = sma? Sm(è + 3)* + She? 
A 2), 1,2 

= Z — Prpz + pz ) + =k 15.144 

( 3 T Peps +P ) 5 kz ( ) 


The four Hamilton’s equations are then 


goer pad (pz — pz), 
pz 
ie = - 8 be = 0, 
z z = Px + 2pz), 
p- = a p- = —kz. (15.145) 


The first and third of these simply reproduce Eq. (15.143), while the second and 
fourth are identical to the Euler-Lagrange equations, 


d & aL dp, OL feats 

di\ ðr) Ox dt Oa ii 

d/ðL\ ƏL dp, ƏL , 

di ( az ) dz di dz Pee, (15.146) 


The pz equation is the statement that the total momentum of the system is con- 
served, because pr = mt+m(#+2) = mx1+mé4e2, where xı and x2 are the positions 
of the two masses. The pz = —kz equation is equivalent to (d/dt)(ma2) = —kz, 
which is simply the F = ma statement for the right mass. 


15.9. y = f(x) constraint 


The horizontal speed is %, and the vertical speed is times the slope, or x- f'(x). 
So the Lagrangian is (with f and f’ written as shorthand for f(x) and f’(x)) 


L=<m(1+ f?)#? — mgf. (15.147) 


The momentum conjugate to x is p = OL/0% = m(1 + f’”)&, so the Hamiltonian is 


H=pr-L = ima + FPE? + mgf 
p 
Hamilton’s equations are then 
ha k é £ 
dp m(1+ f?) ’ 
; ðH i PEI ; 
p Fa fA ml + f22 mgf'. (15.149) 


The first of these simply reproduces the definition of p. The second one can be 
rewritten as 


d ie. mOFE” , 
dt (m(1 +f *) a) ( aCe a mgf 
= “(a +P) = PrP- gf. (15.150) 


15.8. 


15.10. 


SOLUTIONS XV-45 


This is the same are the E-L equation, namely 
d (=) _ OL d 
dt \ Oz Ox dt 


(a +EP) = FFP- of’. (15.151) 


REMARK: Note that if we simplify Eq. (15.151), it becomes 
GESPEL EE = PP- gf 
(+ f?)a+ gf = 0. (15.152) 
This is just a disguised form of conservation of energy, because in view of the energy given 


by the first form of the Hamiltonian in Eq. (15.148), the conservation-of-energy statement 
is 


d 
Oy SS (Gna + f!?) a? 4 maf) 
0 = (14+ f?)ae+ (f f i)i? + oft 
0 = (+ f?)et+ fl fa? + gf’. (15.153) 


It is no surprise that the E-L equation (or Hamilton’s second equation) is equivalent to the 
conservation-of-energy statement, because the E-L equation was used in the derivation of 
energy conservation back in Chapter 6 (see Eq. (6.53)). Alternatively, we originally derived 
conservation of E by using F = ma back in Chapter 5 (see the beginning of Section 5.1), 
and we know that the E-L equation contains the same information as F = ma. & 


Spring and moving wall 
The position of the mass with respect to the location of the wall at t = 0 (which 


we'll arbitrarily take as our origin) is Zo + z + Xwan, because the definition of z 
implies that Zo + z is the position with respect to the wall. So the velocity of the 


mass is 2+ Xwan = 2+ Awcoswt. The Lagrangian is therefore 
L= imë + Aw cos wt)? — ake’. (15.154) 


The momentum conjugate to z is p = OL/0z = m(z+ Awcoswt), which can be 
inverted to give 2 = p/m — Aw coswt. So the Hamiltonian is 


2 
H =pż-L = p(2 Aw coswt) — Po lyp 
m 2m 2 
= Pg — p(Aw cos wt) + Lie (15.155) 
T m P?” gen 
Hamilton’s equations are then 
Zz on ¢= 2 — Awcos wt, 

Op m 
OH 

i= j = —kz. 15.1 

D Dz D Zz (15.156) 


The first of these simply reproduces the definition of p, while the second is identical 
to the Euler-Lagrange equation, 
d ( OL ) OL dp OL 
dt \ 02 Oz dt Oz 


p=—kz. (15.157) 


This is simply the F = ma statement, because the force on the mass is —kz (and 
p is indeed the actual linear momentum). If we use p = m(z + Awcoswt) to write 
everything in terms of z, the E-L equation becomes 


mz — mAw" sinwt = —kz mi + kz = mAw” sinwt, (15.158) 


which looks like the equation for a driven undamped oscillator. But note that z 
represents the stretch of the spring, and not the actual position of the mass. 
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The Hamiltonian in Eq. (15.155) is not the energy, because the relation between the 
stretch z and the Cartesian coordinate x is x = z+Xwan (plus a constant, depending 
on the choice of origin). And since Xwan = Asinwt, the relation between z and x 
involves t, thereby causing H to not be the energy, by Theorem 15.1. And H is not 
conserved, because there is explicit t dependence in L. 


REMARK: H isn’t conserved, and it isn’t the energy. So might the energy in fact be 
conserved? Let’s see. Since p is the actual linear momentum, the energy of the spring-plus- 
mass system is E = p?/2m + kz? /2, so we have 


dE d 2 kz? 
= (Z pe ) = P ý+ kzż. (15.159) 
m 


dt dt \ 2m 2 
Using the definition of p and the second of Hamilton’s equations to rewrite p and p, respec- 
tively, this becomes 
dE 
dt 


Since this is not identically equal to zero, the energy is not conserved. 


= (4+ Awcoswt)(—kz) + kzz = (—kz)(Aw cos wt). (15.160) 


The result in Eq. (15.160) makes sense, because it is the rate at which work is done (that 
is, the power P) on the end of the spring that is attached to the wall. This is true because 


p- dW _ pAXwan 
dt dt 


The force that the wall applies to the spring is F = —kz here, due to Newton’s third law 


= (—kz)(Aw cos wt) . (15.161) 


and the fact that the spring applies a force of +kz to the wall. & 


Bead on a rotating hoop 


The velocity components along the hoop and perpendicular to it (in the direction 
perpendicular to the plane of the page) are RO and (Rsin@)w. And the potential 
energy relative to the center is —mgR cos 0. So the Lagrangian is 


L= pm + R?w* sin? 0) + mgR cos 6. (15.162) 


The momentum conjugate to 0 is p= OL /a0 = mR?6, so the Hamiltonian is 


H=p0-L = 5m R60? — R’w sin? 0) — mgRcos 6 
p? 1 
= mR sR sin? 0 — mgRcos9. (15.163) 
Hamilton’s equations are then 
. OH . p 
0 0 
Op mR2’ 

‘ OH 22. r 
b=- p = mR w° sind cos 0 — mgRsin 0. (15.164) 


The first of these simply reproduces the definition of p, while the second is identical 
to the Euler-Lagrange equation, 


4 (5) a “ a p = mR?w’ sin 6 cos6—mgRsin@. (15.165) 
Using the definition of p, this can be written as mR?6 = mR?w*sin@cosé — 


mgRsin@. See Problem 6.11 for the interesting implications of this equation of 
motion. 

The Hamiltonian in Eq. (15.163) is not the energy, because the Cartesian coordinates 
(x,y) in the horizontal plane are related to 0 by (x,y) = Rsin @(coswt,sinwt), up 
to a phase. Since this relation involves t, the Hamiltonian is not the energy. But H 
is in fact conserved, because there is no t dependence in L. 
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REMARK: The Hamiltonian in Eq. (15.163) differs from the energy due to the minus sign 
in the second term. So this means that the energy equals H + mR?w? sin? 6. But as noted 
above, H is conserved. So the energy takes the form of a constant plus mR2w? sin? 0. That 
is, the energy is larger when mass is farther away from the axis of rotation. Where does 
this additional energy come from? It comes from the work that the hoop does on the bead, 
due to the normal force perpendicular to the plane of the hoop. We can be quantitative 
about this as follows. 

To find the normal force, note that the angular momentum around the vertical axis is given 
by m(Rsin0)?w. The time derivative of this (which equals the torque) is 


T = 2mR?w sin 0 cos 6 6 = (Rsin 0)(2mRw cos 06). (15.166) 


Since the “lever arm” is Rsin@, the normal force must be 2mRw cos 6 6. (You can also 
derive this by using the fact that the normal force must balance the Coriolis force in the 
rotating frame.) 


In a small time dt, the mass moves a distance (Rsin @)(w dt) horizontally, so the work done 
by the normal force during this time is (2mRw cos 0 0)(R sin Ow dt). The total work done by 
the time the mass reaches an angle 0 is therefore 


0 
J (2mRw cos 6(d0/dt)) (Rsin w dt) = f 2mR?w? sin 0 cos 0 dð = mR? w? sin? 0, 
0 
(15.167) 


which agrees with the additional energy we found above. & 
15.12. F(x) = zx” 
We have s = F’(x) = na", so x(s) = (s/n)'/"—). The Legendre transform of 


F(a) is then 
s \ 1/(n—1) s\ 1/(n-1) \ ” 
J H 
CLG ii aie 
= n {| — . — = a 
n n n 


S 


= (n-1) a . (15.168) 


n 


G(s) = sx(s) — F(a(s)) = 


w 
Pa 
| 


F(x) 
(xo, Fx0)) 
N 


Therefore, dG(s)/ds = (s/n)'/“"~)), which equals x, as expected. 
15.13. Alternate definition of G(s) 


For a given value of s, consider the line with slope s that passes through the point 
(ao, F(xo)), as shown in Fig. 15.34. From the reasoning in the first paragraph of 
Section 15.3.2, the negative of the y intercept of this line equals sap — F(x). (So 


slope=s 


Xo 


szo — F (xo) is the distance below the x axis, as shown.) Now imagine varying the sxo-F (xo) 

value of xo (while keeping s fixed) to obtain the series of lines shown in Fig. 15.35. 

(Note that s is not the slope of the lines at any of the intersection points in this Figure 15.34 
graph, except for the lowest line.) We see that the maximum value of sz — F(x) 

is achieved in the case of the lowest line which is tangent to the F(x) curve. (If F(x) 


we lower the line any farther, it won’t intersect with F(a).) Hence, the present 
definition of the Legendre transform (where G(s) is defined to be the maximum 
value of sx — F(x)) agrees with the definition in the text (where G(s) was defined 
to be the negative y intercept of the tangent line with slope s). Slope=s 


15.14. Equal G(s) values 


The value of G(s(zx)) as a function of x is F’(x)- a — F(x). We are given that this 
function has the same value for two different values of x. Call them x; and z2. The 
mean value theorem then tells us that there must be a point between zı and x2 


x 


where the derivative of F'(x) -x — F(x) is zero (in other words, F’(x) - x — F(x) Figure 15.35 
achieves a max or min somewhere between x; and x2). So there must be an x for j 
which 


d(F’(a) ‘2 F(a)) 


0 as, 
dx 
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15.15. 


15.16. 


= (F" (x) -£+ F'(zx)) — F'(x) 
= F"(2)-2. (15.169) 


Since we are assuming that xı and x2 have the same sign, the factor of x here can’t 
be zero. So we are left with F’’(x) = 0 as the only option, which means that there 
exists an inflection point. 

It’s possible to solve this problem by not calculating the derivative in Eq. (15.169) 
and instead by just looking at all the possible type of graphs (one of which is shown 
in Fig. 15.8). But there are a number of cases to consider (depending on which 
tangent line gets hit first, and whether F(a) is above or below each tangent line), 
and it’s hard to be sure you’ve covered them all. 


REMARK: The x = 0 root of Eq. (15.169) is relevant if we remove the restriction that 
the two «’s have the same sign. If you imagine increasing the value of x in Fig. 15.7, the 
tangent line “rolls” on F(x) as it migrates from slope sı to s2. The highest y intercept 
(which equals the negative of F' (æ) -x — F(x)) occurs when the tangent line is tangent to 
F(x) at x = 0 (because every other tangent line has a y intercept below this point). So the 
minimum value of F’(«x) -x — F(x) is achieved at x = 0, which means that its derivative is 
zero at x = 0, consistent with Eq. (15.169). & 


Double-valued G(s) 

Let x; and x2 be the associated values of x where the lines of slope s are tangent 
to F(x). Since zı # x2 (because F(x) is assumed to be well defined), we see that 
two different values of x yield the same slope. Consider the slope as a function of 
x. Because the slopes are equal at xı and x2, the mean value theorem tells us that 
there must be a point between x1 and x2 where the derivative of the slope is zero. 
In other words, the second derivative of F(x) is zero, as we wanted to show. 


Straight lines 
We claim that straight lines remain straight if and only if the velocity vector (q, p) 
is at most a linear function of the coordinates, that is, if it takes the form, 


v(q,p) = (àla, p) b(a, p)) = (a1 + a2q+ asp, bi + bog + bsp). (15.170) 


The constant (a1, 61) part of this velocity is irrelevant, because it corresponds to a 
uniform shift of all points in the plane, which doesn’t affect the straightness of lines. 
So we’ll ignore it. The velocity is then v(q,p) = (a2q + asp, beq + bsp). With this 
form, you can quickly check that (with r; = (qi, pi)) 


v(ri + re) = v(ri) + v(r2), and v(kr) = kv(r). (15.171) 


Consider now two points in phase space, r and r+a. Any other point lying on 
the line determined by these two points takes the form of r+ ka, where k is some 
number. 

Our strategy will be to show that after a small time dt, the points r, r+ a, and 
r+ ka still lie on a line (with k being an arbitrary number, thus accounting for the 
whole line). And since any flow in phase space can be built up from many little 
steps of time dt, it then follows that a line remains a line for all times if the velocity 
takes the form in Eq. (15.170). Our three points end up at 


r — r+v(r)dt, (15.172) 
r+a r+a+v(r+a)dt =r +a + v(r)dt + v(a)dt, 
r+ka — r+ka+v(r+ka)dt =r+ka+v(r)dt + kv(a)dt. 


Regrouping, the three new points are 
(r + v(r)dt), 
(r + v(r)dt) + (a + v(a)dt), 
(r + v(r)dt) + k(a + v(a)dt). (15.173) 
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These have the same general form (where the difference between the first and third is 
a multiple of the difference between the first and second) as the three initial points, 
so they do indeed lie on a line, as we wanted to show. 

Conversely, if the velocity is not of the form in Eq. (15.170), then the relations in 
Eq. (15.171) fail to hold for at least some positions, causing the final positions to 
not take the form in Eq. 15.173. 

Interestingly, for 1-D Hamiltonians of the standard form, p*/2m + V (q), the only 
potentials that lead to phase-space velocities of the form in Eq. (15.170) are V(q) = 
C, V(q) x q, and V(q) x q’, which exactly correspond to the “Constant velocity,” 
“Falling balls,” and “Harmonic oscillator” examples in Section 15.5.1. So, although 
these three examples might have led you to the inductive conclusion that straight 
lines remain straight in any setup, these examples are actually the only ones (with 
Hamiltonians of the form p?/2m + V(q)) for which this is true. 


No branching 

First, note that Hamilton’s equations uniquely determine the velocity vector (ġ, p) = 
(OH/Op, —ƏH/ðq) at a given point (q,p). So the two paths must have the same 
velocity at the branching point if there is any chance of a branch actually existing. 
However, this argument also applies to all higher derivatives. For example, taking 
the derivative of ġ = OH/Op yields g in terms of q and p, and also g and p by 
the chain rule. But ġ and p can be rewritten in terms of q and p via the previous 
knowledge of the velocity vector (q,p). Likewise for p. So the second derivatives 
can be written in terms of only q and p (that is, no derivatives are needed), and are 
therefore uniquely determined by a given point (q,p). In this manner, we can work 
our way up to all higher derivatives, expressing them in terms of only q and p. 
Therefore, at the branching point the derivatives of q and p to all orders for one 
path must be equal to the corresponding derivatives for the other path. But if two 
functions have their derivatives equal to all orders, they must actually be the same 
function. Hence, the two paths are actually the same path, and there is in fact no 
branching.?° This reasoning also implies that paths can’t merge, because we could 
imagine time running backwards, in which case we would end up with a fork. 


Liouville for higher N 
In the same manner that we obtained Eq. (15.94) in the proof for the N = 1 case 


in the text, we obtain 
dV 
— = | v-dA. (15.174) 
dt 7 
Here V is the 2N-dimensional “volume” enclosed by the (2N—1)-dimensional surface 
S. The vector dA points perpendicular to the surface and has its magnitude equal 
to the area of a small (2N — 1)-dimensional patch. The velocity vector v is the 
2N-dimensional vector (41,1, G2, p2, ..-, QN, PN). Combining Eq. (15.174) with the 


divergence theorem, 
i vevav= f vada, (15.175) 
Vv s 


gives 
dt = 
ð ə o o 
= ; RA ; - (G1, B1,---,4n, Dn) dV 
On , Opi Ogn , ODN 
= par: H dV. 15.176 
Í & Opi i Oqn ax) ( ) 


20 There are some classic pathological examples of unequal mathematical functions whose deriva- 


tives agree to all orders at a given point (for example, y = 0 and y = e 


2 
Ve" at g = 0), but we 


won’t worry about such things here. 
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But the 2N Hamilton’s equations are 


di = a : and pi = 2 A for 1<i< N. (15.177) 
Plugging these into Eq. (15.176) gives 
WV f PH H \  _ f OB PH i 
dt v \\On0p1 pð) \OqnOpn pnôqgN 
= i (15.178) 


where we have used the fact that partial derivatives commute. 


15.19. Harmonic oscillator 
The four corners of the initial rectangle have coordinates (zo, po), (£o + Az, po), 
(xo, po + Ap), and (zo + Az, po + Ap). Using Eq. (15.105), these four corners get 
mapped into the points, respectively (with S = sin wt and C = coswt), 


(z,p)ı = (zC Po S, -muros 4 PoC), (15.179) 
mw 
(x, p) = (o + Ax)C 4 cal S, —mw(ao + Ax)S 4 PoC), 
(x,p) = (zC | DORE g mwaoS + (po 4 Ap)C). 
mw 
(t,p)4 = (o + Ax)C 4 ats mw(zo + Ax)S + (po 4 Ap)C). 


(z,p)1 = (0,0), 
(z,p)2 = (^z C, —mwAx 5) ; 
(z,p)3 = (2s, ApC), 
mw 
(z,pa = (Acc + 2 g, —mwAr $+ Ap c). (15.180) 


These points form the vertices of a parallelogram, as shown in Fig. 15.36. The area 


mo sinot (x,y), 
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Figure 15.36 
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of this parallelogram equals the area of the dotted rectangle minus the area of the 
four triangles, so we have 


A 


(Ac C+ Ars) (muds Si Ap c) 
mw 


-2 (SAC: modes + LA? g, ApC) 
2 2 mw 


AxC . ApC + Any -mwAr S (only the cross terms survive) 
mw 


= ArAp(C? +8?) 
= AzdAp, (15.181) 


as desired. This result for the harmonic oscillator holds for a rectangle of any size, 
so in particular it holds for an infinitesimal rectangle. And since any region of 
arbitrary shape can be built up from infinitesimal rectangles, we see that the area 
of any region remains constant as it flows (and invariably distorts) through phase 
space. 


Harmonic oscillator, easier method ** 


If we write the time development in matrix form, we have 


(33 )-(S BS) CR) e 


This is the familiar matrix that represents a clockwise rotation in the plane through 
an angle wt. So all points in the z-p plane move in circles with the same frequency 
w, which means that any given initial region Ri simply rotates around the origin in 
the z-p plane, as shown in Fig. 15.37. Therefore, since every region keeps its same 
shape as it rotates in the plane, areas are conserved in the z-p plane. 


But what about the x-p plane? To see how regions evolve in this plane, we simply 
have to shrink the z-p plane by a factor of 1/mw on the horizontal axis (because 
x = z/mw), while keeping the vertical axis the same. Fig. 15.38 shows the result for 
the case where mw = 2 (in appropriate units). The transformation in the x-p plane 
doesn’t represent a rotation anymore (the shape of the region clearly gets distorted). 
But the area does remain constant, because the uniform horizontal scaling by the 
factor 1/mw changes the areas of both Ri and R(t) by this same factor. (Imagine 
slicing each region up into many thin horizontal rectangles; every rectangle has its 
area decreased by the factor 1/mw, so the same result must be true for the entire 
region.) And since we showed above that the areas are equal in the z-p plane, they 
must therefore also be equal in the x-p plane. 


Liuoville’s theorem 


If every point moves through the plane with the same velocity, then the initial 
rectangle simply gets translated into another rectangle, and the area trivially remains 
the same. However, if the velocities differ throughout the plane, then the area need 
not remain the same. For example, if points on the right side of the rectangle have 
a larger t than points on the left, then the rectangle gets stretched, and the area 
increases (due to this effect, at least). 


Let’s be quantitative about this. From the definition of the partial derivative of t, 
the value of & at point 2 in Fig. 15.39 is larger than the value at point 1 by (0@/0xr)Ax 
(to leading order in Ax). So after an infinitesimal time dt, the difference in the x 
values of points 1 and 2 has grown by (0%/0x)Az dt compared with what it was 
initially (which was Ax). The difference in the x values at time dt is therefore Ax + 
(0z/0x)Azx dt. Likewise, the difference in the p values at time dt is 0+(0p/0x)Az dt. 
By analogous reasoning (or by simply switching x and p), The difference in the p 
values of points 1 and 3 at time dt is Ap + (0p/Op)Ap dt, and the difference in the 
x values is 0 + (04%/Op)Ap dt. 
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Figure 15.40 


Finally (continuing to use the definition of the partial derivative), the difference in 
the x values of points 1 and 4 at time dt is Ax + (04%/0x) Ax dt + (0%/Op)Ap dt, and 
the difference in the p values is Ap + (Op/Op)Ap dt + (Op/Ox) Az dt. We therefore 
end up with the parallelogram shown in Fig. 15.40. 

One method of finding the area of this parallelogram is to calculate the area of 
the “circumscribing” rectangle and then subtract off the area of the four triangles 
shown in Fig. 15.40, along with the two tiny rectangles at the upper left and lower 
right corners. To first order in dt, this does indeed yield the desired area for the 
parallelogram, given in Eq. (15.107), as you can check. (To keep things from getting 
too messy, you should immediately drop all terms of order dt?. In particular, the 
two tiny rectangles are irrelevant.) But let’s find the area in a nicer geometric way. 
Consider the situation shown in Fig. 15.41. To more easily compare the initial and 
final areas, we have translated the parallelogram 1,2,3,4 (which doesn’t affect its 
area, of course) so that its lower left vertex coincides with the lower left corner 
of the original rectangle A, B,C, D. We can now imagine chopping off the lightly- 
shaded triangle from the parallelogram shown on the right side of the figure, and 
pasting into the left side of the figure, as shown. Likewise with the darker triangle. 
We therefore see that the area of the parallelogram equals the area of the rectangle 
A’, B’,C’, D’. You might object that we have double counted the region where the 
shaded triangles overlap in the upper right part of the figure. However, using the 


Figure 15.41 
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lengths shown in Fig. 15.40, the area of this region is of order ArAp dt?, so it can 
be ignored due to the dé?. 
Our task therefore reduces to finding the area of rectangle A’, B’,C’, D’. This area 
equals the area of the original rectangle A, B,C, D (which is AxvAp) plus the areas 
of the two striped rectangles. (We are ignoring the tiny rectangle in the upper right 
corner because its area is of order dt?.) Using the lengths shown, the sum of the 
areas of these two rectangles is 

Ap- t Ardit + Ar. OP Apd = Reap (2-2) dt. (15.183) 

Ox Op Ox Op 


Adding on the area AxvAy of rectangle A, B,C, D, we see that (to leading order) 
the area of rectangle A’, B’,C’, D’ (which is essentially the same as the area of the 


parallelogram) equals 
ðt Op 
AzAp (: + (= + 2) at) $ (15.184) 


as desired. Note that in the case of Hamiltonian flow in phase space, we know that 
the two partial-derivative terms sum to zero, so one of them must be negative. This 
means that one of the striped rectangles in Fig. 15.41 actually has negative area. 
That is, B’ and D’ lie to the left of B and D, or C’ and D’ lie below C and D. 


Corrections to 
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With Problems and Solutions 


All printings up to 2013 
David Morin 


Please email morin@physics.harvard.edu if you find any errors. 


The corrections below are listed by page number. They are grouped into three categories: 
(1) Important errors that will cause confusion, (2) minor errors that might cause confusion, 
and (3) trivial errors that should not cause confusion. Additional clarifications are listed at 
the end. 


Important errors: 


88: 


188: 


Solution to Problem 3.7: The first sentence of the second paragraph states, “Because 
the magnitudes of the friction force and the gravitational force along the plane are 
equal, the acceleration along the direction of motion equals the negative of the accel- 
eration in the direction down the plane.” Although this statement is correct, more 
explanation is required. This issue is that these two directions aren’t orthogonal, so 
the accelerations in the two directions aren’t independent. 


The correct justification is given in the figure below. At a given time t, let the velocity 
be Vola, as shown. A small time At later, the velocity is Vnew. The gravitational and 
frictional forces (both with magnitude mg sin 0) are indicated, along with the net force 
obtained by completing the parallelogram (which is a rhombus in this case) formed 
by the two individual forces and the dotted lines shown. Since we are drawing things 
alongside the velocity vectors, we have actually drawn the changes in velocity acquired 
during the small time At. That is, the total acceleration is anet = Fnet/m, so the 
total change in the velocity vector is ane, At = Fret At/m, as shown. 


A 


decrease in v 


4 M tip of Vog vector 
increase in v, 


direction down 


the plane — Foray Ad/m 


F,.Ad/m 


In going from Vola tO Vnew, the increase in vy is indicated in the figure (positive 
vy points down the plane). Additionally, since At is assumed to be small, Vnew is 
essentially parallel to Voja. So the length AB in the figure is essentially equal to 
the length AC. The length of v (that is, v = ,/v2+ v2) therefore decreases by 
the amount indicated. But since the parallelogram formed by the individual forces 
is in fact a rhombus, the decrease in v equals the increase in vy. The sum v + vy 
is therefore constant throughout the motion, as claimed. The rest of the solution 
proceeds as stated. 


Exercise 5.48: Modify part (b) of this problem to read: “(b) the ball is not allowed 
to touch the pipe, but assume that it skims the top of the pipe.” Without this extra 


427: 


484: 


condition, the problem is more difficult, and it turns out (as you are encouraged to 
show) that the minimum-speed trajectory actually skims the pipe at points 45° down 
from the top, as opposed to skimming it at the top. 


Exercise 9.55: The level of this exercise should be increased to four stars, and the 
latter part should now read: 


of radius R/2. Assume that the coefficient of friction between the ball and the 
cone is arbitrarily large. What is the frequency of precession, Q? Show that 
the condition on £ for the setup to be possible is £ > (3v3/4)R. If we instead 
have a solid ball with I = (2/5)mR?, find Q and show that the condition on £ 
is (5V/3/2)R > £ > (5\/3/8)R. What about a general J = GmR?? There is a 
special value of 8; what is it, and why is it special? 


Exercise 10.24 should be replaced with: 
At a polar angle 6, a projectile is fired eastward with speed vp at an angle a 
above the ground. Show that the southward (in the northern hemisphere) and 
eastward deflections due to the Coriolis force are (to first order in w) 
dsouth = (4wug/g?) cos@cosasin? a, 
deast = (4wve/g?) sin 6( cos” asin a — (1/3) sin? a)) : 


Hint: The first term in deast arises because the flight time is modified due to the 
vertical component of the Coriolis force. 


Minor errors: 


237. 


259: 
287: 
314: 


319: 


349: 


364: 


541. 


588: 


590: 


In Eq. (6.59) we’re concerned with infinitesimal e, so the derivative dL/de is being 
evaluated at e = 0. So it should read “0 = th | <0 =...” The other terms throughout 
the proof should technically also have the “|e=0” restriction. 


6th line on page: “numerically using Eq. (6.125)” 
2nd sentence of Section 7.4.1: The order of Mo and m should be reversed. 


2nd line of 2nd paragraph of the Remarks: The square root should be removed. So it 
should read, “a = gsin6/(1+ 8)” 


Eq. (8.27): After the second “=” sign the exponent should be 4, not 2. So it should 
read, “(R*/4) px” 


3rd line of solution to Problem 8.1: The second “2” should not be there. So it should 
read, “—2mgd + mgd = —mgd.” 


Eq. (8.124): There is an extra factor of Vo in the expression for w; the Vo in front of 
the fraction should be removed. 


Eq. (11.72): The upper-right entry in the matrix on the righthand side should be 
V1 + V2. 


2nd line of Remark 3: We haven’t dropped the c’s at this point, so it should read, 
“E = ym?” 


Last line on page: Again, we haven’t dropped the c’s yet, so it should read, “E? — 


|p? e = m2” 


Trivial errors: 


259: 8th line on page: “in the first case” 

320: Eq. (8.30): There should be a dé in the last integral in the first line. 
343: Fig. 8.51: The m’s should be capital M’s. 

868: Fig. 8.80: The V should be a Vo. 


Clarifications: 


72: Problem 3.13: In this pencil-balancing problem, we’re assuming that the rod is in- 
finitely thin, or equivalently that the tip is infinitely sharp. This is therefore a hypo- 
thetic problem. If you want to get realistic and assume that the tip is at least as wide 
as an atom, then things get very tricky: Is the bottom atom flat, in which case the 
pencil has to extract itself from a potential well? Or is the atom round, in which case 
there is no potential well? And what are the forces between the atom and the one(s) 
it touches on the table? Tricky indeed. 


336: 4th line of Problem 8.14: Clarification: “free to rotate about its fixed axis” 


